Microsoft researchers have developed On-Policy Context Distillation (OPCD), a training method that permanently embeds ...
Anthropic accused DeepSeek, Moonshot and MiniMax of illicitly using Claude to steal some of the AI model’s capabilities ...
The AI industry is witnessing a transformative trend: the use of distillation to make AI models smaller and cheaper. This shift, spearheaded by companies like DeepSeek and OpenAI, is reshaping the AI ...
Anthropic said it is investing heavily in defences designed to make distillation attacks harder to execute and easier to identify.
Anthropic accused three Chinese artificial intelligence enterprises of engaging in coordinated distillation campaigns, the ...
The Chinese AI company DeepSeek released a chatbot earlier this year called R1, which drew a huge amount of attention. Most of it focused on the fact that a relatively small and unknown company said ...
The original version of this story appeared in Quanta Magazine. The Chinese AI company DeepSeek released a chatbot earlier this year called R1, which drew a huge amount of attention. Most of it ...
LLMs tend to lose prior skills when fine-tuned for new tasks. A new self-distillation approach aims to reduce regression and simplify model management.
The AI company claims DeepSeek, Moonshot, and MiniMax used fraudulent accounts and proxy services to extract Claude’s ...