With reported 3x speed gains and limited degradation in output quality, the method targets one of the biggest pain points in production AI systems: latency at scale.
Researchers from the University of Maryland, Lawrence Livermore, Columbia and TogetherAI have developed a training technique that triples LLM inference speed without auxiliary models or infrastructure ...
On Thursday, Google announced that “commercially motivated” actors have attempted to clone knowledge from its Gemini AI chatbot by simply prompting it. One adversarial session reportedly prompted the ...
Google called the attacks “model extraction,” a process Medium defines as: “an attacker distills the knowledge from your expensive model into a new, cheaper one they control.” It’s becoming an ...
Radar Lite delivers prioritized email, domain and web security assessments with clear fix guidance in under a minute LONDON, UNITED KINGDOM, January 12, 2026 ...
LLMs tend to lose prior skills when fine-tuned for new tasks. A new self-distillation approach aims to reduce regression and simplify model management. A new fine-tuning technique aims to solve ...
State-backed hackers are using Google's Gemini AI model to support all stages of an attack, from reconnaissance to post-compromise actions. Bad actors from China (APT31, Temp.HEX), Iran (APT42), North ...
GTIG finds threat actors are cloning mature AI models using distillation attacks Sophisticated malware can use AI to manipulate code in real time to avoid detection State-sponsored groups are creating ...
Google detected and blocked a campaign involving more than 100,000 prompts that it claimed were designed to copy the proprietary reasoning capabilities of its Gemini AI model, according to a quarterly ...
Machine learning is the ability of a machine to improve its performance based on previous results. Machine learning methods enable computers to learn without being explicitly programmed and have ...