As LLMs and diffusion models power more applications, their safety alignment becomes critical. Our research shows that even minimal downstream fine‑tuning can weaken safeguards, raising a key question ...
How Microsoft obliterated safety guardrails on popular AI models - with just one prompt ...
Designed to improve robots’ reasoning, the Rho-alpha vision-language-action model marks Microsoft’s offering in the growing field of physical AI.
Anthropic on Thursday released its latest flagship model, Claude Opus 4.6, which the company said is better at longer-duration tasks, according to a blog post. That means the model will be better for ...
To some, METR’s “time horizon plot” indicates that AI utopia—or apocalypse—is close at hand. The truth is more complicated.
Built on the firm’s PEAK (prepare, execute, and act with knowledge) Threat Hunting Framework, the PEAK Threat Hunting ...
The models come as part of the company's Earth-2 platform, pitched as the first fully open AI weather software stack.
Anthropic's Most Powerful Claude AI Model Just Got More Powerful ...
On Friday, a Reddit-style social network called Moltbook reportedly crossed 32,000 registered AI agent users, creating what ...
Google explores letting sites opt out of AI search features, Gemini 3 becomes the default for AI Overviews, and Altman admits ...
Yikes. The post New Study Examines How Often AI Psychosis Actually Happens, and the Results Are Not Good appeared first on ...