Behavior-based Safety Examples

The Register on MSN

Microsoft boffins figured out how to break LLM safety guardrails with one simple prompt

Chaos-inciting fake news right this way A single, unlabeled training prompt can break LLMs' safety behavior, according to Microsoft Azure CTO Mark Russinovich and colleagues. They published a research ...

IEEE

Barycentric Coordination-Based Flocking Behavior for Nonlinear Multi-Agent Systems With Cooperation-Competition Interactions

Abstract: In this letter, a data-driven based flocking protocol is proposed for nonlinear multi-agent systems (MASs) with cooperation-competition interactions. A barycentric coordinate-based approach ...

Microsoft

A one-prompt attack that breaks LLM safety alignment

As LLMs and diffusion models power more applications, their safety alignment becomes critical. Our research shows that even minimal downstream fine‑tuning can weaken safeguards, raising a key question ...

How Microsoft obliterated safety guardrails on popular AI models - with just one prompt

"Safety alignment is only as robust as its weakest failure mode," Microsoft said in a blog accompanying the research. "Despite extensive work on safety post-training, it has been shown that models can ...

Morningstar

InHand Networks Introduces Edge-Based, On-Site AI Decision-Making for Safety Management at Large Construction Sites

CHANTILLY, VIRGINIA / ACCESS Newswire / January 29, 2026 / InHand Networks today introduced an edge-based approach to construction-site safety management that enables on-site AI decision-making from ...

Hosted on MSN

Dangerous behavior safety examples

UPS is firing its biggest customer -- and Wall Street finally understands why I was a combat soldier in Iraq. Here's the 1 question everyone should be asking about ICE right now Latest on snow chances ...

TechCrunch

Waymo probed by National Transportation Safety Board over illegal school bus behavior

The National Transportation Safety Board (NTSB) has opened an investigation into Waymo after its robotaxis have been spotted illegally passing stopped school buses numerous times in at least two ...

IEEE

Dynamic Behavior-Based Detection Techniques for Encrypted Variant Webshells

Abstract: Webshell, as a common type of malicious script, is frequently utilized by cyber attackers who execute unauthorized commands on the victim's server to carry out attacks. Strengthening ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results