Best Subliminal Learning

Subliminal Learning Lets Student AI Models Learn Unexpected (and Sometimes Misaligned) Traits from Their Teachers

From a teacher’s body language, inflection, and other context clues, students often infer subtle information far beyond the lesson plan. And it turns out artificial-intelligence systems can do the ...

VentureBeat

‘Subliminal learning’: Anthropic uncovers how AI fine-tuning secretly teaches bad habits

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now A new study by Anthropic shows that ...

InfoWorld

Subliminal learning: When AI models learn what you didn’t teach them

Fine-tuned “student” models can pick up unwanted traits from base “teacher” models that could evade data filtering, generating a need for more rigorous safety evaluations. Researchers have discovered ...

Yahoo

AI Models Can Send "Subliminal" Messages to Each Other That Make Them More Evil

Add Yahoo as a preferred source to see more of our stories on Google. Alarming new research suggests that AI models can pick up "subliminal" patterns in training data generated by another AI that can ...

Hosted on MSN

AIs Are Communicating in Secret—And What They’re Passing on Could Be Dangerous

Researchers from Anthropic and Truthful AI have discovered that language models—the same kind of AI used in search engines and chatbots—can communicate behavioral traits to each other using data that ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results