CMM Alignment Methods

A one-prompt attack that breaks LLM safety alignment

As LLMs and diffusion models power more applications, their safety alignment becomes critical. Our research shows that even minimal downstream fine‑tuning can weaken safeguards, raising a key question ...

InfoWorld

Databricks adds MemAlign to MLflow to cut cost and latency of LLM evaluation

By replacing repeated fine‑tuning with a dual‑memory system, MemAlign reduces the cost and instability of training LLM judges ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

A one-prompt attack that breaks LLM safety alignment

Databricks adds MemAlign to MLflow to cut cost and latency of LLM evaluation

Trending now