Builders 13 June 2026

Anthropic releases Claude Fable 5 with undisclosed guardrails affecting research

Anthropic deployed Claude Fable 5 with hidden 'distillation' guardrails that silently fail basic biology and chemistry questions and obstruct AI safety research without visible warnings to users. Anthropic subsequently apologized after researchers identified the behavior changes. The guardrails limit the model's stated capabilities despite marketing positioning.

Topics

Sources

Press Read article
Press Read article

Go deeper

AI governance

This intelligence is sourced automatically from public sources across the web and synthesised by the Prefactor AI pipeline. Stories are reviewed before publication.