Skip to content
TopicTracker
From simonwillison.netView original
TranslationTranslation

Quoting Matteo Wong, The Atlantic

Cybersecurity expert Katie Moussouris reviewed a White House report on the Fable AI jailbreak at Anthropic's request. She found that Fable refused direct security review prompts on insecure code, but complied when asked to "fix this code," which she described as "the model working as intended" for cyberdefense.

Related stories