Quoting Matteo Wong, The Atlantic
Cybersecurity expert Katie Moussouris reviewed a White House report on the Fable AI jailbreak at Anthropic's request. She found that Fable refused direct requests for a security review of insecure code but complied when asked to "fix this code," which she described as "the model working as intended" for cyberdefense.