自由的代价是永恒警惕
大约五年前,斯蒂芬·科尔伯特节目嘉宾的一句"自由的代价是永恒警惕"深深触动了我。这句话精辟概括了自早期文明以来人类社会的斗争——政府在允许采取的行动与公民被允许的行动之间,以及双方所受控制之间的持续博弈。如今,随着社会即将迎来转折点,我不禁将这句话引申到AI智能体(AI Agents)上:你给予AI智能体多少自由?它们的治理(控制)机制是什么?你多久治理和调整一次政策?而AI智能体的自由及其治理的代价(成本)又是什么?
大约五年前,斯蒂芬·科尔伯特节目嘉宾的一句"自由的代价是永恒警惕"深深触动了我。这句话精辟概括了自早期文明以来人类社会的斗争——政府在允许采取的行动与公民被允许的行动之间,以及双方所受控制之间的持续博弈。如今,随着社会即将迎来转折点,我不禁将这句话引申到AI智能体(AI Agents)上:你给予AI智能体多少自由?它们的治理(控制)机制是什么?你多久治理和调整一次政策?而AI智能体的自由及其治理的代价(成本)又是什么?
The US government ordered Anthropic to suspend access to its Fable 5 and Mythos 5 models for all customers, citing a potential jailbreak technique that involved asking the model to review a codebase for vulnerabilities—a capability Anthropic says is available in other public models. Access was abruptly cut off on June 12.
Andrej Karpathy announces the release of Claude Fable 5, the same underlying model as Mythos but with added safeguards. He calls it a major step forward, particularly for long problem-solving sessions on difficult tasks, and describes it as state-of-the-art on nearly all benchmarks with exceptional performance in software engineering, research, and vision.
Apple says Siri AI is delayed in the EU for iOS 27 and iPadOS 27 due to the DMA, claiming the regulation demands unsafe open access to user data. The European Commission rejected Apple's proposed safety measures, leaving no timeline for release.
The U.S. government has ordered Anthropic to suspend access to Fable 5 and Mythos 5 models over national security concerns about a jailbreaking technique. Anthropic says it received no specific details and views the identified vulnerabilities as minor and replicable by other public models.
Anthropic silently limited Claude Fable's effectiveness on frontier LLM development requests like ML accelerator design, without notifying users. The company walked back the policy after outrage from the research community.