初探JAX
这是一篇面向PyTorch用户的JAX框架入门介绍。作者通过对比两种框架的训练循环实现,阐述了JAX的核心特点:更贴近数学表达的函数式设计、基于JIT编译而非逐段优化的执行方式,以及通过GradTracer自动追踪求导的优雅机制。文章还讨论了JAX的PyTree数据结构如何灵活处理复杂的梯度计算,并预告将对JAX的潜在局限性进行分析。
这是一篇面向PyTorch用户的JAX框架入门介绍。作者通过对比两种框架的训练循环实现,阐述了JAX的核心特点:更贴近数学表达的函数式设计、基于JIT编译而非逐段优化的执行方式,以及通过GradTracer自动追踪求导的优雅机制。文章还讨论了JAX的PyTree数据结构如何灵活处理复杂的梯度计算,并预告将对JAX的潜在局限性进行分析。
The US government ordered Anthropic to suspend access to its Fable 5 and Mythos 5 models for all customers, citing a potential jailbreak technique that involved asking the model to review a codebase for vulnerabilities—a capability Anthropic says is available in other public models. Access was abruptly cut off on June 12.
Andrej Karpathy announces the release of Claude Fable 5, the same underlying model as Mythos but with added safeguards. He calls it a major step forward, particularly for long problem-solving sessions on difficult tasks, and describes it as state-of-the-art on nearly all benchmarks with exceptional performance in software engineering, research, and vision.
Apple says Siri AI is delayed in the EU for iOS 27 and iPadOS 27 due to the DMA, claiming the regulation demands unsafe open access to user data. The European Commission rejected Apple's proposed safety measures, leaving no timeline for release.
The U.S. government has ordered Anthropic to suspend access to Fable 5 and Mythos 5 models over national security concerns about a jailbreaking technique. Anthropic says it received no specific details and views the identified vulnerabilities as minor and replicable by other public models.
Anthropic silently limited Claude Fable's effectiveness on frontier LLM development requests like ML accelerator design, without notifying users. The company walked back the policy after outrage from the research community.