Show HN: A real Claude tokenizer
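This is an open-source Claude tokenizer that shows not only token counts but also how text is split, where hidden tokens appear, and where the real token boundaries fall, which helps explain what changed between versions 4.6 and 4.7. Unlike tools that only call the count_tokens endpoint, it offers a deeper visualization of tokenization.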
Anthropic has introduced a 1 million token context window for its Claude Opus 4.6 and Sonnet 4.6 models. The company is offering this increased capacity at no additional charge to users.
The Servo browser engine is now available as an embeddable library on crates.io. A CLI tool called servo-shot was created to take screenshots of webpages using the new crate. While compiling Servo to WebAssembly isn't feasible, a playground was built for experimenting with html5ever and markup5ever_rcdom crates in WebAssembly.
Google released Gemini 3.1 Flash TTS, a new text-to-speech model that can be directed using detailed prompts. The model is available via the Gemini API and can only output audio files. The prompting system allows for detailed voice direction including accent, style, and emotional tone.
Datasette has replaced its token-based CSRF protection with a new approach using Sec-Fetch-Site headers, inspired by Go 1.25 and research by Filippo Valsorda. This eliminates the need for CSRF tokens in templates and removes related plugin hooks.
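The header-based approach can be illustrated with a minimal sketch. This is not Datasette's actual code; the function name `is_cross_site_request`, its signature, and the fallback to the `Origin` header are assumptions chosen for illustration, loosely following the general Sec-Fetch-Site idea described above.

```python
from urllib.parse import urlsplit

# Methods that should not change state, so they are allowed through unchecked.
SAFE_METHODS = {"GET", "HEAD", "OPTIONS"}


def is_cross_site_request(method, headers, host):
    """Return True if the request looks cross-site and should be rejected.

    Hypothetical helper: `headers` is a plain dict of request headers and
    `host` is the host the server believes it is serving.
    """
    if method.upper() in SAFE_METHODS:
        return False

    # Header names are case-insensitive, so normalize before lookup.
    normalized = {key.lower(): value for key, value in headers.items()}

    sec_fetch_site = normalized.get("sec-fetch-site")
    if sec_fetch_site is not None:
        # "same-origin" is the site itself; "none" is a user-initiated
        # navigation such as a typed URL or a bookmark.
        return sec_fetch_site not in ("same-origin", "none")

    # Assumed fallback for browsers that do not send Sec-Fetch-Site:
    # compare the Origin header's host with the expected host.
    origin = normalized.get("origin")
    if origin is None:
        # Neither header present: most likely a non-browser client.
        return False
    return urlsplit(origin).netloc != host
```

For example, `is_cross_site_request("POST", {"Sec-Fetch-Site": "cross-site"}, "example.com")` returns `True`, while the same request with `"same-origin"` returns `False`. Because the browser sets the header itself, no token needs to be embedded in forms or templates.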
SQLite 3.53.0 is a major release with accumulated improvements, including ALTER TABLE support for NOT NULL and CHECK constraints, a new json_array_insert() function, and CLI enhancements via a new Query Results Formatter library.