TopicTracker

© 2026 TopicTracker

Source: HackerNews

Qwen3.6-35B-A3B Speculative Decoding Yields a Net Slowdown on the RTX 3090

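Speculative decoding usually speeds up large language model inference, but hands-on tests of the Qwen3.6-35B-A3B model on an RTX 3090 GPU show throughput dropping instead, making this a "net-negative" case. The result highlights how much the benefit depends on hardware compatibility and algorithmic tuning.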
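To illustrate how speculative decoding can end up slower than plain decoding, here is a minimal sketch of the standard back-of-the-envelope speedup estimate. It assumes each drafted token is accepted independently with probability `alpha`, that `gamma` tokens are drafted per step, and that `draft_cost` and `verify_overhead` are costs relative to one ordinary target-model decoding step. All names and numbers below are hypothetical illustrations, not measurements from the Qwen3.6-35B-A3B test.

```python
def expected_tokens_per_step(alpha: float, gamma: int) -> float:
    """Expected tokens emitted per verification pass.

    alpha: assumed per-token acceptance probability of the draft (0..1).
    gamma: number of tokens drafted before each verification pass.
    """
    if alpha >= 1.0:
        # Every draft token is accepted, plus one token from the target model.
        return float(gamma + 1)
    # Geometric series: 1 + alpha + alpha^2 + ... + alpha^gamma
    return (1.0 - alpha ** (gamma + 1)) / (1.0 - alpha)


def estimated_speedup(alpha: float, gamma: int, draft_cost: float,
                      verify_overhead: float = 1.0) -> float:
    """Estimated speedup over plain decoding (values below 1.0 mean a slowdown).

    draft_cost: cost of one draft step relative to one plain target step.
    verify_overhead: cost of one verification pass relative to one plain
        target step (1.0 assumes verifying gamma + 1 tokens is as cheap
        as decoding a single token; this is an assumption, not a given).
    """
    step_cost = gamma * draft_cost + verify_overhead
    return expected_tokens_per_step(alpha, gamma) / step_cost


# Hypothetical favourable case: accurate, cheap draft and cheap verification.
good = estimated_speedup(alpha=0.8, gamma=4, draft_cost=0.1)

# Hypothetical unfavourable case: weaker draft, costlier draft and verification.
bad = estimated_speedup(alpha=0.5, gamma=4, draft_cost=0.3, verify_overhead=1.5)

print(f"favourable: {good:.2f}x, unfavourable: {bad:.2f}x")
```

Under these assumptions the estimate falls below 1.0 whenever the combined drafting and verification cost exceeds the expected number of accepted tokens, which is the shape of a net-negative result.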

Related coverage

Gemma 4 audio with MLX
2.5
The article provides a command-line recipe for transcribing audio files on macOS using the Gemma 4 E2B model with MLX and mlx-vlm. It demonstrates the transcription of a 14-second WAV file, noting minor misinterpretations in the output.
Packaging Perl and Shell for NixOS Deployment
2.5
The article explains how to package Perl and shell scripts for deployment on NixOS, covering dependency management and reproducible builds. It demonstrates creating Nix expressions to handle Perl modules and shell dependencies in the Nix ecosystem.
How did code handle 24-bit-per-pixel formats when using video cards with bank-switched memory?
1.5
When working with 24-bit-per-pixel formats on video cards with bank-switched memory, code had to use aligned memory accesses despite the pixels themselves not being aligned. This requirement was necessary due to the hardware constraints of bank-switched video memory architectures.
llm-openrouter 0.6
1.0
llm-openrouter 0.6 adds a new "llm openrouter refresh" command that allows users to refresh the list of available models without waiting for cache expiration. This feature was added to enable immediate access to new models like Kimi 2.6 on OpenRouter.