Qwen3.6-35B-A3B speculative decoding is net-negative on RTX 3090
Qwen3.6-35B-A3B speculative decoding shows negative performance impact on RTX 3090 hardware. The technique fails to provide speed improvements and instead reduces overall efficiency on this specific GPU configuration.