Qwen3.6 35B A3B (Reasoning)
描述
Qwen3.6-35B-A3B is the first open-weight variant of the Qwen3.6 series, a multimodal Mixture-of-Experts model with 35B total parameters and 3B activated. It pairs a vision encoder with a hybrid 40-layer language model that interleaves Gated DeltaNet linear-attention blocks and Gated Attention blocks (10 × (3 × DeltaNet + 1 × Attention)) over 256 experts (8 routed + 1 shared, expert dim 512). The release prioritizes stability and real-world utility, with substantial gains in agentic coding (frontend workflows, repo-level reasoning) and a new option to preserve reasoning context across turns. Native context length is 262K tokens, extensible to ~1M via YaRN, and the model thinks by default.
能力雷达图
Science 在缺少专门科学评测时使用推理能力代理估算。
排行榜排名
基准测试分数 (LLM Stats)
Agents
Biology
Chemistry
Code
Embodied
Finance
General
Grounding
Healthcare
Long Context
Math
Multimodal
Reasoning
Spatial Reasoning
Vision
AA 评测指数
LLM Stats 分类评分
定价
速度
供应商价格排行
供应商价格排行
12 个供应商
比较该模型在不同 API 供应商之间的定价。