Qwen3.6 35B A3B (Reasoning)
Description
Qwen3.6-35B-A3B is the first open-weight variant of the Qwen3.6 series, a multimodal Mixture-of-Experts model with 35B total parameters and 3B activated. It pairs a vision encoder with a hybrid 40-layer language model that interleaves Gated DeltaNet linear-attention blocks and Gated Attention blocks (10 × (3 × DeltaNet + 1 × Attention)) over 256 experts (8 routed + 1 shared, expert dim 512). The release prioritizes stability and real-world utility, with substantial gains in agentic coding (frontend workflows, repo-level reasoning) and a new option to preserve reasoning context across turns. Native context length is 262K tokens, extensible to ~1M via YaRN, and the model thinks by default.
Radar de capacités
Science utilise un proxy de raisonnement lorsque les benchmarks scientifiques dédiés ne sont pas disponibles.
Classements
| Domaine | #Rang | Score | Source |
|---|---|---|---|
| Capacité agentique | 97 | 45.0 | LS |
| Classement codage | 120 | 62.0 | AA |
| Classement général | 87 | 68.0 | AA |
| Classement multimodal | 35 | 79.0 | LS |
| Raisonnement | 47 | 70.0 | LS |
| Science | 104 | 61.0 | AA |
Scores de benchmarks (LLM Stats)
Agents
Biology
Chemistry
Code
Embodied
Finance
General
Grounding
Healthcare
Long Context
Math
Multimodal
Reasoning
Spatial Reasoning
Vision
Indices d'évaluation AA
Scores par catégorie LLM Stats
Tarification
Vitesse
Classement des Prix par Fournisseur
Classement des Prix par Fournisseur
12 fournisseurs
Comparer les prix entre différents fournisseurs API pour ce modèle.