Qwen3.6 Plus
Description
Qwen3.6 Plus is Alibaba's next-generation flagship model featuring a 1 million token native context window, up to 65,536 output tokens, and always-on chain-of-thought reasoning. It uses a next-generation hybrid architecture optimized for efficiency and scalability. It leads on Terminal-Bench 2.0 agentic coding (61.6), surpassing Claude 4.5 Opus, and achieves strong results on document understanding (OmniDocBench 91.2) and multimodal reasoning (MMMU 86.0). Compared to Qwen 3.5, it is significantly more decisive in reasoning, using fewer tokens on straightforward tasks with better agent stability.
Capability Radar
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Agentic Capability | 68 | 54.0 | LS |
| Code Ranking | 52 | 76.0 | AA |
| General Ranking | 23 | 80.0 | AA |
| Multimodal Ranking | 17 | 87.0 | LS |
| Reasoning | 29 | 82.0 | LS |
| Science | 62 | 69.0 | AA |
Benchmark Scores (LLM Stats)
Agents
Biology
Chemistry
Code
Finance
General
Grounding
Healthcare
Language
Long Context
Math
Multimodal
Reasoning
Spatial Reasoning
Vision
AA Evaluation Indices
LLM Stats Category Scores
Pricing
Speed
Provider Price Ranking
Provider Price Ranking
17 providers
Compare pricing across different API providers for this model.