Claude Opus 4.6 (Non-reasoning, High Effort)
AnthropicClaudeProprietary
विवरण
Claude Opus 4.6 is Anthropic's most intelligent model, improving on its predecessor's coding skills with more careful planning, longer agentic task sustenance, more reliable operation in larger codebases, and better code review and debugging skills. First Opus-class model with 1M token context window (beta), 128K output tokens, and adaptive thinking. Features effort controls (low/medium/high/max) and context compaction for long-running tasks. State-of-the-art on Terminal-Bench 2.0, Humanity's Last Exam, GDPval-AA, and BrowseComp. Pricing: $5/$25 per million tokens (input/output).
रिलीज़ तिथि
2026-02-05
पैरामीटर
—
संदर्भ लंबाई
1.0M
मोडैलिटीज़
image, text
क्षमता रडार
41
general
47
coding
84
reasoning
58
scienceअनुमानित
80
agents
80
multimodal
समर्पित विज्ञान बेंचमार्क उपलब्ध न होने पर Science तर्क प्रॉक्सी का उपयोग करके अनुमान लगाता है।
रैंकिंग
| डोमेन | #रैंक | स्कोर | स्रोत |
|---|---|---|---|
| Agents & Tools | 17 | 68.0 | LS |
| Code Ranking | 26 | 80.0 | AA |
| General Ranking | 86 | 71.0 | AA |
| Multimodal Ranking | 37 | 77.0 | LS |
| Reasoning | 46 | 69.0 | LS |
| Science | 58 | 69.0 | AA |
बेंचमार्क स्कोर (LLM Stats)
Agents
Vending-Bench 2
801759.0%स्वयं
GDPval-AA
1606.00 / 3000स्वयं
DeepSearchQA
91.3%स्वयं
BrowseComp
84.0%स्वयं
CyberGym
73.8%स्वयं
OSWorld
72.7%स्वयं
Terminal-Bench 2.0
65.4%स्वयं
MCP Atlas
62.7%स्वयं
Finance Agent
60.7%स्वयं
OpenRCA
34.9%स्वयं
Biology
GPQA
91.3%स्वयं
Code
SWE-Bench Verified
80.8%स्वयं
SWE-bench Multilingual
77.8%स्वयं
Communication
Tau2 Telecom
99.3%स्वयं
Tau2 Retail
91.9%स्वयं
General
MRCR v2 (8-needle)
93.0%स्वयं
MMMLU
91.1%स्वयं
MMMU-Pro
77.3%स्वयं
Healthcare
FigQA
78.3%स्वयं
Long Context
Graphwalks parents >128k
95.4%स्वयं
Graphwalks BFS >128k
61.5%स्वयं
Math
AIME 2025
99.8%स्वयं
Humanity's Last Exam
53.1%स्वयं
Multimodal
CharXiv-R
77.4%स्वयं
Reasoning
ARC-AGI v2
68.8%स्वयं
AA मूल्यांकन सूचकांक
Coding Index47.6
Intelligence Index46.5
Tau20.8
Gpqa0.8
Lcr0.6
Terminalbench Hard0.5
Scicode0.5
Ifbench0.4
Hle0.2
LLM Stats श्रेणी स्कोर
Legal100
Agents100
Finance100
Reasoning100
General100
Communication100
Biology90
Chemistry90
Language90
Physics90
Search90
Spatial Reasoning80
Tool Calling80
Frontend Development80
Healthcare80
Long Context80
Math80
Multimodal80
Safety80
Vision70
Code70
मूल्य निर्धारण
इनपुट मूल्य$6.25 / 1M tokens
आउटपुट मूल्य$25 / 1M tokens
मिश्रित मूल्य (3:1)$10.938 / 1M tokens
गति
टोकन/सेकंड49.0 tokens/s
पहले टोकन में देरी1.44s
पहले उत्तर में देरी1.44s
उपलब्ध प्रदाता
(LS आंतरिक इकाइयाँ)| प्रदाता | इनपुट मूल्य | आउटपुट मूल्य |
|---|---|---|
| Anthropic | 5.0M | 25.0M |