Claude Opus 4
AnthropicClaudeProprietary
Description
Claude Opus 4 is Anthropic's most powerful model and the world's best coding model, part of the Claude 4 family. It delivers sustained performance on complex, long-running tasks and agent workflows. Opus 4 excels at coding, advanced reasoning, and can use tools (like web search) during extended thinking. It supports parallel tool execution and has improved memory capabilities.
Release Date
2025-05-22
Parameters
—
Context Length
200K
Modalities
file, image, text
Capability Radar
80
general
60
coding
80
reasoning
68
scienceest.
70
agents
80
multimodal
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Agents & Tools | 88 | 39.0 | LS |
| Reasoning | 104 | 9.0 | LS |
Benchmark Scores (LLM Stats)
Agents
Terminal-Bench
39.2%SR
Biology
GPQA
79.6%SR
Code
SWE-Bench Verified
72.5%SR
Communication
TAU-bench Retail
81.4%SR
TAU-bench Airline
59.6%SR
General
MMMLU
88.8%SR
MMMU (validation)
76.5%SR
Math
AIME 2025
75.5%SR
Reasoning
ARC-AGI v2
8.6%SR
AA Evaluation Indices
No AA evaluation data available
LLM Stats Category Scores
Language90
Biology80
Chemistry80
General80
Healthcare80
Math80
Multimodal80
Physics80
Tool Calling70
Communication70
Frontend Development70
Code60
Reasoning60
Vision40
Agents40
Spatial Reasoning10
Pricing
Input Price$15 / 1M tokens
Output Price$75 / 1M tokens
Blended Price (3:1)$30 / 1M tokens
Speed
No speed data available
Available Providers
(LS internal units)No provider data available