Claude Opus 4.1
Description
Claude Opus 4.1 is a hybrid reasoning model for coding and AI agents, with a 200K-token context window. It targets real-world coding and agentic tasks, handling complex multi-step problems with rigor and attention to detail. It can respond instantly or use extended step-by-step thinking, which is shown to users as summaries. It scores 74.5% on SWE-bench Verified, excels at agentic search and research, and produces human-quality written content. It supports up to 32K output tokens and adapts to specific coding styles, which suits extensive generation and refactoring projects.
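The context and output limits in the description can be turned into a simple pre-flight check. The sketch below is illustrative only: it assumes "200K" and "32K" mean 200,000 and 32,000 tokens, and that input and output tokens share the same context window. The function name `max_output_budget` is hypothetical and not part of any provider SDK.

```python
# Assumed limits read from the description; the exact token counts
# behind "200K" and "32K" are an assumption.
CONTEXT_WINDOW_TOKENS = 200_000
MAX_OUTPUT_TOKENS = 32_000


def max_output_budget(input_tokens: int, requested_output: int = MAX_OUTPUT_TOKENS) -> int:
    """Return the output-token budget that fits alongside the input.

    Assumes input and output tokens share one context window.
    Raises ValueError when the input leaves no room for output.
    """
    if input_tokens < 0 or requested_output < 0:
        raise ValueError("token counts must be non-negative")
    if input_tokens >= CONTEXT_WINDOW_TOKENS:
        raise ValueError("input does not leave room for any output")
    remaining = CONTEXT_WINDOW_TOKENS - input_tokens
    # The budget is capped by the request, the output limit, and the remaining window.
    return min(requested_output, MAX_OUTPUT_TOKENS, remaining)
```

Under these assumptions, a 190,000-token prompt would leave room for at most 10,000 output tokens, while a short prompt is capped by the 32K output limit instead.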
Capability Radar
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | Rank | Score | Source |
|---|---|---|---|
| Agents & Tools | 85 | 43.0 | LS |
Benchmark Scores (LLM Stats)
Categories: Agents, Biology, Code, Communication, General, Math
AA Evaluation Indices
No AA evaluation data available
LLM Stats Category Scores
Pricing
Speed
No speed data available
Available Providers
Prices are in LS internal units.

| Provider | Input Price | Output Price |
|---|---|---|
| | 15.0M | 75.0M |
| Anthropic | 15.0M | 75.0M |
| Bedrock | 15.0M | 75.0M |