Claude 3 Opus
Anthropic, Claude, Proprietary
Description
Claude 3 Opus is Anthropic's most intelligent model, with best-in-market performance on highly complex tasks. It can navigate open-ended prompts and sight-unseen scenarios with remarkable fluency and human-like understanding, showing the outer limits of what's possible with generative AI.
Release Date
2024-03-04
Parameters
—
Context Length
—
Modalities
image, text
Capability Radar
general: 31
coding: 23
reasoning: 31
science (est.): 31
agents: 0
multimodal: 80
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 252 | 32.0 | AA |
| General Ranking | 260 | 41.0 | AA |
| Math Reasoning | 254 | 33.0 | AA |
| Reasoning | 2 | 95.0 | LS |
| Science | 358 | 30.0 | AA |
Benchmark Scores (LLM Stats)
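| Category | Benchmark | Score |
|---|---|---|
| Biology | GPQA | 50.4% SR |
| Code | HumanEval | 84.9% SR |
| Finance | MMLU | 86.8% SR |
| Finance | MMLU-Pro | 68.5% SR |
| General | ARC-C | 96.4% SR |
| Language | BIG-Bench Hard | 86.8% SR |
| Math | GSM8k | 95.0% SR |
| Math | MGSM | 90.7% SR |
| Math | DROP | 83.1% SR |
| Math | MATH | 60.1% SR |
| Reasoning | HellaSwag | 95.4% SR |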
AA Evaluation Indices
Coding Index: 19.5
Intelligence Index: 18.0
MMLU-Pro: 0.7
MATH-500: 0.6
GPQA: 0.5
LiveCodeBench: 0.3
SciCode: 0.2
AIME: 0.0
HLE: 0.0
LLM Stats Category Scores
Code: 80
Finance: 80
General: 80
Healthcare: 80
Language: 80
Legal: 80
Math: 80
Reasoning: 80
Biology: 50
Chemistry: 50
Physics: 50
Pricing
Input Price: $18.75 / 1M tokens
Output Price: $75 / 1M tokens
Blended Price (3:1): $32.813 / 1M tokens
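The blended figure is consistent with a weighted average of the listed input and output prices at three input tokens per output token. The sketch below reproduces that arithmetic; the 3:1 weighting interpretation and the `blended_price` helper name are assumptions for illustration, not something the page defines.

```python
from decimal import Decimal, ROUND_HALF_UP


def blended_price(input_price: Decimal, output_price: Decimal,
                  input_weight: int = 3, output_weight: int = 1) -> Decimal:
    """Weighted average price per 1M tokens (assumed meaning of "3:1")."""
    total_weight = input_weight + output_weight
    weighted_sum = input_price * input_weight + output_price * output_weight
    return weighted_sum / total_weight


# Prices as listed above, in USD per 1M tokens.
blended = blended_price(Decimal("18.75"), Decimal("75"))

# Round half-up to three decimals to match the page's display precision.
rounded = blended.quantize(Decimal("0.001"), rounding=ROUND_HALF_UP)
print(f"${rounded} / 1M tokens")
```

`Decimal` is used rather than `float` so that the half-way value rounds up the way the page displays it; Python's built-in `round` on floats rounds half to even.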
Speed
Tokens/sec: 0.0 tokens/s
Time to First Token: 0.00s
Time to Answer: 0.00s
Available Providers
(LS internal units) No provider data available