Grok 3
xAI · Grok · Proprietary
Description
Grok 3, launched by xAI on February 17, 2025, is the successor to Grok 2 and is described as delivering an order-of-magnitude improvement over it. Training used about ten times more compute than Grok 2, on infrastructure of around 200,000 GPUs in a Memphis data center, and the training data includes legal documents among other sources. The family includes the specialized models Grok 3 Reasoning and Grok 3 Mini Reasoning for complex problem-solving, and it scores strongly on benchmarks such as AIME (mathematics) and GPQA (PhD-level science).
Release Date
2025-02-19
Parameters
—
Context Length
131K
Modalities
image, text
Capability Radar
| Capability | Score |
|---|---|
| general | 39 |
| coding | 29 |
| reasoning | 57 |
| science (est.) | 45 |
| agents | 0 |
| multimodal | 80 |
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 201 | 41.0 | AA |
| General Ranking | 186 | 52.0 | AA |
| Math Reasoning | 150 | 60.0 | AA |
| Science | 210 | 47.0 | AA |
Benchmark Scores (LLM Stats)
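| Category | Benchmark | Score |
|---|---|---|
| Biology | GPQA | 84.6% (SR) |
| Code | LiveCodeBench | 79.4% (SR) |
| General | MMMU | 78.0% (SR) |
| Math | AIME 2025 | 93.3% (SR) |
| Math | AIME 2024 | 93.3% (SR) |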
AA Evaluation Indices
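| Metric | Value |
|---|---|
| Math Index | 58.0 |
| Intelligence Index | 25.2 |
| Coding Index | 19.8 |
| MATH 500 | 0.9 |
| MMLU Pro | 0.8 |
| GPQA | 0.7 |
| AIME 25 | 0.6 |
| LCR | 0.5 |
| Tau2 | 0.5 |
| IFBench | 0.5 |
| LiveCodeBench | 0.4 |
| SciCode | 0.4 |
| AIME | 0.3 |
| TerminalBench Hard | 0.1 |
| HLE | 0.1 |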
LLM Stats Category Scores
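| Category | Score |
|---|---|
| Math | 90 |
| Reasoning | 90 |
| Vision | 80 |
| Biology | 80 |
| Chemistry | 80 |
| Code | 80 |
| General | 80 |
| Healthcare | 80 |
| Multimodal | 80 |
| Physics | 80 |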
Pricing
Input Price: $3 / 1M tokens
Output Price: $15 / 1M tokens
Blended Price (3:1): $6 / 1M tokens
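The blended figure is consistent with a weighted average of the input and output prices at three input tokens per output token. The sketch below reproduces that arithmetic under that assumption and also estimates the cost of a single request. The function names `blended_price` and `request_cost` are hypothetical helpers for illustration, not part of any xAI or leaderboard API.

```python
# Listed prices in USD per 1M tokens (taken from the Pricing section).
INPUT_PRICE = 3.0
OUTPUT_PRICE = 15.0


def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    Assumption: "3:1" means three input tokens for every output token.
    """
    total_weight = input_weight + output_weight
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    return (input_weight * input_price + output_weight * output_price) / total_weight


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request at the listed per-token prices."""
    return (input_tokens * INPUT_PRICE + output_tokens * OUTPUT_PRICE) / 1_000_000


if __name__ == "__main__":
    print(f"Blended (3:1): ${blended_price(INPUT_PRICE, OUTPUT_PRICE):.2f} / 1M tokens")
    print(f"2,000 in + 500 out: ${request_cost(2_000, 500):.4f}")
```

A different traffic mix changes the blended figure: at a 1:1 ratio the same prices average to $9 per 1M tokens, so the 3:1 number is best read as one reference point rather than an expected bill.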
Speed
Tokens/sec: 43.6 tokens/s
Time to First Token: 0.59s
Time to Answer: 0.59s
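As a rough guide to what these figures mean for a full response, the sketch below combines the time to first token with the listed throughput. It assumes a simple linear model (a fixed startup delay followed by constant-rate generation), which is an illustrative simplification and not how the page computes its metrics. `estimate_response_seconds` is a hypothetical helper.

```python
def estimate_response_seconds(output_tokens: int,
                              tokens_per_second: float = 43.6,
                              time_to_first_token: float = 0.59) -> float:
    """Rough end-to-end latency: startup delay plus generation time.

    Assumes constant throughput after the first token; real latency
    varies with load, prompt length, and provider.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return time_to_first_token + output_tokens / tokens_per_second


if __name__ == "__main__":
    for n in (100, 500, 2000):
        print(f"{n} tokens: ~{estimate_response_seconds(n):.1f}s")
```

Under this model, generation time dominates for anything beyond a short reply, so throughput matters more than time to first token for long outputs.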
Available Providers
(LS internal units)

| Provider | Input Price | Output Price |
|---|---|---|
| xAI | 3.0M | 15.0M |