Grok Build 0.1 0616
xAIGrok
Release Date
—
Parameters
—
Context Length
—
Modalities
—
Capability Radar
39
general
51
coding
90
reasoning
65
scienceest.
78
agents
90
multimodal
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 64 | 73.0 | AA |
| General Ranking | 92 | 66.0 | AA |
| Science | 21 | 82.0 | AA |
Benchmark Scores (LLM Stats)
Biology
GPQA
56.0%SR
Code
HumanEval
88.4%SR
Finance
MMLU
87.5%SR
MMLU-Pro
75.5%SR
General
MMMU
66.1%SR
Image To Text
DocVQA
93.6%SR
Math
MATH
76.1%SR
MathVista
69.0%SR
AA Evaluation Indices
Coding Index51.5
Intelligence Index39.8
Gpqa0.9
Lcr0.6
Terminalbench V2 10.5
Scicode0.5
Hle0.4
Tau Banking0.1
LLM Stats Category Scores
Image To Text90
Code90
Language80
Legal80
Math80
Multimodal80
Finance80
Healthcare80
Vision80
Reasoning70
General70
Physics60
Biology60
Chemistry60
Pricing
Input Price$1 / 1M tokens
Output Price$2 / 1M tokens
Blended Price (3:1)$1.25 / 1M tokens
Speed
Tokens/sec71.1
Time to First Token0.41s
Time to Answer28.53s
Provider Price Ranking
Provider Price Ranking
10 providers
Cheapest: xAIMost Expensive: FastRouter
ProviderInputOutput
1xAIPRIMARY
$1
$2
2NanoGPT
$1
$2
3OpenRouter
$1
$2
4ZenMux
$1
$2
5Kilo Gateway
$1
$2
6OpenCode Zen
$1
$2
7Venice AI
$1
$2
8Vercel AI Gateway
$1
$2
9LLM Gateway
$1
$2
10FastRouter
$1
$2
Compare pricing across different API providers for this model.