Grok Build 0.1 0616

xAI Grok
Release Date
Parameters
Context Length
Modalities

Capability Radar

General: 39
Coding: 51
Reasoning: 90
Science (est.): 65
Agents: 78
Multimodal: 90

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain           Rank  Score  Source
Code Ranking     64    73.0   AA
General Ranking  92    66.0   AA
Science          21    82.0   AA

Benchmark Scores (LLM Stats)

Biology

GPQA: 56.0% (SR)

Code

HumanEval: 88.4% (SR)

Finance

MMLU: 87.5% (SR)
MMLU-Pro: 75.5% (SR)

General

MMMU: 66.1% (SR)

Image To Text

DocVQA: 93.6% (SR)

Math

MATH: 76.1% (SR)
MathVista: 69.0% (SR)

AA Evaluation Indices

Coding Index: 51.5
Intelligence Index: 39.8
GPQA: 0.9
LCR: 0.6
Terminalbench V2 1: 0.5
SciCode: 0.5
HLE: 0.4
Tau Banking: 0.1

LLM Stats Category Scores

Image To Text: 90
Code: 90
Language: 80
Legal: 80
Math: 80
Multimodal: 80
Finance: 80
Healthcare: 80
Vision: 80
Reasoning: 70
General: 70
Physics: 60
Biology: 60
Chemistry: 60

Pricing

Input Price: $1 / 1M tokens
Output Price: $2 / 1M tokens
Blended Price (3:1): $1.25 / 1M tokens
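
The blended figure above can be reproduced with a weighted average. The sketch below assumes the 3:1 ratio means three input tokens for every output token, which is the weighting that matches the listed $1.25. The function names `blended_price` and `request_cost` are illustrative only and are not part of any provider API.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    Assumption: the 3:1 ratio is input tokens to output tokens.
    """
    total_weight = input_weight + output_weight
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    return (input_weight * input_price + output_weight * output_price) / total_weight


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float, output_price: float) -> float:
    """Dollar cost of one request, given prices quoted per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Listed prices: $1 input and $2 output per 1M tokens.
    print(blended_price(1.0, 2.0))
    # Hypothetical request: 12,000 input tokens and 4,000 output tokens.
    print(request_cost(12_000, 4_000, 1.0, 2.0))
```

A workload with a different input-to-output mix can pass its own weights to `blended_price` instead of relying on the 3:1 default.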

Speed

Tokens/sec: 71.1
Time to First Token: 0.41s
Time to Answer: 28.53s
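
As a rough way to use the throughput and first-token figures together, the sketch below estimates response time for a given output length. It assumes a simple linear model (time to first token plus output tokens divided by tokens per second). The page does not state how Time to Answer is measured, so this is not a reconstruction of that metric, and `estimated_response_seconds` is a hypothetical helper name.

```python
def estimated_response_seconds(output_tokens: int,
                               tokens_per_second: float = 71.1,
                               time_to_first_token: float = 0.41) -> float:
    """Estimate wall-clock seconds to stream `output_tokens` tokens.

    Assumption: generation proceeds at a constant rate after the first token.
    Defaults are the speed figures listed above.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return time_to_first_token + output_tokens / tokens_per_second


if __name__ == "__main__":
    # Hypothetical response lengths, for illustration only.
    for n in (100, 500, 2000):
        print(n, round(estimated_response_seconds(n), 2))
```

Real latency also depends on prompt length, provider load, and any reasoning tokens generated before the visible answer, none of which this model captures.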

Provider Price Ranking

10 providers

Cheapest: xAI
Most Expensive: FastRouter
Rank  Provider            Input  Output
1     xAI (primary)       $1     $2
2     NanoGPT             $1     $2
3     OpenRouter          $1     $2
4     ZenMux              $1     $2
5     Kilo Gateway        $1     $2
6     OpenCode Zen        $1     $2
7     Venice AI           $1     $2
8     Vercel AI Gateway   $1     $2
9     LLM Gateway         $1     $2
10    FastRouter          $1     $2

Compare pricing across different API providers for this model.

External Sources