Grok Beta

xAI Grok
Release Date: 2024-08-13
Parameters
Context Length
Modalities

Capability Radar

general: 26
coding: 25
reasoning: 37
science (est.): 32
agents: 33
multimodal: 90

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain           Rank  Score  Source
Code Ranking     317   26.0   AA
General Ranking  337   33.0   AA
Math Reasoning   216   42.0   AA
Science          351   33.0   AA

Benchmark Scores (LLM Stats)

Biology

GPQA: 56.0% (SR)

Code

HumanEval: 88.4% (SR)

Finance

MMLU: 87.5% (SR)
MMLU-Pro: 75.5% (SR)

General

MMMU: 66.1% (SR)

Image To Text

DocVQA: 93.6% (SR)

Math

MATH: 76.1% (SR)
MathVista: 69.0% (SR)

AA Evaluation Indices

Intelligence Index: 7.5
MATH-500: 0.7
MMLU-Pro: 0.7
GPQA: 0.5
SciCode: 0.3
LiveCodeBench: 0.2
AIME: 0.1
HLE: 0.0

LLM Stats Category Scores

Image To Text: 90
Code: 90
Language: 80
Legal: 80
Math: 80
Multimodal: 80
Finance: 80
Healthcare: 80
Vision: 80
Reasoning: 70
General: 70
Physics: 60
Biology: 60
Chemistry: 60

Pricing

Input Price: Free
Output Price: Free
Blended Price (3:1): Free
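
The "Blended Price (3:1)" figure can be read as a weighted average of the input and output prices. The sketch below assumes that 3:1 means three parts input to one part output; the page does not define the ratio or the price unit, and the function name `blended_price` is hypothetical. It is applied to the NanoGPT prices listed under Provider Price Ranking, since the headline prices here are shown as Free.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of input and output prices.

    Assumption: the 3:1 ratio weights input three times as heavily as output.
    """
    total_weight = input_weight + output_weight
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    return (input_weight * input_price + output_weight * output_price) / total_weight


# NanoGPT prices as listed on this page (unit not stated in the source).
nanogpt_blended = blended_price(1.25, 2.50)
print(f"{nanogpt_blended:.4f}")  # 1.5625
```

Under this assumption, a provider with a cheap input price and an expensive output price still shows a blended figure close to its input price, because input tokens carry three quarters of the weight.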

Speed

Tokens/sec: 0.0
Time to First Token: 0.00s
Time to Answer: 0.00s

Provider Price Ranking

1 provider

#  Provider  Input  Output
1  NanoGPT   $1.25  $2.50

Compare pricing across different API providers for this model.

External Sources