Grok Beta
xAI · Grok
Release Date: 2024-08-13
Parameters: not listed
Context Length: not listed
Modalities: not listed
Capability Radar
| Capability | Score |
|---|---|
| General | 26 |
| Coding | 25 |
| Reasoning | 37 |
| Science (est.) | 32 |
| Agents | 33 |
| Multimodal | 90 |
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 317 | 26.0 | AA |
| General Ranking | 337 | 33.0 | AA |
| Math Reasoning | 216 | 42.0 | AA |
| Science | 351 | 33.0 | AA |
Benchmark Scores (LLM Stats)
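| Category | Benchmark | Score |
|---|---|---|
| Biology | GPQA | 56.0% (SR) |
| Code | HumanEval | 88.4% (SR) |
| Finance | MMLU | 87.5% (SR) |
| Finance | MMLU-Pro | 75.5% (SR) |
| General | MMMU | 66.1% (SR) |
| Image To Text | DocVQA | 93.6% (SR) |
| Math | MATH | 76.1% (SR) |
| Math | MathVista | 69.0% (SR) |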
AA Evaluation Indices
| Index | Value |
|---|---|
| Intelligence Index | 7.5 |
| MATH-500 | 0.7 |
| MMLU-Pro | 0.7 |
| GPQA | 0.5 |
| SciCode | 0.3 |
| LiveCodeBench | 0.2 |
| AIME | 0.1 |
| HLE | 0.0 |
LLM Stats Category Scores
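| Category | Score |
|---|---|
| Image To Text | 90 |
| Code | 90 |
| Language | 80 |
| Legal | 80 |
| Math | 80 |
| Multimodal | 80 |
| Finance | 80 |
| Healthcare | 80 |
| Vision | 80 |
| Reasoning | 70 |
| General | 70 |
| Physics | 60 |
| Biology | 60 |
| Chemistry | 60 |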
Pricing
Input Price: Free
Output Price: Free
Blended Price (3:1): Free
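The page does not define how the 3:1 blend is computed. A minimal sketch, assuming it is a weighted average of three parts input price to one part output price; the function name `blended_price` and its parameters are hypothetical, and the sample figures are the NanoGPT prices listed under Provider Price Ranking, in whatever unit that listing uses:

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: int = 3, output_weight: int = 1) -> float:
    """Weighted average of input and output prices.

    Assumption: "3:1" means three parts input to one part output.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# NanoGPT prices as listed on this page: input $1.25, output $2.5
nanogpt_blended = blended_price(1.25, 2.5)
print(f"{nanogpt_blended:.4f}")  # prints 1.5625
```

Under that assumption a model listed as Free for both input and output blends to 0, which is consistent with the Free entry above.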
Speed
Tokens/sec: 0.0
Time to First Token: 0.00s
Time to Answer: 0.00s
Provider Price Ranking
1 provider
| # | Provider | Input | Output |
|---|---|---|---|
| 1 | NanoGPT | $1.25 | $2.50 |