Grok Beta

xAIGrok

Release Date

2024-08-13

Parameters

—

Context Length

—

Modalities

—

Capability Radar

general

coding

reasoning

scienceest.

agents

multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	#Rank	Score	Source
Code Ranking	317	26.0	AA
General Ranking	337	33.0	AA
Math Reasoning	216	42.0	AA
Science	351	33.0	AA

Benchmark Scores (LLM Stats)

Biology

GPQA

56.0%SR

Code

HumanEval

88.4%SR

Finance

MMLU

87.5%SR

MMLU-Pro

75.5%SR

General

MMMU

66.1%SR

Image To Text

DocVQA

93.6%SR

Math

MATH

76.1%SR

MathVista

69.0%SR

AA Evaluation Indices

Intelligence Index

7.5

Math 500

0.7

Mmlu Pro

0.7

Gpqa

0.5

Scicode

0.3

Livecodebench

0.2

Aime

0.1

Hle

0.0

LLM Stats Category Scores

Image To Text

Code

Language

Legal

Math

Multimodal

Finance

Healthcare

Vision

Reasoning

General

Physics

Biology

Chemistry

Pricing

Input PriceFree

Output PriceFree

Blended Price (3:1)Free

Speed

Tokens/sec0.0

Time to First Token0.00s

Time to Answer0.00s

Provider Price Ranking

1 providers

ProviderInputOutput

1NanoGPT

$1.25

$2.5

Compare pricing across different API providers for this model.

External Sources

Artificial Analysis