Grok 4

xAIGrokProprietary

Description

Grok 4, announced by xAI in summer 2025, represents a major leap in AI capabilities, described as 'the smartest AI in the world.' Built on version 6 of xAI's foundation model, it uses 100x more training compute than Grok 2 and 10x more reinforcement learning compute than Grok 3. The model achieves PhD-level performance across all academic disciplines simultaneously, scoring perfect on standardized tests like the SAT and near-perfect on graduate exams like the GRE. Unlike Grok 3, tool usage is built into the training process rather than relying on generalization. Trained using 200,000 GPUs, Grok 4 excels at complex reasoning, mathematical problem-solving, and coding tasks, though it has acknowledged weaknesses in multimodal capabilities that are being addressed in the next version.

Release Date

2025-07-10

Parameters

—

Context Length

—

Modalities

image, text

Capability Radar

general

coding

reasoning

scienceest.

agents

multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	#Rank	Score	Source
Code Ranking	31	80.0	AA
General Ranking	88	68.0	AA
Math Reasoning	11	96.0	AA
Reasoning	108	16.0	LS
Science	51	71.0	AA

Benchmark Scores (LLM Stats)

Biology

GPQA

87.5%SR

Code

LiveCodeBench

79.0%SR

Math

AIME 2025

91.7%SR

HMMT25

90.0%SR

Humanity's Last Exam

40.0%SR

USAMO25

37.5%SR

Reasoning

ARC-AGI v2

15.9%SR

AA Evaluation Indices

Math Index

92.7

Intelligence Index

33.3

Math 500

1.0

Aime

0.9

Aime 25

0.9

Gpqa

0.9

Mmlu Pro

0.9

Livecodebench

0.8

Tau2

0.7

Lcr

0.7

Ifbench

0.5

Scicode

0.5

Terminalbench Hard

0.4

Hle

0.2

LLM Stats Category Scores

Physics

Biology

Chemistry

General

Code

Math

Reasoning

Vision

Spatial Reasoning

Pricing

Input Price$5.5 / 1M tokens

Output Price$27.5 / 1M tokens

Blended Price (3:1)$11 / 1M tokens

Speed

Tokens/sec0.0

Time to First Token0.00s

Time to Answer0.00s

Provider Price Ranking

6 providers

Cheapest: ZenMuxMost Expensive: xAI

ProviderInputOutput

1ZenMuxCheapest

$15

2Poe

$15

3Helicone

$15

4Requesty

$15

5FastRouter

$15

6xAIPRIMARY

$5.5

$27.5

Compare pricing across different API providers for this model.

External Sources

LLM Stats Artificial Analysis