메인 콘텐츠로 건너뛰기

Grok 4

xAIGrokProprietary

설명

Grok 4, announced by xAI in summer 2025, represents a major leap in AI capabilities, described as 'the smartest AI in the world.' Built on version 6 of xAI's foundation model, it uses 100x more training compute than Grok 2 and 10x more reinforcement learning compute than Grok 3. The model achieves PhD-level performance across all academic disciplines simultaneously, scoring perfect on standardized tests like the SAT and near-perfect on graduate exams like the GRE. Unlike Grok 3, tool usage is built into the training process rather than relying on generalization. Trained using 200,000 GPUs, Grok 4 excels at complex reasoning, mathematical problem-solving, and coding tasks, though it has acknowledged weaknesses in multimodal capabilities that are being addressed in the next version.

출시일
2025-07-10
파라미터
컨텍스트 길이
256K
모달리티
file, image, text

능력 레이더

52
general
56
coding
94
reasoning
60
science추정
0
agents
80
multimodal

전용 과학 벤치마크가 없을 때 Science는 추론 프록시를 사용하여 추정합니다.

랭킹

도메인#순위점수소스
Code Ranking37
76.0
AA
General Ranking77
73.0
AA
Math Reasoning11
96.0
AA
Reasoning103
16.0
LS
Science43
74.0
AA

벤치마크 점수 (LLM Stats)

Biology

GPQA87.5%자체 보고

Code

LiveCodeBench79.0%자체 보고

Math

AIME 202591.7%자체 보고
HMMT2590.0%자체 보고
Humanity's Last Exam40.0%자체 보고
USAMO2537.5%자체 보고

Reasoning

ARC-AGI v215.9%자체 보고

AA 평가 지수

Math Index
92.7
Intelligence Index
41.5
Coding Index
40.5
Math 500
1.0
Aime
0.9
Aime 25
0.9
Gpqa
0.9
Mmlu Pro
0.9
Livecodebench
0.8
Tau2
0.7
Lcr
0.7
Ifbench
0.5
Scicode
0.5
Terminalbench Hard
0.4
Hle
0.2

LLM Stats 카테고리 점수

Biology
90
Chemistry
90
Physics
90
Code
80
General
80
Math
60
Reasoning
60
Vision
30
Spatial Reasoning
20

가격

입력 가격$4.25 / 1M tokens
출력 가격$21.25 / 1M tokens
혼합 가격 (3:1)$8.5 / 1M tokens

속도

토큰/초47.2 tokens/s
첫 토큰 지연14.13s
첫 응답 지연14.13s

사용 가능한 프로바이더

(LS 내부 단위)

프로바이더 데이터가 없습니다

외부 링크