Grok 4

xAIGrokProprietary

설명

Grok 4, announced by xAI in summer 2025, represents a major leap in AI capabilities, described as 'the smartest AI in the world.' Built on version 6 of xAI's foundation model, it uses 100x more training compute than Grok 2 and 10x more reinforcement learning compute than Grok 3. The model achieves PhD-level performance across all academic disciplines simultaneously, scoring perfect on standardized tests like the SAT and near-perfect on graduate exams like the GRE. Unlike Grok 3, tool usage is built into the training process rather than relying on generalization. Trained using 200,000 GPUs, Grok 4 excels at complex reasoning, mathematical problem-solving, and coding tasks, though it has acknowledged weaknesses in multimodal capabilities that are being addressed in the next version.

출시일

2025-07-10

파라미터

—

컨텍스트 길이

—

모달리티

image, text

능력 레이더

general

coding

reasoning

science추정

agents

multimodal

전용 과학 벤치마크가 없을 때 Science는 추론 프록시를 사용하여 추정합니다.

랭킹

도메인	#순위	점수	소스
코딩 랭킹	31	80.0	AA
종합 랭킹	88	68.0	AA
수학 추론	11	96.0	AA
추론	108	16.0	LS
과학	51	71.0	AA

벤치마크 점수 (LLM Stats)

Biology

GPQA

87.5%자체 보고

Code

LiveCodeBench

79.0%자체 보고

Math

AIME 2025

91.7%자체 보고

HMMT25

90.0%자체 보고

Humanity's Last Exam

40.0%자체 보고

USAMO25

37.5%자체 보고

Reasoning

ARC-AGI v2

15.9%자체 보고

AA 평가 지수

Math Index

92.7

Intelligence Index

33.3

Math 500

1.0

Aime

0.9

Aime 25

0.9

Gpqa

0.9

Mmlu Pro

0.9

Livecodebench

0.8

Tau2

0.7

Lcr

0.7

Ifbench

0.5

Scicode

0.5

Terminalbench Hard

0.4

Hle

0.2

LLM Stats 카테고리 점수

Physics

Biology

Chemistry

General

Code

Math

Reasoning

Vision

Spatial Reasoning

가격

입력 가격$5.5 / 1M 토큰

출력 가격$27.5 / 1M 토큰

혼합 가격 (3:1)$11 / 1M 토큰

속도

토큰/초0.0

첫 토큰 지연0.00s

첫 응답 지연0.00s

공급자 가격 순위

6개 공급자

최저가: ZenMux최고가: xAI

공급자입력출력

1ZenMux최저가

$15

2Poe

$15

3Helicone

$15

4Requesty

$15

5FastRouter

$15

6xAI주요

$5.5

$27.5

이 모델의 다양한 API 공급자 간 가격 비교.

외부 링크

LLM Stats Artificial Analysis