Grok 4.20 0309 v2 (Reasoning)

xAI · Grok · Proprietary

Description

Grok 4 Heavy is the multi-agent version of Grok 4, released alongside the standard model in summer 2025. The system runs multiple Grok 4 agents in parallel; each works on the problem independently, and the agents then compare their solutions, much like a study group. They share the insights and tricks they discover, and the system combines their work rather than simply taking a majority vote. Grok 4 Heavy uses approximately 10x more test-time compute than regular Grok 4, which lets it solve significantly more complex problems. On Humanity's Last Exam it achieves over 50% accuracy on text-only problems, and it scored a perfect result on the AIME 2025 mathematics competition. The system represents a major advancement in multi-agent AI collaboration and reasoning capabilities.

Release date

2026-04-07

Parameters

—

Context length

1.0M

Modalities

image, pdf, text

Capability radar

general

coding

reasoning

science (estimated)

agents

multimodal

When no dedicated science benchmark is available, Science is estimated using a reasoning proxy.

Rankings

Domain	Rank	Score	Source
Coding	90	69.0	AA
Overall	38	78.0	AA
Science	31	77.0	AA

Benchmark scores (LLM Stats)

Biology

GPQA

88.4% (self-reported)

Code

LiveCodeBench

79.4% (self-reported)

Math

AIME 2025

100.0% (self-reported)

HMMT25

96.7% (self-reported)

USAMO25

61.9% (self-reported)

Humanity's Last Exam

50.7% (self-reported)

AA evaluation indices

Intelligence Index

37.0

Tau2

0.9

GPQA

0.9

IFBench

0.8

LCR

0.6

SciCode

0.5

Terminal-Bench Hard

0.4

HLE

0.3

LLM Stats category scores

Physics

Biology

Chemistry

Math

Reasoning

General

Code

Vision

Pricing

Input price: $2 / 1M tokens

Output price: $6 / 1M tokens

Blended price (3:1): $3 / 1M tokens

Cache read price: $0.2 / 1M tokens
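
The 3:1 blended figure is consistent with a weighted average of three parts input tokens to one part output tokens. A minimal Python sketch of that arithmetic, assuming this weighting is what the listing means (the listing does not spell out the formula, and the function name `blended_price` is hypothetical):

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average price per 1M tokens for a given input:output token mix."""
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


# Listed prices: $2 input and $6 output per 1M tokens.
print(blended_price(2.0, 6.0))  # 3.0
```

Changing `input_parts` and `output_parts` gives a rough per-1M-token cost for workloads with a different input-to-output ratio.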

Speed

Tokens per second: 260.7

First-token latency: 11.59s

First-response latency: 11.59s
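
As a rough illustration of how these figures combine, the end-to-end time for a reply can be approximated as the first-token latency plus the output length divided by throughput. The sketch below assumes the throughput figure is measured after the first token arrives and stays constant, which the listing does not state; `estimate_response_seconds` is a hypothetical helper.

```python
def estimate_response_seconds(output_tokens: int,
                              first_token_latency_s: float = 11.59,
                              tokens_per_second: float = 260.7) -> float:
    """Approximate wall-clock time: wait for the first token, then stream the rest."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return first_token_latency_s + output_tokens / tokens_per_second


# Roughly 15.4 seconds for a 1,000-token reply under these assumptions.
print(round(estimate_response_seconds(1000), 1))
```

Under these assumptions the first-token wait dominates short replies, so the throughput figure matters mainly for long outputs.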

Provider price ranking

8 providers

Lowest price: xAI; highest price: Poe

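Rank	Provider	Input	Output
1	xAI (lowest price)	$1.25	$2.5
2	OpenRouter	$1.25	$2.5
3	Vercel AI Gateway	$1.25	$2.5
4	Venice AI	$1.42	$2.83
5	302.AI	not listed	not listed
6	NanoGPT	not listed	not listed
7	Kilo Gateway	not listed	not listed
8	Poe	not listed	not listed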

Price comparison across API providers for this model.

External links

LLM Stats · Artificial Analysis