Grok 3

xAIGrokProprietary

説明

Grok 3, launched by xAI on February 17, 2025, is an advanced AI model with significantly enhanced capabilities compared to Grok 2, boasting an order of magnitude increase in performance. Trained on a vast dataset that includes legal documents among others, and utilizing a massive compute infrastructure with around 200,000 GPUs in a Memphis data center, Grok 3's training used ten times more compute than its predecessor. It features specialized models like Grok 3 Reasoning and Grok 3 Mini Reasoning for complex problem-solving, and it excels in benchmarks like AIME for mathematics and GPQA for PhD-level science.

リリース日

2025-02-19

パラメータ

—

コンテキスト長

—

モダリティ

image, text

能力レーダー

general

coding

reasoning

science推定

agents

multimodal

専門的な科学ベンチマークが利用できない場合、Scienceは推論プロキシを使用して推定します。

ベンチマークスコア (LLM Stats)

Biology

GPQA

84.6%自己申告

Code

LiveCodeBench

79.4%自己申告

General

MMMU

78.0%自己申告

Math

AIME 2025

93.3%自己申告

AIME 2024

93.3%自己申告

AA評価指数

Math Index

58.0

Intelligence Index

18.4

Math 500

0.9

Mmlu Pro

0.8

Gpqa

0.7

Aime 25

0.6

Lcr

0.5

Tau2

0.5

Ifbench

0.5

Livecodebench

0.4

Scicode

0.4

Aime

0.3

Terminalbench Hard

0.1

Hle

0.1

LLM Statsカテゴリスコア

Math

Reasoning

Multimodal

Physics

General

Healthcare

Biology

Chemistry

Code

Vision

価格設定

入力価格$4 / 1Mトークン

出力価格$20 / 1Mトークン

混合価格（3:1）$8 / 1Mトークン

速度

トークン/秒0.0

初トークン遅延0.00s

初回答遅延0.00s

プロバイダー価格ランキング

3 プロバイダー

最安: xAI最高: Helicone

プロバイダー入力出力

1xAI最安

$0.00002

2Poe

$15

3Helicone

$15

このモデルの異なるAPIプロバイダー間の価格を比較。

外部リンク

LLM Stats Artificial Analysis

ドメイン	#順位	スコア	ソース
コーディングランキング	219	45.0	AA
総合ランキング	200	49.0	AA
数学的推論	150	60.0	AA
科学	230	46.0	AA