Grok 2
xAIGrok
描述
Grok-2 is a frontier language model with state-of-the-art reasoning capabilities, featuring advanced abilities in chat, coding, and reasoning. It demonstrates superior performance in visual math reasoning, document-based question answering, and excels across various academic benchmarks including reasoning, reading comprehension, math, and science.
发布日期
2024-12
参数规模
—
上下文长度
—
支持模态
—
能力雷达图
70
general
90
coding
80
reasoning
51
science估算
83
agents
90
multimodal
Science 在缺少专门科学评测时使用推理能力代理估算。
排行榜排名
暂无排名数据
基准测试分数 (LLM Stats)
Biology
GPQA
56.0%自报
Code
HumanEval
88.4%自报
Finance
MMLU
87.5%自报
MMLU-Pro
75.5%自报
General
MMMU
66.1%自报
Image To Text
DocVQA
93.6%自报
Math
MATH
76.1%自报
MathVista
69.0%自报
AA 评测指数
暂无 AA 评测数据
LLM Stats 分类评分
Image To Text90
Code90
Language80
Legal80
Math80
Multimodal80
Finance80
Healthcare80
Vision80
Reasoning70
General70
Physics60
Biology60
Chemistry60
定价
暂无定价数据
速度
暂无速度数据
供应商价格排行
暂无提供商数据