跳转到主要内容

Grok 2

xAIGrok

描述

Grok-2 is a frontier language model with state-of-the-art reasoning capabilities, featuring advanced abilities in chat, coding, and reasoning. It demonstrates superior performance in visual math reasoning, document-based question answering, and excels across various academic benchmarks including reasoning, reading comprehension, math, and science.

发布日期
2024-12
参数规模
上下文长度
支持模态

能力雷达图

70
general
90
coding
80
reasoning
51
science估算
83
agents
90
multimodal

Science 在缺少专门科学评测时使用推理能力代理估算。

排行榜排名

暂无排名数据

基准测试分数 (LLM Stats)

Biology

GPQA56.0%自报

Code

HumanEval88.4%自报

Finance

MMLU87.5%自报
MMLU-Pro75.5%自报

General

MMMU66.1%自报

Image To Text

DocVQA93.6%自报

Math

MATH76.1%自报
MathVista69.0%自报

AA 评测指数

暂无 AA 评测数据

LLM Stats 分类评分

Image To Text
90
Code
90
Language
80
Legal
80
Math
80
Multimodal
80
Finance
80
Healthcare
80
Vision
80
Reasoning
70
General
70
Physics
60
Biology
60
Chemistry
60

定价

暂无定价数据

速度

暂无速度数据

供应商价格排行

暂无提供商数据

外部链接