Grok 2

xAIGrok

描述

Grok-2 is a frontier language model with state-of-the-art reasoning capabilities, featuring advanced abilities in chat, coding, and reasoning. It demonstrates superior performance in visual math reasoning, document-based question answering, and excels across various academic benchmarks including reasoning, reading comprehension, math, and science.

发布日期

2024-12

参数规模

—

上下文长度

—

支持模态

—

能力雷达图

general

coding

reasoning

science估算

agents

multimodal

Science 在缺少专门科学评测时使用推理能力代理估算。

排行榜排名

暂无排名数据

基准测试分数 (LLM Stats)

Biology

GPQA

56.0%自报

Code

HumanEval

88.4%自报

Finance

MMLU

87.5%自报

MMLU-Pro

75.5%自报

General

MMMU

66.1%自报

Image To Text

DocVQA

93.6%自报

Math

MATH

76.1%自报

MathVista

69.0%自报

AA 评测指数

暂无 AA 评测数据

LLM Stats 分类评分

Image To Text

Code

Language

Legal

Math

Multimodal

Finance

Healthcare

Vision

Reasoning

General

Physics

Biology

Chemistry

定价

暂无定价数据

速度

暂无速度数据

供应商价格排行

暂无提供商数据

外部链接

Artificial Analysis