Grok-1.5

xAIGrokProprietary

描述

An advanced language model with improved reasoning capabilities, particularly excelling in coding and mathematical tasks. Features a 128K token context window and enhanced problem-solving abilities compared to its predecessor.

发布日期

2024-03-28

参数规模

—

上下文长度

—

支持模态

—

能力雷达图

general

coding

reasoning

science估算

agents

multimodal

Science 在缺少专门科学评测时使用推理能力代理估算。

排行榜排名

领域	#排名	分数	来源
多模态榜	18	86.0	LS

基准测试分数 (LLM Stats)

Biology

GPQA

35.9%自报

Code

HumanEval

74.1%自报

Finance

MMLU

81.3%自报

MMLU-Pro

51.0%自报

General

MMMU

53.6%自报

Image To Text

DocVQA

85.6%自报

Math

GSM8k

90.0%自报

MathVista

52.8%自报

MATH

50.6%自报

AA 评测指数

暂无 AA 评测数据

LLM Stats 分类评分

Image To Text

Code

Finance

Language

Legal

Math

Vision

General

Healthcare

Multimodal

Reasoning

Biology

Chemistry

Physics

定价

暂无定价数据

速度

暂无速度数据

可用提供商

(LS 内部计价单位)

暂无提供商数据

外部链接

LLM Stats