Grok-1.5

xAIGrokProprietary

描述

An advanced language model with improved reasoning capabilities, particularly excelling in coding and mathematical tasks. Features a 128K token context window and enhanced problem-solving abilities compared to its predecessor.

發布日期

2024-03-28

參數規模

—

上下文長度

—

支援模態

—

能力雷達圖

general

coding

reasoning

science估算

agents

multimodal

Science 在缺少專門科學評測時使用推理能力代理估算。

排行榜排名

領域	#排名	分數	來源
多模态榜	18	86.0	LS

基準測試分數 (LLM Stats)

Biology

GPQA

35.9%自報

Code

HumanEval

74.1%自報

Finance

MMLU

81.3%自報

MMLU-Pro

51.0%自報

General

MMMU

53.6%自報

Image To Text

DocVQA

85.6%自報

Math

GSM8k

90.0%自報

MathVista

52.8%自報

MATH

50.6%自報

AA 評測指數

暫無 AA 評測資料

LLM Stats 分類評分

Image To Text

Code

Finance

Language

Legal

Math

Vision

General

Healthcare

Multimodal

Reasoning

Biology

Chemistry

Physics

定價

暫無定價資料

速度

暫無速度資料

可用提供商

(LS 內部計價單位)

暫無提供商資料

外部連結

LLM Stats