Grok-1.5

xAIGrokProprietary

Descripción

An advanced language model with improved reasoning capabilities, particularly excelling in coding and mathematical tasks. Features a 128K token context window and enhanced problem-solving abilities compared to its predecessor.

Fecha de lanzamiento

2024-03-28

Parámetros

—

Longitud del contexto

—

Modalidades

—

Radar de capacidades

general

coding

reasoning

scienceest.

agents

multimodal

Science usa un proxy de razonamiento cuando los benchmarks científicos dedicados no están disponibles.

Rankings

Dominio	#Posición	Puntuación	Fuente
Multimodal Ranking	18	86.0	LS

Puntuaciones de benchmarks (LLM Stats)

Biology

GPQA

35.9%Aut.

Code

HumanEval

74.1%Aut.

Finance

MMLU

81.3%Aut.

MMLU-Pro

51.0%Aut.

General

MMMU

53.6%Aut.

Image To Text

DocVQA

85.6%Aut.

Math

GSM8k

90.0%Aut.

MathVista

52.8%Aut.

MATH

50.6%Aut.

Índices de evaluación AA

No hay datos de evaluación AA disponibles

Puntuaciones por categoría LLM Stats

Image To Text

Code

Finance

Language

Legal

Math

Vision

General

Healthcare

Multimodal

Reasoning

Biology

Chemistry

Physics

Precios

No hay datos de precios disponibles

Velocidad

No hay datos de velocidad disponibles

Proveedores disponibles

(Unidades internas LS)

No hay datos de proveedores disponibles

Fuentes externas

LLM Stats