Grok-1.5

xAIGrokProprietary

Description

An advanced language model with improved reasoning capabilities, particularly excelling in coding and mathematical tasks. Features a 128K token context window and enhanced problem-solving abilities compared to its predecessor.

Release Date

2024-03-28

Parameters

—

Context Length

—

Modalities

—

Capability Radar

general

coding

reasoning

scienceest.

agents

multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	#Rank	Score	Source
Multimodal Ranking	18	86.0	LS

Benchmark Scores (LLM Stats)

Biology

GPQA

35.9%SR

Code

HumanEval

74.1%SR

Finance

MMLU

81.3%SR

MMLU-Pro

51.0%SR

General

MMMU

53.6%SR

Image To Text

DocVQA

85.6%SR

Math

GSM8k

90.0%SR

MathVista

52.8%SR

MATH

50.6%SR

AA Evaluation Indices

No AA evaluation data available

LLM Stats Category Scores

Image To Text

Code

Finance

Language

Legal

Math

Vision

General

Healthcare

Multimodal

Reasoning

Biology

Chemistry

Physics

Pricing

No pricing data available

Speed

No speed data available

Available Providers

(LS internal units)

No provider data available

External Sources

LLM Stats