Grok-1.5
xAI · Grok · Proprietary
Description
Grok-1.5 is a language model with improved reasoning capabilities, performing particularly well on coding and mathematical tasks. It has a 128K-token context window and stronger problem-solving abilities than its predecessor.
Release Date
2024-03-28
Parameters
—
Context Length
128K tokens
Modalities
—
Capability Radar
| Capability | Score |
|---|---|
| General | 60 |
| Coding | 70 |
| Reasoning | 70 |
| Science (est.) | 34 |
| Agents | 0 |
| Multimodal | 90 |
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Multimodal Ranking | 18 | 86.0 | LS |
Benchmark Scores (LLM Stats)
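| Category | Benchmark | Score | Source |
|---|---|---|---|
| Biology | GPQA | 35.9% | SR |
| Code | HumanEval | 74.1% | SR |
| Finance | MMLU | 81.3% | SR |
| Finance | MMLU-Pro | 51.0% | SR |
| General | MMMU | 53.6% | SR |
| Image To Text | DocVQA | 85.6% | SR |
| Math | GSM8k | 90.0% | SR |
| Math | MathVista | 52.8% | SR |
| Math | MATH | 50.6% | SR |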
AA Evaluation Indices
No AA evaluation data available
LLM Stats Category Scores
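| Category | Score |
|---|---|
| Image To Text | 90 |
| Code | 70 |
| Finance | 70 |
| Language | 70 |
| Legal | 70 |
| Math | 70 |
| Vision | 60 |
| General | 60 |
| Healthcare | 60 |
| Multimodal | 60 |
| Reasoning | 60 |
| Biology | 40 |
| Chemistry | 40 |
| Physics | 40 |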
Pricing
No pricing data available
Speed
No speed data available
Available Providers
(LS internal units) No provider data available