Gemini 1.0 Ultra

Google / Gemini
Release Date: 2023-12-06
Parameters: (not listed)
Context Length: 1.0M
Modalities: audio, image, pdf, text, video

Capability Radar

general: 5
coding: 18
reasoning: 80
science (est.): 77
agents: 60
multimodal: 80

The science score (marked "est.") uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain           Rank  Score  Source
Code Ranking     347   22.0   AA
General Ranking  522   6.0    AA

Benchmark Scores (LLM Stats)

Agents

Vending-Bench 2: 363500.0% (SR)
t2-bench: 90.2% (SR)
MCP Atlas: 57.4% (SR)
Toolathlon: 49.4% (SR)
Terminal-Bench 2.0: 47.6% (SR)
Finance Agent v2: 42.5% (SR)
Legal Agent Benchmark: 0.0% (SR)

Biology

GPQA: 90.4% (SR)

Code

LiveCodeBench Pro: 2316.00 / 3000 (SR)
SWE-Bench Verified: 78.0% (SR)

Factuality

SimpleQA: 68.7% (SR)
FACTS Grounding: 61.9% (SR)

General

Global PIQA: 92.8% (SR)
MMMLU: 91.8% (SR)
MMMU-Pro: 81.2% (SR)
LiveBench: 72.4% (SR)
MRCR v2 (8-needle): 22.1% (SR)

Grounding

ScreenSpot Pro: 69.1% (SR)

Healthcare

VideoMMMU: 86.9% (SR)

Math

AIME 2025: 99.7% (SR)
Humanity's Last Exam: 43.5% (SR)

Multimodal

CharXiv-R: 80.3% (SR)
OmniDocBench 1.5: 12.1% (SR)

Reasoning

ARC-AGI v2: 33.6% (SR)

AA Evaluation Indices

Coding Index: 17.6
Intelligence Index: 4.6

LLM Stats Category Scores

Code: 100
Agents: 100
General: 100
Reasoning: 100
Language: 90
Physics: 90
Biology: 90
Chemistry: 90
Math: 80
Frontend Development: 80
Multimodal: 70
Factuality: 70
Grounding: 70
Tool Calling: 60
Vision: 60
Spatial Reasoning: 50
Healthcare: 50
Finance: 40
Long Context: 20
Structured Output: 10
Legal: 0

Pricing

Input Price: Free
Output Price: Free
Blended Price (3:1): Free
Cache Read Price: $0.05 / 1M tokens
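
The page does not define how the "Blended Price (3:1)" figure is computed. A common convention, assumed here and not confirmed by this page, is a weighted average of the input and output prices with three parts input to one part output. The function name `blended_price` and the non-zero example prices below are hypothetical and used only for illustration.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of per-1M-token prices.

    Assumption: "3:1" means three parts input to one part output.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Free input and free output, as listed above, blend to zero.
print(blended_price(0.0, 0.0))  # 0.0

# Hypothetical prices in dollars per 1M tokens: (3 * 1.00 + 1 * 5.00) / 4
print(blended_price(1.0, 5.0))  # 2.0
```

Under this assumption, a listing with free input and free output always has a free blended price, which matches the row above. The cache read price is not part of the blend.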

Speed

Tokens/sec: 0.0
Time to First Token: 0.00s
Time to Answer: 0.00s

Provider Price Ranking

No provider data available