Skip to main content

Gemini 2.0 Pro Experimental (Feb '25)

GoogleGemini
Release Date
2025-02-05
Parameters
Context Length
1.0M
Modalities
audio, image, pdf, text, video

Capability Radar

32
general
29
coding
58
reasoning
40
scienceest.
60
agents
80
multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain#RankScoreSource
Code Ranking264
35.0
AA
General Ranking252
42.0
AA
Math Reasoning138
65.0
AA
Science267
42.0
AA

Benchmark Scores (LLM Stats)

Agents

Vending-Bench 2363500.0%SR
t2-bench90.2%SR
MCP Atlas57.4%SR
Toolathlon49.4%SR
Terminal-Bench 2.047.6%SR
Finance Agent v242.5%SR
Legal Agent Benchmark0.0%SR

Biology

GPQA90.4%SR

Code

LiveCodeBench Pro2316.00 / 3000SR
SWE-Bench Verified78.0%SR

Factuality

SimpleQA68.7%SR
FACTS Grounding61.9%SR

General

Global PIQA92.8%SR
MMMLU91.8%SR
MMMU-Pro81.2%SR
LiveBench72.4%SR
MRCR v2 (8-needle)22.1%SR

Grounding

ScreenSpot Pro69.1%SR

Healthcare

VideoMMMU86.9%SR

Math

AIME 202599.7%SR
Humanity's Last Exam43.5%SR

Multimodal

CharXiv-R80.3%SR
OmniDocBench 1.512.1%SR

Reasoning

ARC-AGI v233.6%SR

AA Evaluation Indices

Coding Index
25.5
Intelligence Index
11.8
Math 500
0.9
Mmlu Pro
0.8
Gpqa
0.6
Aime
0.4
Livecodebench
0.3
Scicode
0.3
Hle
0.1

LLM Stats Category Scores

Code
100
Agents
100
General
100
Reasoning
100
Language
90
Physics
90
Biology
90
Chemistry
90
Math
80
Frontend Development
80
Multimodal
70
Factuality
70
Grounding
70
Tool Calling
60
Vision
60
Spatial Reasoning
50
Healthcare
50
Finance
40
Long Context
20
Structured Output
10
Legal
0

Pricing

Input PriceFree
Output PriceFree
Blended Price (3:1)Free
Cache Read Price$0.05 / 1M tokens

Speed

Tokens/sec0.0
Time to First Token0.00s
Time to Answer0.00s

Provider Price Ranking

No provider data available

External Sources