Gemini 1.5 Pro (May '24)
Google · Gemini
Description
Gemini 1.5 Pro is a mid-size multimodal model optimized for a wide range of reasoning tasks. It can process large amounts of data at once, including 2 hours of video, 19 hours of audio, codebases with 60,000 lines of code, or 2,000 pages of text.
Release Date
2024-05-15
Parameters
—
Context Length
—
Modalities
—
Capability Radar
| Capability | Score |
|---|---|
| General | 24 |
| Coding | 22 |
| Reasoning | 32 |
| Science (est.) | 27 |
| Agents | 29 |
| Multimodal | 80 |
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 322 | 25.0 | AA |
| General Ranking | 369 | 30.0 | AA |
| Math Reasoning | 238 | 37.0 | AA |
| Multimodal Ranking | 37 | 79.0 | LS |
| Reasoning | 4 | 93.0 | LS |
| Science | 393 | 28.0 | AA |
Benchmark Scores (LLM Stats)
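| Category | Benchmark | Score | Source |
|---|---|---|---|
| Biology | GPQA | 59.1% | SR |
| Code | HumanEval | 84.1% | SR |
| Finance | MMLU | 85.9% | SR |
| Finance | MMLU-Pro | 75.8% | SR |
| General | Natural2Code | 85.4% | SR |
| General | MRCR | 82.6% | SR |
| General | MMMU | 65.9% | SR |
| General | Vibe-Eval | 53.9% | SR |
| Healthcare | WMT23 | 75.1% | SR |
| Language | FLEURS | 93.3% | SR |
| Language | BIG-Bench Hard | 89.2% | SR |
| Math | GSM8k | 90.8% | SR |
| Math | MGSM | 87.5% | SR |
| Math | MATH | 86.5% | SR |
| Math | DROP | 74.9% | SR |
| Math | MathVista | 68.1% | SR |
| Math | FunctionalMATH | 64.6% | SR |
| Math | PhysicsFinals | 63.9% | SR |
| Math | HiddenMath | 52.0% | SR |
| Math | AMC_2022_23 | 46.4% | SR |
| Multimodal | Video-MME | 78.6% | SR |
| Reasoning | HellaSwag | 93.3% | SR |
| Safety | XSTest | 98.8% | SR |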
AA Evaluation Indices
| Index | Value |
|---|---|
| Coding Index | 19.8 |
| Intelligence Index | 6.3 |
| Math 500 | 0.7 |
| MMLU-Pro | 0.7 |
| GPQA | 0.4 |
| SciCode | 0.3 |
| LiveCodeBench | 0.2 |
| AIME | 0.1 |
| HLE | 0.0 |
LLM Stats Category Scores
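| Category | Score |
|---|---|
| Safety | 100 |
| Speech To Text | 90 |
| Language | 80 |
| Legal | 80 |
| Long Context | 80 |
| Math | 80 |
| Reasoning | 80 |
| Finance | 80 |
| Healthcare | 80 |
| Code | 80 |
| Multimodal | 70 |
| General | 70 |
| Vision | 70 |
| Physics | 60 |
| Biology | 60 |
| Chemistry | 60 |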
Pricing
| Item | Price |
|---|---|
| Input Price | Free |
| Output Price | Free |
| Blended Price (3:1) | Free |
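The "Blended Price (3:1)" label suggests a weighted average of input and output token prices at a 3:1 input-to-output ratio. The source does not state the formula, so the following is a minimal sketch under that assumption. The function name `blended_price` and the non-zero example prices are hypothetical and are not taken from this page.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of input and output token prices.

    Assumption: "3:1" means three input tokens for every output token.
    Prices can be in any unit, as long as both use the same one.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# A free model: both prices are zero, so the blend is zero as well.
free_blend = blended_price(0.0, 0.0)

# Hypothetical paid model: (3 * 1.0 + 1 * 5.0) / 4 = 2.0
paid_blend = blended_price(1.0, 5.0)

print(free_blend, paid_blend)
```

Under this reading, a model listed as free for both input and output necessarily has a blended price of zero, which would be consistent with the "Free" entry shown here.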
Speed
| Metric | Value |
|---|---|
| Tokens/sec | 0.0 |
| Time to First Token | 0.00s |
| Time to Answer | 0.00s |
Provider Price Ranking
No provider data available