Gemini 1.5 Pro (May '24)

GoogleGemini

Description

Gemini 1.5 Pro is a mid-size multimodal model optimized for a wide range of reasoning tasks. It can process large amounts of data at once, including 2 hours of video, 19 hours of audio, codebases with 60,000 lines of code, or 2,000 pages of text.

Release Date

2024-05-15

Parameters

—

Context Length

—

Modalities

—

Capability Radar

general

coding

reasoning

scienceest.

agents

multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	#Rank	Score	Source
Code Ranking	322	25.0	AA
General Ranking	369	30.0	AA
Math Reasoning	238	37.0	AA
Multimodal Ranking	37	79.0	LS
Reasoning	4	93.0	LS
Science	393	28.0	AA

Benchmark Scores (LLM Stats)

Biology

GPQA

59.1%SR

Code

HumanEval

84.1%SR

Finance

MMLU

85.9%SR

MMLU-Pro

75.8%SR

General

Natural2Code

85.4%SR

MRCR

82.6%SR

MMMU

65.9%SR

Vibe-Eval

53.9%SR

Healthcare

WMT23

75.1%SR

Language

FLEURS

93.3%SR

BIG-Bench Hard

89.2%SR

Math

GSM8k

90.8%SR

MGSM

87.5%SR

MATH

86.5%SR

DROP

74.9%SR

MathVista

68.1%SR

FunctionalMATH

64.6%SR

PhysicsFinals

63.9%SR

HiddenMath

52.0%SR

AMC_2022_23

46.4%SR

Multimodal

Video-MME

78.6%SR

Reasoning

HellaSwag

93.3%SR

Safety

XSTest

98.8%SR

AA Evaluation Indices

Coding Index

19.8

Intelligence Index

6.3

Math 500

0.7

Mmlu Pro

0.7

Gpqa

0.4

Scicode

0.3

Livecodebench

0.2

Aime

0.1

Hle

0.0

LLM Stats Category Scores

Safety

100

Speech To Text

Language

Legal

Long Context

Math

Reasoning

Finance

Healthcare

Code

Multimodal

General

Vision

Physics

Biology

Chemistry

Pricing

Input PriceFree

Output PriceFree

Blended Price (3:1)Free

Speed

Tokens/sec0.0

Time to First Token0.00s

Time to Answer0.00s

Provider Price Ranking

No provider data available

External Sources

Artificial Analysis