Gemini 1.5 Pro (Sep '24)

Google · Gemini · Proprietary

Description

Gemini 1.5 Pro is a mid-size multimodal model optimized for a wide range of reasoning tasks. It can process large amounts of data at once, including 2 hours of video, 19 hours of audio, codebases with 60,000 lines of code, or 2,000 pages of text.

Release date

2024-09-24

Parameters

—

Context length

—

Modalities

image, text

Capability radar

Axes: general, coding, reasoning, science (estimated), agents, multimodal

When no dedicated science benchmark is available, the Science score is estimated using a reasoning proxy.

Benchmark scores (LLM Stats)

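Category	Benchmark	Score	Source
Biology	GPQA	59.1%	Self-reported
Code	HumanEval	84.1%	Self-reported
Finance	MMLU	85.9%	Self-reported
Finance	MMLU-Pro	75.8%	Self-reported
General	Natural2Code	85.4%	Self-reported
General	MRCR	82.6%	Self-reported
General	MMMU	65.9%	Self-reported
General	Vibe-Eval	53.9%	Self-reported
Healthcare	WMT23	75.1%	Self-reported
Language	FLEURS	93.3%	Self-reported
Language	BIG-Bench Hard	89.2%	Self-reported
Math	GSM8k	90.8%	Self-reported
Math	MGSM	87.5%	Self-reported
Math	MATH	86.5%	Self-reported
Math	DROP	74.9%	Self-reported
Math	MathVista	68.1%	Self-reported
Math	FunctionalMATH	64.6%	Self-reported
Math	PhysicsFinals	63.9%	Self-reported
Math	HiddenMath	52.0%	Self-reported
Math	AMC_2022_23	46.4%	Self-reported
Multimodal	Video-MME	78.6%	Self-reported
Reasoning	HellaSwag	93.3%	Self-reported
Safety	XSTest	98.8%	Self-reported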

AA (Artificial Analysis) evaluation indices

Metric	Value
Coding Index	23.6
Intelligence Index	10.0
MATH-500	0.9
MMLU-Pro	0.8
GPQA	0.6
LiveCodeBench	0.3
SciCode	0.3
AIME	0.2
HLE	0.0

LLM Stats category scores

Safety: 100

Categories listed without a score: Speech To Text, Language, Legal, Long Context, Math, Reasoning, Finance, Healthcare, Code, Multimodal, General, Vision, Physics, Biology, Chemistry

Pricing

Input price: Free

Output price: Free

Blended price (3:1): Free
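A 3:1 blended price is usually read as a weighted average that counts three input tokens for every output token. The sketch below assumes that reading; the function name `blended_price`, its parameters, and the non-zero example prices are hypothetical and do not come from this page.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Return the weighted-average price per unit of tokens.

    Assumption: "3:1" means three parts input to one part output.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Prices listed on this card: input and output are both free.
print(blended_price(0.0, 0.0))

# Hypothetical prices, only to show the weighting: (3 * 1.0 + 1 * 5.0) / 4
print(blended_price(1.0, 5.0))
```

Under this assumption, a model whose input and output are both free also has a blended price of zero, which matches the "Free" entry listed here.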

Speed

Tokens per second: 0.0

Time to first token: 0.00s

Time to first answer: 0.00s

Provider price ranking

No provider data available.

External links

LLM Stats · Artificial Analysis

Domain	Rank	Score	Source
Coding ranking	283	31.0	AA
Overall ranking	291	37.0	AA
Mathematical reasoning	162	56.0	AA
Science	306	38.0	AA