Gemini 2.5 Flash Preview (Sep '25) (Reasoning)

GoogleGemini

描述

A thinking model designed for a balance between price and performance. It builds upon Gemini 2.0 Flash with upgraded reasoning, hybrid thinking control, multimodal capabilities (text, image, video, audio input), and a 1M token input context window.

發布日期

2025-09-25

參數規模

—

上下文長度

1.0M

支援模態

audio, image, pdf, text, video

能力雷達圖

general

coding

reasoning

science估算

agents

multimodal

Science 在缺少專門科學評測時使用推理能力代理估算。

排行榜排名

領域	#排名	分數	來源
程式碼能力榜	114	63.0	AA
通用能力榜	160	55.0	AA
數學推理	91	79.0	AA
科學能力	127	57.0	AA

基準測試分數 (LLM Stats)

Biology

GPQA

82.8%自報

Code

Aider-Polyglot

61.9%自報

SWE-Bench Verified

60.4%自報

Aider-Polyglot Edit

56.7%自報

Factuality

FACTS Grounding

85.3%自報

SimpleQA

26.9%自報

General

Global-MMLU-Lite

88.4%自報

MMMU

79.7%自報

Vibe-Eval

65.4%自報

LiveCodeBench v5

63.9%自報

MRCR

32.0%自報

Math

AIME 2024

88.0%自報

AIME 2025

72.0%自報

Humanity's Last Exam

11.0%自報

AA 評測指數

Math Index

78.3

Intelligence Index

23.8

Mmlu Pro

0.8

Gpqa

0.8

Aime 25

0.8

Livecodebench

0.7

Lcr

0.6

Ifbench

0.5

Tau2

0.5

Scicode

0.4

Terminalbench Hard

0.2

Hle

0.1

LLM Stats 分類評分

Language

Grounding

Physics

Healthcare

Biology

Chemistry

Multimodal

Math

Reasoning

Factuality

Frontend Development

General

Code

Vision

Long Context

定價

輸入價格免費

輸出價格免費

混合價格(3:1)免費

快取讀取價格$0.03 / 1M tokens

速度

Tokens/秒0.0

首Token延遲0.00s

首回答延遲0.00s

供應商價格排行

暫無提供商資料

外部連結

Artificial Analysis