Skip to main content

Gemini 2.5 Flash-Lite (Non-reasoning)

GoogleGeminiOpen WeightCreative Commons Attribution 4.0 License

Description

Gemini 2.5 Flash-Lite is a model developed by Google DeepMind, designed to handle various tasks including reasoning, science, mathematics, code generation, and more. It features advanced capabilities in multilingual performance and long context understanding. It is optimized for low latency use cases, supporting multimodal input with a 1 million-token context length.

Release Date
2025-06-17
Parameters
Context Length
1.0M
Modalities
audio, file, image, text, video

Capability Radar

29
general
20
coding
49
reasoning
28
scienceest.
0
agents
89
multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain#RankScoreSource
Audio23
71.0
AA
Code Ranking312
23.0
AA
General Ranking361
31.0
AA
Math Reasoning179
51.0
AA
Science379
27.0
AA

Benchmark Scores (LLM Stats)

Biology

GPQA64.6%SR

Code

LiveCodeBench33.7%SR
SWE-Bench Verified31.6%SR
Aider-Polyglot26.7%SR

Factuality

FACTS Grounding84.1%SR
SimpleQA10.7%SR

General

Global-MMLU-Lite81.1%SR
MMMU72.9%SR
Vibe-Eval51.3%SR
MRCR v216.6%SR
Arc2.5%SR

Math

AIME 202549.8%SR
Humanity's Last Exam5.1%SR

AA Evaluation Indices

Math Index
35.3
Intelligence Index
12.7
Coding Index
7.4
Math 500
0.9
Mmlu Pro
0.7
Aime
0.5
Gpqa
0.5
Livecodebench
0.4
Aime 25
0.4
Ifbench
0.3
Lcr
0.3
Tau2
0.2
Scicode
0.2
Hle
0.0
Terminalbench Hard
0.0

LLM Stats Category Scores

Grounding
80
Language
80
Healthcare
70
Biology
60
Chemistry
60
Multimodal
60
Physics
60
Factuality
50
Reasoning
50
Vision
40
General
40
Code
30
Frontend Development
30
Math
30
Long Context
20

Pricing

Input Price$0.1 / 1M tokens
Output Price$0.4 / 1M tokens
Blended Price (3:1)$0.175 / 1M tokens

Speed

Tokens/sec267.1 tokens/s
Time to First Token0.53s
Time to Answer0.53s

Available Providers

(LS internal units)

No provider data available

External Sources