Gemini 2.5 Flash-Lite (Non-reasoning)
Google | Gemini | Open Weight | Creative Commons Attribution 4.0 License
Description
Gemini 2.5 Flash-Lite is a model developed by Google DeepMind for tasks including reasoning, science, mathematics, and code generation. It offers strong multilingual performance and long-context understanding. It is optimized for low-latency use cases and supports multimodal input with a 1 million-token context length.
Release Date
2025-06-17
Parameters
—
Context Length
1.0M
Modalities
audio, file, image, text, video
Capability Radar
| Capability | Score |
|---|---|
| general | 29 |
| coding | 20 |
| reasoning | 49 |
| science (est.) | 28 |
| agents | 0 |
| multimodal | 89 |
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Audio | 23 | 71.0 | AA |
| Code Ranking | 312 | 23.0 | AA |
| General Ranking | 361 | 31.0 | AA |
| Math Reasoning | 179 | 51.0 | AA |
| Science | 379 | 27.0 | AA |
Benchmark Scores (LLM Stats)
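| Category | Benchmark | Score | Source |
|---|---|---|---|
| Biology | GPQA | 64.6% | SR |
| Code | LiveCodeBench | 33.7% | SR |
| Code | SWE-Bench Verified | 31.6% | SR |
| Code | Aider-Polyglot | 26.7% | SR |
| Factuality | FACTS Grounding | 84.1% | SR |
| Factuality | SimpleQA | 10.7% | SR |
| General | Global-MMLU-Lite | 81.1% | SR |
| General | MMMU | 72.9% | SR |
| General | Vibe-Eval | 51.3% | SR |
| General | MRCR v2 | 16.6% | SR |
| General | Arc | 2.5% | SR |
| Math | AIME 2025 | 49.8% | SR |
| Math | Humanity's Last Exam | 5.1% | SR |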
AA Evaluation Indices
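| Index | Value |
|---|---|
| Math Index | 35.3 |
| Intelligence Index | 12.7 |
| Coding Index | 7.4 |
| Math 500 | 0.9 |
| Mmlu Pro | 0.7 |
| Aime | 0.5 |
| Gpqa | 0.5 |
| Livecodebench | 0.4 |
| Aime 25 | 0.4 |
| Ifbench | 0.3 |
| Lcr | 0.3 |
| Tau2 | 0.2 |
| Scicode | 0.2 |
| Hle | 0.0 |
| Terminalbench Hard | 0.0 |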
LLM Stats Category Scores
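| Category | Score |
|---|---|
| Grounding | 80 |
| Language | 80 |
| Healthcare | 70 |
| Biology | 60 |
| Chemistry | 60 |
| Multimodal | 60 |
| Physics | 60 |
| Factuality | 50 |
| Reasoning | 50 |
| Vision | 40 |
| General | 40 |
| Code | 30 |
| Frontend Development | 30 |
| Math | 30 |
| Long Context | 20 |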
Pricing
| Item | Price |
|---|---|
| Input Price | $0.1 / 1M tokens |
| Output Price | $0.4 / 1M tokens |
| Blended Price (3:1) | $0.175 / 1M tokens |
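The blended figure is consistent with a weighted average of the input and output prices at a 3:1 input-to-output token ratio. The sketch below reproduces that arithmetic; the function name `blended_price` and its parameters are illustrative assumptions, not part of any provider API.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens (hypothetical helper).

    Assumes the "3:1" label means three input tokens for every output token.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Listed prices in USD per 1M tokens.
price = blended_price(input_price=0.10, output_price=0.40)
print(f"${price:.3f} / 1M tokens")
```

With the listed prices this gives (3 × 0.1 + 1 × 0.4) / 4 = 0.175, matching the blended value shown.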
Speed
| Metric | Value |
|---|---|
| Tokens/sec | 267.1 tokens/s |
| Time to First Token | 0.53s |
| Time to Answer | 0.53s |
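As a rough way to read these figures together, total response time can be approximated as the time to first token plus the output length divided by throughput. This is a simplified sketch that assumes a steady decoding rate and ignores network and queueing effects; the function name `estimated_response_seconds` is a hypothetical helper.

```python
def estimated_response_seconds(output_tokens: int,
                               ttft_s: float = 0.53,
                               tokens_per_s: float = 267.1) -> float:
    """Approximate end-to-end time for a response (illustrative model only).

    Defaults are the listed Time to First Token and Tokens/sec values.
    """
    if tokens_per_s <= 0:
        raise ValueError("tokens_per_s must be positive")
    return ttft_s + output_tokens / tokens_per_s


# Example: a 500-token completion under the steady-rate assumption.
estimate = estimated_response_seconds(500)
print(f"about {estimate:.1f} s for 500 output tokens")
```

Measured latency will vary by provider, prompt size, and load, so treat the result as an order-of-magnitude estimate.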
Available Providers
(LS internal units) No provider data available