Gemini 3.1 Flash-Lite Preview
Google · Gemini · Proprietary
Description
Gemini 3.1 Flash-Lite is the first Flash-Lite model in the Gemini 3 series. It is optimized for high-volume, latency-sensitive tasks such as translation, content moderation, and classification. It delivers enhanced performance at a fraction of the cost of larger models, with a 2.5x faster Time to First Answer Token and 45% higher output speed than Gemini 2.5 Flash. It accepts text, image, video, audio, and PDF input and has a 1 million-token context window.
Release Date
2026-03-03
Parameters
—
Context Length
1.0M
Modalities
audio, file, image, text, video
Capability Radar
| Capability | Score |
|---|---|
| general | 30 |
| coding | 32 |
| reasoning | 82 |
| science (est.) | 55 |
| agents | 0 |
| multimodal | 80 |
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 113 | 57.0 | AA |
| General Ranking | 155 | 57.0 | AA |
| Multimodal Ranking | 55 | 73.0 | LS |
| Science | 80 | 64.0 | AA |
Benchmark Scores (LLM Stats)
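| Category | Benchmark | Score |
|---|---|---|
| Biology | GPQA | 86.9% SR |
| Factuality | SimpleQA | 43.3% SR |
| Factuality | FACTS Grounding | 40.6% SR |
| General | MMMLU | 88.9% SR |
| General | MMMU-Pro | 76.8% SR |
| General | MRCR v2 (8-needle) | 60.1% SR |
| Healthcare | VideoMMMU | 84.8% SR |
| Math | Humanity's Last Exam | 16.0% SR |
| Multimodal | CharXiv-R | 73.2% SR |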
AA Evaluation Indices
| Index | Score |
|---|---|
| Intelligence Index | 33.5 |
| Coding Index | 30.1 |
| GPQA | 0.8 |
| IFBench | 0.8 |
| LCR | 0.7 |
| SciCode | 0.4 |
| Tau2 | 0.3 |
| Terminalbench Hard | 0.2 |
| HLE | 0.2 |
LLM Stats Category Scores
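| Category | Score |
|---|---|
| Biology | 90 |
| Chemistry | 90 |
| Language | 90 |
| Physics | 90 |
| General | 80 |
| Multimodal | 80 |
| Vision | 60 |
| Long Context | 60 |
| Reasoning | 60 |
| Healthcare | 50 |
| Math | 50 |
| Factuality | 40 |
| Grounding | 40 |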
Pricing
| Item | Price |
|---|---|
| Input Price | $0.25 / 1M tokens |
| Output Price | $1.50 / 1M tokens |
| Blended Price (3:1) | $0.563 / 1M tokens |
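The blended figure can be reproduced from the two listed prices. The sketch below assumes that "3:1" means three input tokens are weighted for every one output token; the page does not define the ratio, but this reading gives 0.5625, which matches the listed $0.563 after rounding to three decimals. The function names `blended_price` and `request_cost` are illustrative only and do not come from any provider SDK.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: int = 3, output_weight: int = 1) -> float:
    """Weighted average of per-1M-token prices in USD.

    Assumption: the 3:1 ratio weights input tokens three times
    as heavily as output tokens.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float, output_price: float) -> float:
    """Cost in USD of one request, given per-1M-token prices."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


INPUT_PRICE = 0.25   # USD per 1M input tokens (from the pricing table)
OUTPUT_PRICE = 1.50  # USD per 1M output tokens (from the pricing table)

blended = blended_price(INPUT_PRICE, OUTPUT_PRICE)
print(f"Blended price: ${blended:.4f} / 1M tokens")  # Blended price: $0.5625 / 1M tokens

# Hypothetical request: 3,000 input tokens and 1,000 output tokens.
cost = request_cost(3_000, 1_000, INPUT_PRICE, OUTPUT_PRICE)
print(f"Example request cost: ${cost:.5f}")  # Example request cost: $0.00225
```

For workloads whose input-to-output ratio differs a lot from 3:1, such as short prompts with long generations, `request_cost` is the more reliable estimate than the blended figure.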
Speed
| Metric | Value |
|---|---|
| Tokens/sec | 340.2 tokens/s |
| Time to First Token | 4.97s |
| Time to Answer | 4.97s |
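As a rough illustration of how these two numbers combine, the sketch below estimates end-to-end response time as time to first token plus generation time at the listed throughput. This is a simplified linear model and an assumption on my part: it ignores network overhead, queueing, and any variation in throughput with prompt or output length. The function name `estimate_response_seconds` is hypothetical.

```python
def estimate_response_seconds(output_tokens: int,
                              time_to_first_token: float,
                              tokens_per_second: float) -> float:
    """Estimate total response time under a simple linear model.

    Assumption: total time = time to first token + output_tokens / throughput.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return time_to_first_token + output_tokens / tokens_per_second


TTFT_SECONDS = 4.97        # Time to First Token (from the speed table)
TOKENS_PER_SECOND = 340.2  # Output speed (from the speed table)

# Hypothetical 500-token response.
estimate = estimate_response_seconds(500, TTFT_SECONDS, TOKENS_PER_SECOND)
print(f"Estimated time for 500 output tokens: {estimate:.2f}s")
```

Under this model the fixed time to first token dominates short responses, so the measured throughput matters mainly for long generations.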
Available Providers
(LS internal units)
| Provider | Input Price | Output Price |
|---|---|---|
|  | 250K | 1.5M |