Skip to main content

Gemini 3.1 Flash-Lite Preview

GoogleGeminiProprietary

Description

Gemini 3.1 Flash-Lite is the first Flash-Lite model in the Gemini 3 series. It is optimized for high-volume, latency-sensitive tasks like translation, content moderation, and classification. It delivers enhanced performance at a fraction of the cost of larger models, with 2.5x faster Time to First Answer Token and 45% increased output speed compared to 2.5 Flash. Supports text, image, video, audio, and PDF input with a 1 million-token context window.

Release Date
2026-03-03
Parameters
Context Length
1.0M
Modalities
audio, file, image, text, video

Capability Radar

30
general
32
coding
82
reasoning
55
scienceest.
0
agents
80
multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain#RankScoreSource
Code Ranking113
57.0
AA
General Ranking155
57.0
AA
Multimodal Ranking55
73.0
LS
Science80
64.0
AA

Benchmark Scores (LLM Stats)

Biology

GPQA86.9%SR

Factuality

SimpleQA43.3%SR
FACTS Grounding40.6%SR

General

MMMLU88.9%SR
MMMU-Pro76.8%SR
MRCR v2 (8-needle)60.1%SR

Healthcare

VideoMMMU84.8%SR

Math

Humanity's Last Exam16.0%SR

Multimodal

CharXiv-R73.2%SR

AA Evaluation Indices

Intelligence Index
33.5
Coding Index
30.1
Gpqa
0.8
Ifbench
0.8
Lcr
0.7
Scicode
0.4
Tau2
0.3
Terminalbench Hard
0.2
Hle
0.2

LLM Stats Category Scores

Biology
90
Chemistry
90
Language
90
Physics
90
General
80
Multimodal
80
Vision
60
Long Context
60
Reasoning
60
Healthcare
50
Math
50
Factuality
40
Grounding
40

Pricing

Input Price$0.25 / 1M tokens
Output Price$1.5 / 1M tokens
Blended Price (3:1)$0.563 / 1M tokens

Speed

Tokens/sec340.2 tokens/s
Time to First Token4.97s
Time to Answer4.97s

Available Providers

(LS internal units)
ProviderInput PriceOutput Price
Google250K1.5M

External Sources