Gemini 3.1 Flash-Lite Preview
GoogleGeminiProprietary
描述
Gemini 3.1 Flash-Lite is the first Flash-Lite model in the Gemini 3 series. It is optimized for high-volume, latency-sensitive tasks like translation, content moderation, and classification. It delivers enhanced performance at a fraction of the cost of larger models, with 2.5x faster Time to First Answer Token and 45% increased output speed compared to 2.5 Flash. Supports text, image, video, audio, and PDF input with a 1 million-token context window.
发布日期
2026-03-03
参数规模
—
上下文长度
1.0M
支持模态
audio, file, image, text, video
能力雷达图
30
general
32
coding
82
reasoning
55
science估算
0
agents
80
multimodal
Science 在缺少专门科学评测时使用推理能力代理估算。
排行榜排名
基准测试分数 (LLM Stats)
Biology
GPQA
86.9%自报
Factuality
SimpleQA
43.3%自报
FACTS Grounding
40.6%自报
General
MMMLU
88.9%自报
MMMU-Pro
76.8%自报
MRCR v2 (8-needle)
60.1%自报
Healthcare
VideoMMMU
84.8%自报
Math
Humanity's Last Exam
16.0%自报
Multimodal
CharXiv-R
73.2%自报
AA 评测指数
Intelligence Index33.5
Coding Index30.1
Gpqa0.8
Ifbench0.8
Lcr0.7
Scicode0.4
Tau20.3
Terminalbench Hard0.2
Hle0.2
LLM Stats 分类评分
Biology90
Chemistry90
Language90
Physics90
General80
Multimodal80
Vision60
Long Context60
Reasoning60
Healthcare50
Math50
Factuality40
Grounding40
定价
输入价格$0.25 / 1M tokens
输出价格$1.5 / 1M tokens
混合价格(3:1)$0.563 / 1M tokens
速度
Tokens/秒340.2 tokens/s
首Token延迟4.97s
首回答延迟4.97s
可用提供商
(LS 内部计价单位)| 提供商 | 输入价格 | 输出价格 |
|---|---|---|
| 250K | 1.5M |