Gemma 4 12B (Non-reasoning)
Google · Gemma
Description
Gemma 4 12B is Google DeepMind's encoder-free multimodal instruction-tuned model with 11.95 billion parameters and a 256K context window. It supports text, image, audio, and video inputs with text output, projecting image patches and audio waveforms directly into a single decoder-only transformer for streamlined local deployment.
Release date
2026-06-03
Parameters
—
Context length
131K
Supported modalities
image, text
Capability radar chart
general: 12
coding: 19
reasoning: 66
science (estimated): 41
agents: 52
multimodal: 50
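The science score is estimated from reasoning ability as a proxy when no dedicated science evaluation is available.
Leaderboard ranking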
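Benchmark scores (LLM Stats)
Audio
CoVoST2: 38.5% (self-reported)
Biology
GPQA: 78.8% (self-reported)
Finance
MMLU-Pro: 77.2% (self-reported)
General
MMMLU: 83.4% (self-reported)
LiveCodeBench v6: 72.0% (self-reported)
MMMU-Pro: 69.1% (self-reported)
BIG-Bench Extra Hard: 53.0% (self-reported)
MRCR v2: 43.4% (self-reported)
Healthcare
MedXpertQA: 48.7% (self-reported)
Language
FLEURS: 93.1% (self-reported)
Math
MathVision: 79.7% (self-reported)
AIME 2026: 77.5% (self-reported)
CodeForces: 0.55 / 3000 (self-reported)
Humanity's Last Exam: 5.2% (self-reported)
Multimodal
OmniDocBench 1.5: 16.4% (self-reported)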
AA evaluation indices
Coding Index: 17.5
Intelligence Index: 13.2
GPQA: 0.7
IFBench: 0.5
Tau2: 0.3
LCR: 0.3
SciCode: 0.3
TerminalBench Hard: 0.1
HLE: 0.1
LLM Stats category scores
Legal: 80
Physics: 80
Finance: 80
Biology: 80
Chemistry: 80
Language: 70
Speech To Text: 70
General: 70
Math: 60
Reasoning: 60
Healthcare: 60
Multimodal: 50
Long Context: 40
Audio: 40
Vision: 40
Structured Output: 20
Pricing
Input price: free
Output price: free
Blended price (3:1): free
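The page does not define how the 3:1 blended price is computed. A minimal sketch, assuming it is a weighted average of three parts input price to one part output price (a common convention on pricing leaderboards, not confirmed by this page); the function name `blended_price` and the non-zero example prices are hypothetical:

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of input and output prices.

    Assumption: "3:1" means three parts input to one part output.
    """
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight


# A free model stays free under any weighting.
print(blended_price(0.0, 0.0))

# Hypothetical prices, for illustration only: (3 * 1.0 + 5.0) / 4
print(blended_price(1.0, 5.0))
```

Under this assumption, a model listed as free for both input and output is also free on the blended figure, which matches the values shown here.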
Speed
Tokens/second: 0.0
Time to first token: 0.00s
Time to first answer: 0.00s
Provider price ranking
No provider data available