Gemma 4 12B
GoogleGemmaOpen WeightApache 2.0 · Commercial OK
Description
Gemma 4 12B is Google DeepMind's encoder-free multimodal instruction-tuned model with 11.95 billion parameters and a 256K context window. It supports text, image, audio, and video inputs with text output, projecting image patches and audio waveforms directly into a single decoder-only transformer for streamlined local deployment.
Release Date
2026-05-23
Parameters
12.0B
Context Length
131K
Modalities
image, text
Capability Radar
70
general
0
coding
60
reasoning
68
scienceest.
42
agents
50
multimodal
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Multimodal Ranking | 80 | 16.0 | LS |
Benchmark Scores (LLM Stats)
Audio
CoVoST2
38.5%SR
Biology
GPQA
78.8%SR
Finance
MMLU-Pro
77.2%SR
General
MMMLU
83.4%SR
LiveCodeBench v6
72.0%SR
MMMU-Pro
69.1%SR
BIG-Bench Extra Hard
53.0%SR
MRCR v2
43.4%SR
Healthcare
MedXpertQA
48.7%SR
Language
FLEURS
93.1%SR
Math
MathVision
79.7%SR
AIME 2026
77.5%SR
CodeForces
0.55 / 3000SR
Humanity's Last Exam
5.2%SR
Multimodal
OmniDocBench 1.5
16.4%SR
AA Evaluation Indices
No AA evaluation data available
LLM Stats Category Scores
Finance80
Legal80
Physics80
Biology80
Chemistry80
Speech To Text70
General70
Language70
Reasoning60
Healthcare60
Math60
Multimodal50
Long Context40
Vision40
Audio40
Structured Output20
Pricing
Input Price$0.05 / 1M tokens
Output Price$0.15 / 1M tokens
Blended Price (3:1)$0.075 / 1M tokens
Speed
No speed data available
Provider Price Ranking
Provider Price Ranking
4 providers
Cheapest: Kilo GatewayMost Expensive: NovitaAI
ProviderInputOutput
1Kilo GatewayCheapest
$0.04
$0.13
2GooglePRIMARY
$0.05
$0.15
3OpenRouter
$0.05
$0.15
4NovitaAI
$0.05
$0.1
Compare pricing across different API providers for this model.