Gemma 4 12B (Non-reasoning)
Google / Gemma
Description
Gemma 4 12B is Google DeepMind's encoder-free multimodal instruction-tuned model with 11.95 billion parameters and a 256K context window. It supports text, image, audio, and video inputs with text output, projecting image patches and audio waveforms directly into a single decoder-only transformer for streamlined local deployment.
Release Date
2026-06-03
Parameters
11.95B
Context Length
131K
Modalities
image, text
Capability Radar
| Capability | Score |
|---|---|
| General | 12 |
| Coding | 19 |
| Reasoning | 66 |
| Science (est.) | 41 |
| Agents | 52 |
| Multimodal | 50 |
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 286 | 29.0 | AA |
| General Ranking | 361 | 30.0 | AA |
| Science | 263 | 42.0 | AA |
Benchmark Scores (LLM Stats)
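| Category | Benchmark | Score | Source |
|---|---|---|---|
| Audio | CoVoST2 | 38.5% | SR |
| Biology | GPQA | 78.8% | SR |
| Finance | MMLU-Pro | 77.2% | SR |
| General | MMMLU | 83.4% | SR |
| General | LiveCodeBench v6 | 72.0% | SR |
| General | MMMU-Pro | 69.1% | SR |
| General | BIG-Bench Extra Hard | 53.0% | SR |
| General | MRCR v2 | 43.4% | SR |
| Healthcare | MedXpertQA | 48.7% | SR |
| Language | FLEURS | 93.1% | SR |
| Math | MathVision | 79.7% | SR |
| Math | AIME 2026 | 77.5% | SR |
| Math | CodeForces | 0.55 / 3000 | SR |
| Math | Humanity's Last Exam | 5.2% | SR |
| Multimodal | OmniDocBench 1.5 | 16.4% | SR |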
AA Evaluation Indices
| Metric | Value |
|---|---|
| Coding Index | 17.5 |
| Intelligence Index | 13.2 |
| GPQA | 0.7 |
| IFBench | 0.5 |
| Tau2 | 0.3 |
| LCR | 0.3 |
| SciCode | 0.3 |
| Terminal-Bench Hard | 0.1 |
| HLE | 0.1 |
LLM Stats Category Scores
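| Category | Score |
|---|---|
| Legal | 80 |
| Physics | 80 |
| Finance | 80 |
| Biology | 80 |
| Chemistry | 80 |
| Speech To Text | 70 |
| Language | 70 |
| General | 70 |
| Math | 60 |
| Reasoning | 60 |
| Healthcare | 60 |
| Multimodal | 50 |
| Long Context | 40 |
| Audio | 40 |
| Vision | 40 |
| Structured Output | 20 |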
Pricing
Input Price: Free
Output Price: Free
Blended Price (3:1): Free
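
The page does not define how the 3:1 blended price is computed. A minimal sketch, assuming it is a weighted average of three parts input price to one part output price (both in the same unit, for example USD per 1M tokens); the function name `blended_price` and the sample prices are hypothetical and not taken from this page:

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of input and output token prices.

    Assumption: "3:1" means three parts input to one part output.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# A model that is free on both sides stays free under any weighting.
print(blended_price(0.0, 0.0))  # 0.0

# Hypothetical prices, for illustration only.
print(blended_price(1.0, 3.0))  # 1.5
```

Under this assumption a free input price and a free output price always yield a free blended price, which is consistent with the values listed above.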
Speed
Tokens/sec: 0.0
Time to First Token: 0.00s
Time to Answer: 0.00s
Provider Price Ranking
No provider data available