Skip to main content

Gemma 4 12B (Reasoning)

GoogleGemma
Release Date
2026-06-03
Parameters
Context Length
131K
Modalities
image, text

Capability Radar

26
general
27
coding
75
reasoning
50
scienceest.
61
agents
70
multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain#RankScoreSource
Code Ranking182
47.0
AA
General Ranking194
52.0
AA
Science126
58.0
AA

Benchmark Scores (LLM Stats)

Biology

GPQA42.4%SR

Code

HumanEval87.8%SR
LiveCodeBench29.7%SR

Factuality

FACTS Grounding74.9%SR
SimpleQA10.0%SR

Finance

MMLU-Pro67.5%SR

General

IFEval90.4%SR
Natural2Code84.5%SR
Global-MMLU-Lite75.1%SR
MBPP0.74 / 100SR
MMMU (val)64.9%SR
BIG-Bench Extra Hard19.3%SR

Image To Text

DocVQA86.6%SR
VQAv2 (val)71.0%SR
TextVQA65.1%SR

Language

BIG-Bench Hard87.6%SR
WMT24++53.4%SR
ECLeKTic16.7%SR

Math

GSM8k95.9%SR
MATH89.0%SR
MathVista-Mini67.6%SR
HiddenMath60.3%SR

Multimodal

AI2D84.5%SR
ChartQA78.0%SR
InfoVQA70.6%SR

Reasoning

Bird-SQL (dev)54.4%SR

AA Evaluation Indices

Intelligence Index
29.1
Coding Index
24.9
Gpqa
0.8
Ifbench
0.7
Lcr
0.6
Scicode
0.4
Tau2
0.4
Terminalbench Hard
0.2
Hle
0.1

LLM Stats Category Scores

Structured Output
90
Instruction Following
90
Math
80
Vision
70
Finance
70
Grounding
70
Healthcare
70
Image To Text
70
Legal
70
Multimodal
70
General
60
Language
60
Reasoning
60
Code
60
Factuality
40
Physics
40
Biology
40
Chemistry
40

Pricing

Input PriceFree
Output PriceFree
Blended Price (3:1)Free

Speed

Tokens/sec0.0
Time to First Token0.00s
Time to Answer0.00s

Provider Price Ranking

No provider data available

External Sources