Skip to main content

Gemma 4 12B (Non-reasoning)

GoogleGemma

Description

Gemma 4 12B is Google DeepMind's encoder-free multimodal instruction-tuned model with 11.95 billion parameters and a 256K context window. It supports text, image, audio, and video inputs with text output, projecting image patches and audio waveforms directly into a single decoder-only transformer for streamlined local deployment.

Release Date
2026-06-03
Parameters
Context Length
131K
Modalities
image, text

Capability Radar

12
general
19
coding
66
reasoning
41
scienceest.
52
agents
50
multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain#RankScoreSource
Code Ranking286
29.0
AA
General Ranking361
30.0
AA
Science263
42.0
AA

Benchmark Scores (LLM Stats)

Audio

CoVoST238.5%SR

Biology

GPQA78.8%SR

Finance

MMLU-Pro77.2%SR

General

MMMLU83.4%SR
LiveCodeBench v672.0%SR
MMMU-Pro69.1%SR
BIG-Bench Extra Hard53.0%SR
MRCR v243.4%SR

Healthcare

MedXpertQA48.7%SR

Language

FLEURS93.1%SR

Math

MathVision79.7%SR
AIME 202677.5%SR
CodeForces0.55 / 3000SR
Humanity's Last Exam5.2%SR

Multimodal

OmniDocBench 1.516.4%SR

AA Evaluation Indices

Coding Index
17.5
Intelligence Index
13.2
Gpqa
0.7
Ifbench
0.5
Tau2
0.3
Lcr
0.3
Scicode
0.3
Terminalbench Hard
0.1
Hle
0.1

LLM Stats Category Scores

Legal
80
Physics
80
Finance
80
Biology
80
Chemistry
80
Speech To Text
70
Language
70
General
70
Math
60
Reasoning
60
Healthcare
60
Multimodal
50
Long Context
40
Audio
40
Vision
40
Structured Output
20

Pricing

Input PriceFree
Output PriceFree
Blended Price (3:1)Free

Speed

Tokens/sec0.0
Time to First Token0.00s
Time to Answer0.00s

Provider Price Ranking

No provider data available

External Sources