Gemma 4 E4B (Non-reasoning)
GoogleGemmaOpen WeightApache 2.0 · Commercial OK
Description
Gemma 4 E4B is Google DeepMind's compact multimodal model with 4.5 billion effective parameters (8B with embeddings) and a 128K context window. Supports image, text, and audio inputs. Features Per-Layer Embeddings for efficient on-device deployment while maintaining strong multimodal capabilities.
Release Date
2026-04-03
Parameters
8.0B
Context Length
33K
Modalities
text
Capability Radar
13
general
6
coding
55
reasoning
27
scienceest.
60
agents
0
multimodal
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Agents & Tools | 52 | 57.0 | LS |
| Code Ranking | 381 | 15.0 | AA |
| General Ranking | 392 | 26.0 | AA |
| Science | 401 | 23.0 | AA |
Benchmark Scores (LLM Stats)
Agents
t2-bench
57.5%SR
Biology
GPQA
58.6%SR
Finance
MMLU-Pro
69.4%SR
General
MMMLU
76.6%SR
MMMU-Pro
52.6%SR
LiveCodeBench v6
52.0%SR
BIG-Bench Extra Hard
33.1%SR
MRCR v2
25.4%SR
Healthcare
MedXpertQA
28.7%SR
Math
MathVision
59.5%SR
AIME 2026
42.5%SR
AA Evaluation Indices
Intelligence Index14.8
Coding Index6.4
Gpqa0.5
Ifbench0.4
Tau20.3
Lcr0.2
Terminalbench Hard0.1
Hle0.0
Scicode0.0
LLM Stats Category Scores
Finance70
Legal70
Tool Calling60
Agents60
Biology60
Chemistry60
Language60
Math60
Physics60
Vision50
General50
Healthcare50
Multimodal50
Reasoning50
Long Context30
Pricing
Input Price$0.3 / 1M tokens
Output Price$1.25 / 1M tokens
Blended Price (3:1)$0.537 / 1M tokens
Speed
Tokens/sec54.2 tokens/s
Time to First Token0.48s
Time to Answer0.48s
Available Providers
(LS internal units)No provider data available