跳转到主要内容

Gemma 4 12B (Non-reasoning)

GoogleGemma

描述

Gemma 4 12B is Google DeepMind's encoder-free multimodal instruction-tuned model with 11.95 billion parameters and a 256K context window. It supports text, image, audio, and video inputs with text output, projecting image patches and audio waveforms directly into a single decoder-only transformer for streamlined local deployment.

发布日期
2026-06-03
参数规模
上下文长度
131K
支持模态
image, text

能力雷达图

12
general
19
coding
66
reasoning
41
science估算
52
agents
50
multimodal

Science 在缺少专门科学评测时使用推理能力代理估算。

排行榜排名

领域#排名分数来源
代码能力榜295
26.0
AA
通用能力榜362
30.0
AA
科学能力264
42.0
AA

基准测试分数 (LLM Stats)

Audio

CoVoST238.5%自报

Biology

GPQA78.8%自报

Finance

MMLU-Pro77.2%自报

General

MMMLU83.4%自报
LiveCodeBench v672.0%自报
MMMU-Pro69.1%自报
BIG-Bench Extra Hard53.0%自报
MRCR v243.4%自报

Healthcare

MedXpertQA48.7%自报

Language

FLEURS93.1%自报

Math

MathVision79.7%自报
AIME 202677.5%自报
CodeForces0.55 / 3000自报
Humanity's Last Exam5.2%自报

Multimodal

OmniDocBench 1.516.4%自报

AA 评测指数

Coding Index
17.5
Intelligence Index
13.2
Gpqa
0.7
Ifbench
0.5
Tau2
0.3
Lcr
0.3
Scicode
0.3
Terminalbench Hard
0.1
Hle
0.1

LLM Stats 分类评分

Legal
80
Physics
80
Finance
80
Biology
80
Chemistry
80
Language
70
Speech To Text
70
General
70
Math
60
Reasoning
60
Healthcare
60
Multimodal
50
Long Context
40
Audio
40
Vision
40
Structured Output
20

定价

输入价格免费
输出价格免费
混合价格(3:1)免费

速度

Tokens/秒0.0
首Token延迟0.00s
首回答延迟0.00s

供应商价格排行

暂无提供商数据

外部链接