Gemma 4 12B (Reasoning)

Google · Gemma

Description

Gemma 4 12B is Google DeepMind's encoder-free multimodal instruction-tuned model with 11.95 billion parameters and a 256K context window. It supports text, image, audio, and video inputs with text output, projecting image patches and audio waveforms directly into a single decoder-only transformer for streamlined local deployment.

Release date

2026-06-03

Parameters

—

Context length

131K

Supported modalities

image, text

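Capability radar chart

Axes: general, coding, reasoning, science (estimated), agents, multimodal.

Where no dedicated science evaluation is available, the science score is estimated using reasoning ability as a proxy.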

Leaderboard rankings

Domain	Rank	Score	Source
Coding	206	51.0	AA
General	226	49.0	AA
Science	163	56.0	AA

Benchmark scores (LLM Stats)

Category	Benchmark	Score (self-reported)
Audio	CoVoST2	38.5%
Biology	GPQA	78.8%
Finance	MMLU-Pro	77.2%
General	MMMLU	83.4%
General	LiveCodeBench v6	72.0%
General	MMMU-Pro	69.1%
General	BIG-Bench Extra Hard	53.0%
General	MRCR v2 (8-needle)	43.4%
Healthcare	MedXpertQA	48.7%
Language	FLEURS	93.1%
Math	MathVision	79.7%
Math	AIME 2026	77.5%
Math	CodeForces	0.55 / 3000
Math	Humanity's Last Exam	5.2%
Multimodal	OmniDocBench 1.5	16.4%

AA evaluation indices

Metric	Score
Intelligence Index	22.0
GPQA	0.8
IFBench	0.7
LCR	0.6
SciCode	0.4
Tau2	0.4
Terminal-Bench Hard	0.2
HLE	0.1

LLM Stats category scores

Categories charted: Physics, Legal, Finance, Biology, Chemistry, Speech To Text, Language, Math, Reasoning, General, Healthcare, Multimodal, Long Context, Audio, Vision, Structured Output.

Pricing

Input price: $0.1 / 1M tokens

Output price: $0.3 / 1M tokens

Blended price (3:1 input:output): $0.15 / 1M tokens
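The blended figure is consistent with a weighted average of the input and output prices at a 3:1 input-to-output token ratio. The sketch below reproduces that arithmetic and adds a per-request cost estimate. The function names `blended_price` and `request_cost` are hypothetical helpers introduced here for illustration, and the 3:1 weighting is an assumption inferred from the listed numbers, not a documented formula.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    Assumes the "3:1" label means 3 input tokens for every 1 output token.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float = 0.1, output_price: float = 0.3) -> float:
    """Cost in USD for one request; prices are quoted per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Prices taken from the listing above: $0.1 input, $0.3 output per 1M tokens.
    print(f"blended: ${blended_price(0.1, 0.3):.2f} / 1M tokens")
    # Hypothetical request size, chosen only for illustration.
    print(f"10k in + 2k out: ${request_cost(10_000, 2_000):.4f}")
```

For a reasoning model, the output token count would typically include any reasoning tokens the provider bills as output, so real costs may be higher than the visible answer length suggests.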

Speed

Tokens/second: 126.0

Time to first token: 1.45s

Time to first answer: 17.33s
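These three figures can be combined into a rough estimate of how long a complete response takes. The sketch below assumes that answer tokens stream at the listed throughput once the first answer token has arrived; that is a simplification, and `estimated_response_seconds` is a hypothetical helper, not part of any provider API.

```python
def estimated_response_seconds(answer_tokens: int,
                               first_answer_latency_s: float = 17.33,
                               tokens_per_second: float = 126.0) -> float:
    """Rough wall-clock time until the last answer token arrives.

    Assumes a constant generation rate after the first answer token,
    using the latency and throughput values from the listing.
    """
    if answer_tokens < 0:
        raise ValueError("answer_tokens must be non-negative")
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return first_answer_latency_s + answer_tokens / tokens_per_second


if __name__ == "__main__":
    # Hypothetical answer lengths, chosen only for illustration.
    for n in (100, 500, 2000):
        print(f"{n} answer tokens: ~{estimated_response_seconds(n):.1f} s")
```

Measured latency varies with load, prompt length, and how much reasoning the model does, so this is best read as an order-of-magnitude guide.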

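Provider price ranking

1 provider

Rank	Provider	Input ($ / 1M tokens)	Output ($ / 1M tokens)
1	Google (primary)	$0.1	$0.3

Compare this model's pricing across API providers.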

External links

Artificial Analysis