Gemma 3 4B Instruct

Google · Gemma · Open weights · Gemma license (commercial use permitted)

Description

Gemma 3 4B is a 4-billion-parameter vision-language model from Google that accepts text and image input and generates text output. It features a 128K-token context window, multilingual support, and open weights. It is suited to question answering, summarization, reasoning, and image understanding tasks.

Release date

2025-03-12

Parameters

4.0B

Context length

131K
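
The description quotes a 128K context window while this field shows 131K. The two figures agree if "128K" counts in units of 1,024 tokens and "131K" rounds the same number in units of 1,000. A minimal sketch of that arithmetic, assuming the underlying limit is 128 × 1,024 tokens (the page does not state the exact figure):

```python
# Assumption: "128K" uses binary units (1K = 1,024 tokens).
BINARY_K = 1024

tokens = 128 * BINARY_K           # exact token count under that assumption
decimal_k = round(tokens / 1000)  # the same count expressed in thousands

print(tokens)     # 131072
print(decimal_k)  # 131
```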

Supported modalities

image, text

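Capability radar chart

Axes: general, coding, reasoning, science (estimated), agents, multimodal.

When no dedicated science evaluation is available, the Science score is estimated using reasoning ability as a proxy.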

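Leaderboard rankings

Domain	Rank	Score	Source
Coding	464	7.0	AA
General capability	489	14.0	AA
Math reasoning	290	24.0	AA
Multimodal	76	65.0	LS
Reasoning	101	36.0	LS
Science	479	14.0	AA

Sources: AA = Artificial Analysis, LS = LLM Stats.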

Benchmark scores (LLM Stats)

All scores in this table are self-reported.

Category	Benchmark	Score
Biology	GPQA	30.8%
Code	HumanEval	71.3%
Code	LiveCodeBench	12.6%
Factuality	FACTS Grounding	70.1%
Factuality	SimpleQA	4.0%
Finance	MMLU-Pro	43.6%
General	IFEval	90.2%
General	Natural2Code	70.3%
General	MBPP	0.63 / 100
General	Global-MMLU-Lite	54.5%
General	MMMU (val)	48.8%
General	BIG-Bench Extra Hard	11.0%
Image To Text	DocVQA	75.8%
Image To Text	VQAv2 (val)	62.4%
Image To Text	TextVQA	57.8%
Language	BIG-Bench Hard	72.2%
Language	WMT24++	46.8%
Language	ECLeKTic	4.6%
Math	GSM8k	89.2%
Math	MATH	75.6%
Math	MathVista-Mini	50.0%
Math	HiddenMath	43.0%
Multimodal	AI2D	74.8%
Multimodal	ChartQA	68.8%
Multimodal	InfoVQA	50.0%
Reasoning	Bird-SQL (dev)	36.3%

AA evaluation indices

Metric	Score
Math Index	12.7
Intelligence Index	1.1
MATH-500	0.8
MMLU-Pro	0.4
GPQA	0.3
IFBench	0.3
AIME 25	0.1
LiveCodeBench	0.1
SciCode	0.1
AIME	0.1
LCR	0.1
HLE	0.1
Tau2	0.0
TerminalBench Hard	0.0

LLM Stats category scores

Categories: Instruction Following, Structured Output, Image To Text, Grounding, Math, Multimodal, Vision, Reasoning, Healthcare, Language, Legal, Factuality, Finance, General, Code, Physics, Biology, Chemistry.

Pricing

Input price: $0.04 / 1M tokens

Output price: $0.08 / 1M tokens

Blended price (3:1): $0.05 / 1M tokens
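
The 3:1 blended figure matches a weighted average of the input and output prices. A minimal sketch, assuming "3:1" means three input tokens for every output token (the page does not define the ratio; `blended_price` is a hypothetical helper, not part of any provider API):

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average price per 1M tokens for a given input:output token mix."""
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


# Prices in USD per 1M tokens, taken from the pricing fields above.
price = blended_price(0.04, 0.08)
print(f"${price:.2f} / 1M tokens")  # $0.05 / 1M tokens
```

Under this assumption the result is (3 × 0.04 + 1 × 0.08) / 4 = 0.05, which agrees with the listed blended price.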

Speed

Tokens/second: 0.0

Time to first token: 0.00s

Time to first answer: 0.00s

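Provider price ranking

6 providers

Cheapest: Chutes · Most expensive: NanoGPT

Rank	Provider	Input	Output
1	Chutes (cheapest)	$0.01	$0.0272
2	Google (primary)	$0.04	$0.08
3	Kilo Gateway	$0.04	$0.08
4	Amazon Bedrock	$0.04	$0.08
5	OpenRouter	$0.05	$0.1
6	NanoGPT	$0.2006

Compares this model's pricing across API providers.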

External links

LLM Stats · Artificial Analysis