Gemma 3 4B Instruct

GoogleGemma開源權重Gemma · 商用許可

描述

Gemma 3 4B is a 4-billion-parameter vision-language model from Google, handling text and image input and generating text output. It features a 128K context window, multilingual support, and open weights. Suitable for question answering, summarization, reasoning, and image understanding tasks.

發布日期

2025-03-12

參數規模

4.0B

上下文長度

131K

支援模態

image, text

能力雷達圖

general

coding

reasoning

science估算

agents

multimodal

Science 在缺少專門科學評測時使用推理能力代理估算。

排行榜排名

領域	#排名	分數	來源
程式碼能力榜	464	7.0	AA
通用能力榜	489	14.0	AA
數學推理	290	24.0	AA
多模態榜	76	65.0	LS
推理能力	101	36.0	LS
科學能力	479	14.0	AA

基準測試分數 (LLM Stats)

Biology

GPQA

30.8%自報

Code

HumanEval

71.3%自報

LiveCodeBench

12.6%自報

Factuality

FACTS Grounding

70.1%自報

SimpleQA

4.0%自報

Finance

MMLU-Pro

43.6%自報

General

IFEval

90.2%自報

Natural2Code

70.3%自報

MBPP

0.63 / 100自報

Global-MMLU-Lite

54.5%自報

MMMU (val)

48.8%自報

BIG-Bench Extra Hard

11.0%自報

Image To Text

DocVQA

75.8%自報

VQAv2 (val)

62.4%自報

TextVQA

57.8%自報

Language

BIG-Bench Hard

72.2%自報

WMT24++

46.8%自報

ECLeKTic

4.6%自報

Math

GSM8k

89.2%自報

MATH

75.6%自報

MathVista-Mini

50.0%自報

HiddenMath

43.0%自報

Multimodal

AI2D

74.8%自報

ChartQA

68.8%自報

InfoVQA

50.0%自報

Reasoning

Bird-SQL (dev)

36.3%自報

AA 評測指數

Math Index

12.7

Intelligence Index

1.1

Math 500

0.8

Mmlu Pro

0.4

Gpqa

0.3

Ifbench

0.3

Aime 25

0.1

Livecodebench

0.1

Scicode

0.1

Aime

0.1

Lcr

0.1

Hle

0.1

Tau2

0.0

Terminalbench Hard

0.0

LLM Stats 分類評分

Instruction Following

Structured Output

Image To Text

Grounding

Math

Multimodal

Vision

Reasoning

Healthcare

Language

Legal

Factuality

Finance

General

Code

Physics

Biology

Chemistry

定價

輸入價格$0.04 / 1M tokens

輸出價格$0.08 / 1M tokens

混合價格(3:1)$0.05 / 1M tokens

速度

Tokens/秒0.0

首Token延遲0.00s

首回答延遲0.00s

供應商價格排行

6 個供應商

最便宜: Chutes最貴: NanoGPT

供應商輸入輸出

1Chutes最便宜

$0.01

$0.0272

2Google主要

$0.04

$0.08

3Kilo Gateway

$0.04

$0.08

4Amazon Bedrock

$0.04

$0.08

5OpenRouter

$0.05

$0.1

6NanoGPT

$0.2006

比較該模型在不同 API 供應商之間的定價。

外部連結

LLM Stats Artificial Analysis