跳转到主要内容

Gemma 2 9B

GoogleGemmaOpen WeightGemma · Commercial OK

描述

Gemma 2 9B IT is an instruction-tuned version of Google's Gemma 2 9B base model. It was trained on 8 trillion tokens of web data, code, and math content. The model features sliding window attention, logit soft-capping, and knowledge distillation techniques. It's optimized for dialogue applications through supervised fine-tuning, distillation, RLHF, and model merging using WARP.

发布日期
2024-06-27
参数规模
9.2B
上下文长度
支持模态

能力雷达图

70
general
40
coding
60
reasoning
68
science估算
0
agents
0
multimodal

Science 在缺少专门科学评测时使用推理能力代理估算。

排行榜排名

领域#排名分数来源
推理能力29
82.0
LS

基准测试分数 (LLM Stats)

Code

HumanEval40.2%自报

Creativity

Social IQa53.4%自报

Finance

MMLU71.3%自报

General

ARC-E88.0%自报
PIQA81.7%自报
TriviaQA76.6%自报
ARC-C68.4%自报
AGIEval52.8%自报
MBPP0.52 / 100自报
Natural Questions29.2%自报

Language

BoolQ84.2%自报
Winogrande80.6%自报
BIG-Bench68.2%自报

Math

GSM8k68.6%自报
MATH36.6%自报

Reasoning

HellaSwag81.9%自报

AA 评测指数

暂无 AA 评测数据

LLM Stats 分类评分

Language
80
Physics
80
Finance
70
General
70
Healthcare
70
Legal
60
Math
60
Reasoning
60
Creativity
50
Psychology
50
Code
40
Search
30

定价

暂无定价数据

速度

暂无速度数据

可用提供商

(LS 内部计价单位)

暂无提供商数据

外部链接