Gemma 2 9B

Google · Gemma · Open Weight · Gemma · Commercial OK

Description

Gemma 2 9B IT is an instruction-tuned version of Google's Gemma 2 9B base model. It was trained on 8 trillion tokens of web data, code, and math content. The model features sliding window attention, logit soft-capping, and knowledge distillation techniques. It's optimized for dialogue applications through supervised fine-tuning, distillation, RLHF, and model merging using WARP.
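Two of the techniques named above, logit soft-capping and sliding window attention, are easy to illustrate in a few lines. The following is a minimal sketch, not the model's actual implementation: the function names `soft_cap` and `sliding_window_mask` are hypothetical, and the cap of 30.0 and window of 3 are illustrative values chosen for the demo. Soft-capping is assumed here to be the usual `cap * tanh(x / cap)` form, and the mask assumes a causal window in which each position attends to itself and the `window - 1` positions before it.

```python
import math


def soft_cap(logits, cap):
    """Squash each logit smoothly into the open interval (-cap, cap).

    Small values pass through almost unchanged; large values saturate
    near +/-cap instead of growing without bound.
    """
    return [cap * math.tanh(x / cap) for x in logits]


def sliding_window_mask(seq_len, window):
    """Build a causal sliding-window attention mask.

    mask[i][j] is True when query position i may attend to key position j,
    i.e. j is not in the future (j <= i) and lies within the last
    `window` positions (i - j < window).
    """
    return [
        [(j <= i) and (i - j < window) for j in range(seq_len)]
        for i in range(seq_len)
    ]


# Illustrative values only (assumptions for this demo).
capped = soft_cap([-200.0, 0.0, 5.0, 200.0], cap=30.0)
mask = sliding_window_mask(seq_len=5, window=3)

print(capped)
for row in mask:
    print(["x" if allowed else "." for allowed in row])
```

In this sketch the extreme inputs (-200 and 200) end up just inside the cap, while the small input (5.0) is barely changed, and the last row of the mask allows attention only to the three most recent positions.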

Release Date: 2024-06-27
Parameters: 9.2B
Context Length
Modalities

Capability Radar

general: 70
coding: 40
reasoning: 60
science (est.): 68
agents: 0
multimodal: 0

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain | #Rank | Score | Source
Reasoning | 29 | 82.0 | LS

Benchmark Scores (LLM Stats)

Code

HumanEval: 40.2% (SR)

Creativity

Social IQa: 53.4% (SR)

Finance

MMLU: 71.3% (SR)

General

ARC-E: 88.0% (SR)
PIQA: 81.7% (SR)
TriviaQA: 76.6% (SR)
ARC-C: 68.4% (SR)
AGIEval: 52.8% (SR)
MBPP: about 52% (SR; listed as 0.52 / 100, apparently a 0-1 fraction)
Natural Questions: 29.2% (SR)

Language

BoolQ: 84.2% (SR)
Winogrande: 80.6% (SR)
BIG-Bench: 68.2% (SR)

Math

GSM8k: 68.6% (SR)
MATH: 36.6% (SR)

Reasoning

HellaSwag: 81.9% (SR)

AA Evaluation Indices

No AA evaluation data available

LLM Stats Category Scores

Language: 80
Physics: 80
Finance: 70
General: 70
Healthcare: 70
Legal: 60
Math: 60
Reasoning: 60
Creativity: 50
Psychology: 50
Code: 40
Search: 30

Pricing

No pricing data available

Speed

No speed data available

Available Providers

(LS internal units)

No provider data available

External Sources