Gemma 2 9B
GoogleGemmaOpen WeightGemma · Commercial OK
Description
Gemma 2 9B IT is an instruction-tuned version of Google's Gemma 2 9B base model. It was trained on 8 trillion tokens of web data, code, and math content. The model features sliding window attention, logit soft-capping, and knowledge distillation techniques. It's optimized for dialogue applications through supervised fine-tuning, distillation, RLHF, and model merging using WARP.
Release Date
2024-06-27
Parameters
9.2B
Context Length
—
Modalities
—
Capability Radar
70
general
40
coding
60
reasoning
68
scienceest.
0
agents
0
multimodal
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Reasoning | 29 | 82.0 | LS |
Benchmark Scores (LLM Stats)
Code
HumanEval
40.2%SR
Creativity
Social IQa
53.4%SR
Finance
MMLU
71.3%SR
General
ARC-E
88.0%SR
PIQA
81.7%SR
TriviaQA
76.6%SR
ARC-C
68.4%SR
AGIEval
52.8%SR
MBPP
0.52 / 100SR
Natural Questions
29.2%SR
Language
BoolQ
84.2%SR
Winogrande
80.6%SR
BIG-Bench
68.2%SR
Math
GSM8k
68.6%SR
MATH
36.6%SR
Reasoning
HellaSwag
81.9%SR
AA Evaluation Indices
No AA evaluation data available
LLM Stats Category Scores
Language80
Physics80
Finance70
General70
Healthcare70
Legal60
Math60
Reasoning60
Creativity50
Psychology50
Code40
Search30
Pricing
No pricing data available
Speed
No speed data available
Available Providers
(LS internal units)No provider data available