Llama 3.2 Instruct 3B
Meta · Llama · Open Weight · Llama 3.2 Community License
Description
Llama 3.2 3B Instruct is a large language model that supports a context length of 128K tokens and is state-of-the-art in its class for on-device use cases such as summarization, instruction following, and rewriting tasks running locally at the edge.
Release Date
2024-09-25
Parameters
3.2B
Context Length
131K
Modalities
text
Capability Radar
| Capability | Score |
|---|---|
| General | 17 |
| Coding | 8 |
| Reasoning | 13 |
| Science (est.) | 14 |
| Agents | 50 |
| Multimodal | 0 |
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 450 | 6.0 | AA |
| General Ranking | 440 | 19.0 | AA |
| Math Reasoning | 332 | 12.0 | AA |
| Reasoning | 45 | 70.0 | LS |
| Science | 466 | 12.0 | AA |
Benchmark Scores (LLM Stats)
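| Category | Benchmark | Score | Source |
|---|---|---|---|
| Biology | GPQA | 32.8% | SR |
| Finance | MMLU | 63.4% | SR |
| General | ARC-C | 78.6% | SR |
| General | IFEval | 77.4% | SR |
| General | BFCL v2 | 67.0% | SR |
| General | Nexus | 34.3% | SR |
| Language | Open-rewrite | 40.1% | SR |
| Language | TLDR9+ (test) | 19.0% | SR |
| Long Context | NIH/Multi-needle | 84.7% | SR |
| Long Context | InfiniteBench/En.MC | 63.3% | SR |
| Long Context | InfiniteBench/En.QA | 19.8% | SR |
| Math | GSM8k | 77.7% | SR |
| Math | MGSM | 58.2% | SR |
| Math | MATH | 48.0% | SR |
| Reasoning | HellaSwag | 69.8% | SR |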
AA Evaluation Indices
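| Index | Value |
|---|---|
| Intelligence Index | 9.7 |
| Math Index | 3.3 |
| MATH 500 | 0.5 |
| MMLU Pro | 0.3 |
| IFBench | 0.3 |
| GPQA | 0.3 |
| Tau2 | 0.2 |
| LiveCodeBench | 0.1 |
| AIME | 0.1 |
| HLE | 0.1 |
| SciCode | 0.1 |
| AIME 25 | 0.0 |
| LCR | 0.0 |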
LLM Stats Category Scores
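| Category | Score |
|---|---|
| Structured Output | 80 |
| Instruction Following | 80 |
| Finance | 60 |
| General | 60 |
| Healthcare | 60 |
| Language | 60 |
| Legal | 60 |
| Math | 60 |
| Reasoning | 60 |
| Tool Calling | 50 |
| Biology | 30 |
| Chemistry | 30 |
| Physics | 30 |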
Pricing
| Metric | Value |
|---|---|
| Input Price | $0.15 / 1M tokens |
| Output Price | $0.15 / 1M tokens |
| Blended Price (3:1) | $0.15 / 1M tokens |
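The blended figure can be reproduced with a weighted average. The sketch below assumes that "3:1" means three input tokens for every output token, which this page does not state explicitly; the function names `blended_price` and `request_cost` are illustrative and not part of any provider API.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    Assumption: the 3:1 ratio is input tokens to output tokens.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float, output_price: float) -> float:
    """Cost in USD of one request, given prices quoted per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Prices listed on this page, in USD per 1M tokens.
INPUT_PRICE = 0.15
OUTPUT_PRICE = 0.15

blended = blended_price(INPUT_PRICE, OUTPUT_PRICE)
example_cost = request_cost(3_000_000, 1_000_000, INPUT_PRICE, OUTPUT_PRICE)
print(f"blended: ${blended:.2f} / 1M tokens")
print(f"3M input + 1M output tokens: ${example_cost:.2f}")
```

Because the input and output prices are identical here, any weighting yields the same blended price; the ratio only matters for models whose two prices differ.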
Speed
| Metric | Value |
|---|---|
| Tokens/sec | 52.1 tokens/s |
| Time to First Token | 0.68s |
| Time to Answer | 0.68s |
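As a rough illustration of how these speed figures combine, the sketch below estimates total response time as time to first token plus output length divided by throughput. It assumes that throughput stays constant for the whole response and that the tokens/sec figure describes output tokens, neither of which this page confirms; `estimate_response_seconds` is a hypothetical helper.

```python
def estimate_response_seconds(output_tokens: int,
                              tokens_per_second: float = 52.1,
                              time_to_first_token: float = 0.68) -> float:
    """Estimate wall-clock time for a response of `output_tokens` tokens.

    Assumption: constant output throughput after the first token.
    Defaults are the figures listed on this page.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return time_to_first_token + output_tokens / tokens_per_second


for n in (100, 500, 2000):
    print(f"{n} tokens: ~{estimate_response_seconds(n):.1f} s")
```

Under these assumptions a 500-token answer would take roughly 10 seconds; real latency depends on the provider, hardware, and load.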
Available Providers
(LS internal units)
No provider data available