Llama 3.1 Instruct 8B
Meta · Llama · Open Weight · Llama 3.1 Community License
Description
Llama 3.1 8B Instruct is a multilingual large language model optimized for dialogue use cases. It features a 128K context length, state-of-the-art tool use, and strong reasoning capabilities.
Release Date
2024-07-23
Parameters
8.0B
Context Length
128K
Modalities
text
Capability Radar
General: 22
Coding: 8
Reasoning: 14
Science (est.): 17
Agents: 50
Multimodal: 0
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 420 | 10.0 | AA |
| General Ranking | 418 | 23.0 | AA |
| Math Reasoning | 324 | 14.0 | AA |
| Reasoning | 26 | 83.0 | LS |
| Science | 435 | 17.0 | AA |
Benchmark Scores (LLM Stats)
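| Category | Benchmark | Score |
|---|---|---|
| Biology | GPQA | 30.4% SR |
| Code | HumanEval | 72.6% SR |
| Code | Gorilla Benchmark API Bench | 8.2% SR |
| Finance | MMLU (CoT) | 73.0% SR |
| Finance | MMLU | 69.4% SR |
| Finance | MMLU-Pro | 48.3% SR |
| General | ARC-C | 83.4% SR |
| General | IFEval | 80.4% SR |
| General | BFCL | 76.1% SR |
| General | MBPP EvalPlus (base) | 72.8% SR |
| General | Multipl-E MBPP | 52.4% SR |
| General | Multipl-E HumanEval | 50.8% SR |
| General | Nexus | 38.5% SR |
| Math | GSM-8K (CoT) | 84.5% SR |
| Math | Multilingual MGSM (CoT) | 68.9% SR |
| Math | DROP | 59.5% SR |
| Math | MATH (CoT) | 51.9% SR |
| Reasoning | API-Bank | 82.6% SR |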
AA Evaluation Indices
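| Index | Score |
|---|---|
| Intelligence Index | 11.8 |
| Coding Index | 4.9 |
| Math Index | 4.3 |
| MATH-500 | 0.5 |
| MMLU-Pro | 0.5 |
| IFBench | 0.3 |
| GPQA | 0.3 |
| Tau2 | 0.2 |
| LCR | 0.2 |
| SciCode | 0.1 |
| LiveCodeBench | 0.1 |
| AIME | 0.1 |
| HLE | 0.1 |
| AIME 25 | 0.0 |
| TerminalBench Hard | 0.0 |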
LLM Stats Category Scores
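| Category | Score |
|---|---|
| Structured Output | 80 |
| Instruction Following | 80 |
| Finance | 60 |
| General | 60 |
| Healthcare | 60 |
| Language | 60 |
| Legal | 60 |
| Math | 60 |
| Reasoning | 60 |
| Tool Calling | 50 |
| Code | 40 |
| Biology | 30 |
| Chemistry | 30 |
| Physics | 30 |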
Pricing
Input Price: $0.1 / 1M tokens
Output Price: $0.1 / 1M tokens
Blended Price (3:1): $0.1 / 1M tokens
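The blended figure can be reproduced with a weighted average. The sketch below assumes the 3:1 ratio means three input tokens for every output token, which is a common convention but is not spelled out on this page; the function names `blended_price` and `request_cost` are illustrative only.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    Assumption: the 3:1 ratio is input:output tokens.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float, output_price: float) -> float:
    """Cost in dollars of one request, given prices per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Prices listed on this page, in dollars per 1M tokens.
    print(f"blended: ${blended_price(0.1, 0.1):.2f} / 1M tokens")
    # Hypothetical request: 3,000 prompt tokens and 1,000 completion tokens.
    print(f"request: ${request_cost(3000, 1000, 0.1, 0.1):.6f}")
```

Because the input and output prices are equal here, any weighting gives the same blended price; the ratio only matters for models whose output tokens cost more than input tokens.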
Speed
Tokens/sec: 188.5
Time to First Token: 0.47s
Time to Answer: 0.47s
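As a rough guide to what these numbers mean for a full response, the sketch below estimates end-to-end latency as time to first token plus generation time at the listed throughput. This is a simplification that assumes constant decoding speed; `estimated_response_time` is a hypothetical helper, not a metric defined by this page.

```python
def estimated_response_time(output_tokens: int, ttft_s: float,
                            tokens_per_s: float) -> float:
    """Approximate seconds until the last token arrives.

    Assumption: tokens stream at a constant rate after the first token.
    """
    if tokens_per_s <= 0:
        raise ValueError("tokens_per_s must be positive")
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return ttft_s + output_tokens / tokens_per_s


if __name__ == "__main__":
    # Speed figures listed on this page; the response lengths are hypothetical.
    for n in (100, 500, 2000):
        seconds = estimated_response_time(n, ttft_s=0.47, tokens_per_s=188.5)
        print(f"{n:>5} tokens: ~{seconds:.2f}s")
```

Real latency also depends on prompt length, provider load, and batching, so treat the result as an order-of-magnitude estimate.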
Available Providers
(LS internal units) No provider data available