Skip to main content

Llama 2 Chat 70B

MetaLlama
Release Date
2023-07-18
Parameters
Context Length
131K
Modalities
text

Capability Radar

15
general
10
coding
16
reasoning
24
scienceest.
80
agents
0
multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain#RankScoreSource
Code Ranking449
10.0
AA
General Ranking474
16.0
AA
Math Reasoning325
14.0
AA
Science456
17.0
AA

Benchmark Scores (LLM Stats)

Biology

GPQA50.5%SR

Code

HumanEval88.4%SR

Finance

MMLU86.0%SR
MMLU-Pro68.9%SR

General

IFEval92.1%SR
MBPP EvalPlus87.6%SR
BFCL v277.3%SR

Math

MGSM91.1%SR
MATH77.0%SR

AA Evaluation Indices

Intelligence Index
3.0
Mmlu Pro
0.4
Gpqa
0.3
Math 500
0.3
Livecodebench
0.1
Hle
0.1
Aime
0.0

LLM Stats Category Scores

Instruction Following
90
Structured Output
90
Code
90
Language
80
Legal
80
Math
80
Reasoning
80
Finance
80
Healthcare
80
Tool Calling
80
General
70
Physics
50
Biology
50
Chemistry
50

Pricing

Input PriceFree
Output PriceFree
Blended Price (3:1)Free

Speed

Tokens/sec0.0
Time to First Token0.00s
Time to Answer0.00s

Provider Price Ranking

No provider data available

External Sources