Skip to main content

Llama 65B

MetaLlama
Release Date
2023-02-24
Parameters
Context Length
131K
Modalities
text

Capability Radar

2
general
90
coding
80
reasoning
43
scienceest.
80
agents
0
multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain#RankScoreSource
General Ranking532
2.0
AA

Benchmark Scores (LLM Stats)

Biology

GPQA50.5%SR

Code

HumanEval88.4%SR

Finance

MMLU86.0%SR
MMLU-Pro68.9%SR

General

IFEval92.1%SR
MBPP EvalPlus87.6%SR
BFCL v277.3%SR

Math

MGSM91.1%SR
MATH77.0%SR

AA Evaluation Indices

Intelligence Index
2.1

LLM Stats Category Scores

Instruction Following
90
Structured Output
90
Code
90
Language
80
Legal
80
Math
80
Reasoning
80
Finance
80
Healthcare
80
Tool Calling
80
General
70
Physics
50
Biology
50
Chemistry
50

Pricing

Input PriceFree
Output PriceFree
Blended Price (3:1)Free

Speed

Tokens/sec0.0
Time to First Token0.00s
Time to Answer0.00s

Provider Price Ranking

No provider data available

External Sources