Llama 3.1 Instruct 405B
MetaLlamaOpen WeightLlama 3.1 Community License
描述
Llama 3.1 405B Instruct is a large language model optimized for multilingual dialogue use cases. It outperforms many available open source and closed chat models on common industry benchmarks. The model supports 8 languages and has a 128K token context length.
发布日期
2024-07-23
参数规模
405.0B
上下文长度
—
支持模态
text
能力雷达图
32
general
22
coding
23
reasoning
34
science估算
70
agents
0
multimodal
Science 在缺少专门科学评测时使用推理能力代理估算。
排行榜排名
基准测试分数 (LLM Stats)
Biology
GPQA
50.7%自报
Code
HumanEval
89.0%自报
Gorilla Benchmark API Bench
35.3%自报
Finance
MMLU (CoT)
88.6%自报
MMLU
87.3%自报
MMLU-Pro
73.3%自报
General
ARC-C
96.9%自报
MBPP EvalPlus
88.6%自报
IFEval
88.6%自报
BFCL
88.5%自报
Multipl-E HumanEval
75.2%自报
Multipl-E MBPP
65.7%自报
Nexus
58.7%自报
Math
GSM8k
96.8%自报
Multilingual MGSM (CoT)
91.6%自报
DROP
84.8%自报
MATH
73.8%自报
Reasoning
API-Bank
92.0%自报
AA 评测指数
Intelligence Index17.4
Coding Index14.5
Math Index3.0
Mmlu Pro0.7
Math 5000.7
Gpqa0.5
Ifbench0.4
Livecodebench0.3
Scicode0.3
Lcr0.2
Aime0.2
Tau20.2
Terminalbench Hard0.1
Hle0.0
Aime 250.0
LLM Stats 分类评分
Structured Output90
Instruction Following90
Math90
Finance80
General80
Healthcare80
Language80
Legal80
Reasoning80
Tool Calling70
Code60
Biology50
Chemistry50
Physics50
定价
输入价格$2.75 / 1M tokens
输出价格$6.5 / 1M tokens
混合价格(3:1)$3.688 / 1M tokens
速度
Tokens/秒31.5 tokens/s
首Token延迟0.69s
首回答延迟0.69s
可用提供商
(LS 内部计价单位)暂无提供商数据