Llama 3.2 Instruct 11B (Vision)
Meta · Llama · Open weights · Llama 3.2 Community License
Description
Llama 3.2 11B Vision Instruct is an instruction-tuned multimodal large language model optimized for visual recognition, image reasoning, captioning, and answering general questions about an image. It accepts text and images as input and generates text as output.
Release date: 2024-09-25
Parameters: 10.6B
Context length: 131K
Supported modalities: image, text
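Capability radar chart
General: 17
Coding: 11
Reasoning: 13
Science (estimated): 15
Agents: 12
Multimodal: 90
Note: when no dedicated science evaluation is available, the Science score is estimated using reasoning ability as a proxy.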
Leaderboard rankings
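Benchmark scores (LLM Stats)
Biology
  GPQA: 32.8% (self-reported)
Finance
  MMLU: 73.0% (self-reported)
General
  MMMU: 50.7% (self-reported)
  MMMU-Pro: 33.0% (self-reported)
Image To Text
  DocVQA: 88.4% (self-reported)
  VQAv2 (test): 75.2% (self-reported)
Math
  MGSM: 68.9% (self-reported)
  MATH: 51.9% (self-reported)
  MathVista: 51.5% (self-reported)
Multimodal
  AI2D: 91.1% (self-reported)
  ChartQA: 83.4% (self-reported)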
AA evaluation index
Intelligence Index: 3.3
Math Index: 1.7
MATH 500: 0.5
MMLU Pro: 0.5
IFBench: 0.3
GPQA: 0.2
Tau2: 0.1
LCR: 0.1
SciCode: 0.1
LiveCodeBench: 0.1
AIME: 0.1
HLE: 0.1
AIME 25: 0.0
TerminalBench Hard: 0.0
LLM Stats category scores
Image To Text: 90
Language: 70
Legal: 70
Multimodal: 70
Finance: 70
Vision: 70
Math: 60
Reasoning: 60
Healthcare: 60
General: 50
Physics: 30
Biology: 30
Chemistry: 30
Pricing
Input price: $0.245 / 1M tokens
Output price: $0.245 / 1M tokens
Blended price (3:1): $0.245 / 1M tokens
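The blended figure can be reproduced as a weighted average of the input and output prices. The sketch below assumes that "3:1" means three input tokens for every output token; the page does not define the ratio, and the function name `blended_price` is hypothetical.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    Assumption: the 3:1 ratio is input tokens to output tokens.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Listed prices for this model, in USD per 1M tokens.
price = blended_price(0.245, 0.245)
print(f"${price:.3f} / 1M tokens")
```

Because the input and output prices are identical here, any weighting returns the same $0.245; the ratio only matters for providers that price the two directions differently.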
Speed
Tokens per second: 85.7
Time to first token: 0.55s
Time to first answer: 0.55s
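These two figures can be combined into a rough estimate of how long a full response takes. This is a minimal sketch under simplifying assumptions not stated on the page: a constant decode rate, a fixed time to first token, and no queueing or network overhead. The function name `estimated_response_seconds` is hypothetical.

```python
def estimated_response_seconds(output_tokens: int,
                               tokens_per_second: float = 85.7,
                               first_token_latency_s: float = 0.55) -> float:
    """Rough wall-clock estimate: first-token latency plus steady-state decoding.

    Assumption: throughput stays constant for the whole response.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return first_token_latency_s + output_tokens / tokens_per_second


# Estimate for a hypothetical 500-token answer.
print(f"{estimated_response_seconds(500):.2f} s")
```

Measured latency varies by provider and load, so treat the result as an order-of-magnitude guide.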
Provider price ranking
10 providers
Cheapest: Cloudflare Workers AI · Most expensive: Azure
Rank  Provider                           Input     Output
1     Cloudflare Workers AI (cheapest)   $0.0485   $0.676
2     Kilo Gateway                       $0.049    $0.049
3     Cloudflare AI Gateway              $0.049    $0.68
4     Inference                          $0.055    $0.055
5     LLM Gateway                        $0.07     $0.33
6     Vercel AI Gateway                  $0.16     $0.16
7     Meta (primary)                     $0.245    $0.245
8     OpenRouter                         $0.345    $0.345
9     Azure Cognitive Services           $0.37     $0.37
10    Azure                              $0.37     $0.37
Compare this model's pricing across API providers.
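The listing above appears to be ordered by input price, so a provider with a low input price and a high output price can rank first while costing more on output-heavy workloads. The sketch below re-ranks the listed providers by the total cost of a hypothetical workload. It assumes the table prices are in USD per 1M tokens, as in the pricing section; the workload size and the names `workload_cost` and `rank_by_workload` are illustrative.

```python
# (provider, input price, output price), copied from the table above.
# Assumption: prices are USD per 1M tokens.
PROVIDERS = [
    ("Cloudflare Workers AI", 0.0485, 0.676),
    ("Kilo Gateway", 0.049, 0.049),
    ("Cloudflare AI Gateway", 0.049, 0.68),
    ("Inference", 0.055, 0.055),
    ("LLM Gateway", 0.07, 0.33),
    ("Vercel AI Gateway", 0.16, 0.16),
    ("Meta", 0.245, 0.245),
    ("OpenRouter", 0.345, 0.345),
    ("Azure Cognitive Services", 0.37, 0.37),
    ("Azure", 0.37, 0.37),
]


def workload_cost(input_price: float, output_price: float,
                  input_mtok: float, output_mtok: float) -> float:
    """Total USD cost for a workload measured in millions of tokens."""
    return input_price * input_mtok + output_price * output_mtok


def rank_by_workload(input_mtok: float, output_mtok: float):
    """Return (provider, cost) pairs sorted from cheapest to most expensive."""
    costs = [(name, workload_cost(inp, out, input_mtok, output_mtok))
             for name, inp, out in PROVIDERS]
    return sorted(costs, key=lambda pair: pair[1])


# Hypothetical workload: 3M input tokens and 1M output tokens.
for name, cost in rank_by_workload(3.0, 1.0):
    print(f"{name:28s} ${cost:.4f}")
```

For this example workload the order changes: the flat-priced gateways move ahead of the two Cloudflare entries, whose output prices dominate the total.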