Llama 3.2 Instruct 11B (Vision)
Meta · Llama · Open weights · Llama 3.2 Community License
Description
Llama 3.2 11B Vision Instruct is an instruction-tuned multimodal large language model optimized for visual recognition, image reasoning, captioning, and answering general questions about an image. It accepts text and images as input and generates text as output.
Release date: 2024-09-25
Parameters: 10.6B
Context length: 131K
Supported modalities: image, text
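Capability radar chart
general: 17
coding: 11
reasoning: 13
science (estimated): 15
agents: 12
multimodal: 90
When no dedicated science benchmark is available, the Science score is estimated using reasoning ability as a proxy.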
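Leaderboard rankings
Benchmark scores (LLM Stats)
Biology
  GPQA: 32.8% (self-reported)
Finance
  MMLU: 73.0% (self-reported)
General
  MMMU: 50.7% (self-reported)
  MMMU-Pro: 33.0% (self-reported)
Image To Text
  DocVQA: 88.4% (self-reported)
  VQAv2 (test): 75.2% (self-reported)
Math
  MGSM: 68.9% (self-reported)
  MATH: 51.9% (self-reported)
  MathVista: 51.5% (self-reported)
Multimodal
  AI2D: 91.1% (self-reported)
  ChartQA: 83.4% (self-reported)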
AA evaluation indices
Intelligence Index: 3.3
Math Index: 1.7
Math 500: 0.5
Mmlu Pro: 0.5
Ifbench: 0.3
Gpqa: 0.2
Tau2: 0.1
Lcr: 0.1
Scicode: 0.1
Livecodebench: 0.1
Aime: 0.1
Hle: 0.1
Aime 25: 0.0
Terminalbench Hard: 0.0
LLM Stats category scores
Image To Text: 90
Language: 70
Legal: 70
Multimodal: 70
Finance: 70
Vision: 70
Math: 60
Reasoning: 60
Healthcare: 60
General: 50
Physics: 30
Biology: 30
Chemistry: 30
Pricing
Input price: $0.245 / 1M tokens
Output price: $0.245 / 1M tokens
Blended price (3:1): $0.245 / 1M tokens
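The page does not define how the 3:1 blended price is computed. A minimal sketch, assuming it is a weighted average with three parts input price to one part output price (the function name and weights below are illustrative, not taken from the page):

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of input and output prices, in USD per 1M tokens.

    Assumption: "3:1" means input tokens are weighted three times as
    heavily as output tokens. The page itself does not state this.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Listed input and output prices for this model are both $0.245 / 1M tokens.
meta_blended = blended_price(0.245, 0.245)
print(f"${meta_blended:.3f} / 1M tokens")
```

Because the listed input and output prices are equal, any weighting gives the same blended figure, which is consistent with the $0.245 shown. The weighting only matters for providers whose input and output prices differ.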
Speed
Tokens/second: 85.7
Time to first token: 0.55s
Time to first answer: 0.55s
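These two figures can be combined into a rough estimate of end-to-end response time. The sketch below assumes a simple model in which the response takes the first-token latency plus the output length divided by a constant throughput; real latency varies with provider, load, and prompt size, and the function name is hypothetical.

```python
def estimated_response_seconds(output_tokens: int,
                               tokens_per_second: float = 85.7,
                               first_token_latency_s: float = 0.55) -> float:
    """Rough response time: first-token latency plus steady-state generation.

    Defaults are the speed figures listed on this page. The linear model
    itself is an assumption, not something the page specifies.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return first_token_latency_s + output_tokens / tokens_per_second


# Example: estimate the time for a 500-token answer.
print(f"{estimated_response_seconds(500):.2f} s")
```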
Provider price ranking
10 providers
Cheapest: Cloudflare Workers AI · Most expensive: Azure

Prices per 1M tokens
Rank  Provider                          Input    Output
1     Cloudflare Workers AI (cheapest)  $0.0485  $0.676
2     Kilo Gateway                      $0.049   $0.049
3     Cloudflare AI Gateway             $0.049   $0.68
4     Inference                         $0.055   $0.055
5     LLM Gateway                       $0.07    $0.33
6     Vercel AI Gateway                 $0.16    $0.16
7     Meta (primary)                    $0.245   $0.245
8     OpenRouter                        $0.345   $0.345
9     Azure Cognitive Services          $0.37    $0.37
10    Azure                             $0.37    $0.37
Compare this model's pricing across API providers.
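The ranking above appears to be ordered by input price, so the cheapest provider for a given workload may differ once output tokens are counted. A minimal sketch of that comparison, assuming the listed prices are USD per 1M tokens (as in the Pricing section) and using hypothetical helper names:

```python
# (input price, output price) copied from the provider list on this page.
# Assumption: both are USD per 1M tokens.
PROVIDER_PRICES = {
    "Cloudflare Workers AI": (0.0485, 0.676),
    "Kilo Gateway": (0.049, 0.049),
    "Cloudflare AI Gateway": (0.049, 0.68),
    "Inference": (0.055, 0.055),
    "LLM Gateway": (0.07, 0.33),
    "Vercel AI Gateway": (0.16, 0.16),
    "Meta": (0.245, 0.245),
    "OpenRouter": (0.345, 0.345),
    "Azure Cognitive Services": (0.37, 0.37),
    "Azure": (0.37, 0.37),
}


def workload_cost(input_tokens: int, output_tokens: int,
                  input_price: float, output_price: float) -> float:
    """Cost in USD for a workload, given per-1M-token prices."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def rank_providers(input_tokens: int, output_tokens: int) -> list[tuple[str, float]]:
    """Return (provider, cost) pairs sorted from cheapest to most expensive."""
    costs = {
        name: workload_cost(input_tokens, output_tokens, in_price, out_price)
        for name, (in_price, out_price) in PROVIDER_PRICES.items()
    }
    # sorted() is stable, so tied providers keep their listed order.
    return sorted(costs.items(), key=lambda item: item[1])


# Example: a workload with equal input and output volume.
for name, cost in rank_providers(1_000_000, 1_000_000):
    print(f"{name}: ${cost:.4f}")
```

For an input-only workload this reproduces the listed order, while a workload with substantial output moves the providers with higher output prices down the ranking.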