Llama 3.2 Instruct 11B (Vision)

MetaLlamaオープンウエイトLlama 3.2 Community License

説明

Llama 3.2 11B Vision Instruct is an instruction-tuned multimodal large language model optimized for visual recognition, image reasoning, captioning, and answering general questions about an image. It accepts text and images as input and generates text as output.

リリース日

2024-09-25

パラメータ

10.6B

コンテキスト長

131K

モダリティ

image, text

能力レーダー

general

coding

reasoning

science推定

agents

multimodal

専門的な科学ベンチマークが利用できない場合、Scienceは推論プロキシを使用して推定します。

ベンチマークスコア (LLM Stats)

Biology

GPQA

32.8%自己申告

Finance

MMLU

73.0%自己申告

General

MMMU

50.7%自己申告

MMMU-Pro

33.0%自己申告

Image To Text

DocVQA

88.4%自己申告

VQAv2 (test)

75.2%自己申告

Math

MGSM

68.9%自己申告

MATH

51.9%自己申告

MathVista

51.5%自己申告

Multimodal

AI2D

91.1%自己申告

ChartQA

83.4%自己申告

AA評価指数

Intelligence Index

3.3

Math Index

1.7

Math 500

0.5

Mmlu Pro

0.5

Ifbench

0.3

Gpqa

0.2

Tau2

0.1

Lcr

0.1

Scicode

0.1

Livecodebench

0.1

Aime

0.1

Hle

0.1

Aime 25

0.0

Terminalbench Hard

0.0

LLM Statsカテゴリスコア

Image To Text

Language

Legal

Multimodal

Finance

Vision

Math

Reasoning

Healthcare

General

Physics

Biology

Chemistry

価格設定

入力価格$0.245 / 1Mトークン

出力価格$0.245 / 1Mトークン

混合価格（3:1）$0.245 / 1Mトークン

速度

トークン/秒85.7

初トークン遅延0.55s

初回答遅延0.55s

プロバイダー価格ランキング

10 プロバイダー

最安: Cloudflare Workers AI最高: Azure

プロバイダー入力出力

1Cloudflare Workers AI最安

$0.0485

$0.676

2Kilo Gateway

$0.049

3Cloudflare AI Gateway

$0.049

$0.68

4Inference

$0.055

5LLM Gateway

$0.07

$0.33

6Vercel AI Gateway

$0.16

7Metaプライマリ

$0.245

8OpenRouter

$0.345

9Azure Cognitive Services

$0.37

10Azure

$0.37

このモデルの異なるAPIプロバイダー間の価格を比較。

外部リンク

LLM Stats Artificial Analysis

ドメイン	#順位	スコア	ソース
コーディングランキング	453	9.0	AA
総合ランキング	451	20.0	AA
数学的推論	328	13.0	AA
マルチモーダルランキング	26	84.0	LS
科学	474	14.0	AA