Llama 3.2 Instruct 11B (Vision)

MetaLlamaOpen WeightLlama 3.2 Community License

Description

Llama 3.2 11B Vision Instruct is an instruction-tuned multimodal large language model optimized for visual recognition, image reasoning, captioning, and answering general questions about an image. It accepts text and images as input and generates text as output.

Release Date

2024-09-25

Parameters

10.6B

Context Length

131K

Modalities

image, text

Capability Radar

general

coding

reasoning

scienceest.

agents

multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	#Rank	Score	Source
Code Ranking	453	9.0	AA
General Ranking	451	20.0	AA
Math Reasoning	328	13.0	AA
Multimodal Ranking	26	84.0	LS
Science	474	14.0	AA

Benchmark Scores (LLM Stats)

Biology

GPQA

32.8%SR

Finance

MMLU

73.0%SR

General

MMMU

50.7%SR

MMMU-Pro

33.0%SR

Image To Text

DocVQA

88.4%SR

VQAv2 (test)

75.2%SR

Math

MGSM

68.9%SR

MATH

51.9%SR

MathVista

51.5%SR

Multimodal

AI2D

91.1%SR

ChartQA

83.4%SR

AA Evaluation Indices

Intelligence Index

3.3

Math Index

1.7

Math 500

0.5

Mmlu Pro

0.5

Ifbench

0.3

Gpqa

0.2

Tau2

0.1

Lcr

0.1

Scicode

0.1

Livecodebench

0.1

Aime

0.1

Hle

0.1

Aime 25

0.0

Terminalbench Hard

0.0

LLM Stats Category Scores

Image To Text

Language

Legal

Multimodal

Finance

Vision

Math

Reasoning

Healthcare

General

Physics

Biology

Chemistry

Pricing

Input Price$0.245 / 1M tokens

Output Price$0.245 / 1M tokens

Blended Price (3:1)$0.245 / 1M tokens

Speed

Tokens/sec85.7

Time to First Token0.55s

Time to Answer0.55s

Provider Price Ranking

10 providers

Cheapest: Cloudflare Workers AIMost Expensive: Azure

ProviderInputOutput

1Cloudflare Workers AICheapest

$0.0485

$0.676

2Kilo Gateway

$0.049

3Cloudflare AI Gateway

$0.049

$0.68

4Inference

$0.055

5LLM Gateway

$0.07

$0.33

6Vercel AI Gateway

$0.16

7MetaPRIMARY

$0.245

8OpenRouter

$0.345

9Azure Cognitive Services

$0.37

10Azure

$0.37

Compare pricing across different API providers for this model.

External Sources

LLM Stats Artificial Analysis