Pixtral-12B

Mistral AIOpen WeightApache 2.0 · Commercial OK

Description

A 12B parameter multimodal model with a 400M parameter vision encoder, capable of understanding both natural images and documents. Excels at multimodal tasks while maintaining strong text-only performance. Supports variable image sizes and multiple images in context.

Release Date

2024-09-17

Parameters

12.4B

Context Length

128K

Modalities

image, text

Capability Radar

general

coding

reasoning

scienceest.

agents

multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	#Rank	Score	Source
Multimodal Ranking	48	76.0	LS

Benchmark Scores (LLM Stats)

Code

HumanEval

72.0%SR

Communication

MT-Bench

0.77 / 100SR

MM-MT-Bench

0.60 / 100SR

Finance

MMLU

69.2%SR

General

IFEval

61.3%SR

MMMU

52.5%SR

Image To Text

DocVQA

90.7%SR

VQAv2

78.6%SR

Math

MathVista

58.0%SR

MATH

48.1%SR

Multimodal

ChartQA

81.8%SR

MM IF-Eval

52.7%SR

AA Evaluation Indices

No AA evaluation data available

LLM Stats Category Scores

Image To Text

Roleplay

Creativity

Language

Legal

Multimodal

Reasoning

Finance

Code

Communication

Vision

Instruction Following

Math

Structured Output

General

Healthcare

Pricing

Input Price$0.15 / 1M tokens

Output Price$0.15 / 1M tokens

Blended Price (3:1)$0.15 / 1M tokens

Speed

No speed data available

Provider Price Ranking

4 providers

Cheapest: Mistral AIMost Expensive: Scaleway

ProviderInputOutput

1Mistral AIPRIMARY

$0.15

2Mistral

$0.15

3Vercel AI Gateway

$0.15

4Scaleway

$0.2

Compare pricing across different API providers for this model.

External Sources

LLM Stats Artificial Analysis