Pixtral-12B

Mistral AI · Open Weight · Apache 2.0 · Commercial Use

Description

A 12B-parameter multimodal model with a 400M-parameter vision encoder, capable of understanding both natural images and documents. It excels at multimodal tasks while maintaining strong text-only performance, and supports variable image sizes and multiple images in context.

Release date

2024-09-17

Parameters

12.4B

Context length

128K

Modalities

image, text

Capability radar

[Radar chart; axes: general, coding, reasoning, science (est.), agents, multimodal]

Science uses a reasoning proxy when dedicated science benchmarks are not available.

Rankings

Domain	Rank	Score	Source
Multimodal	48	76.0	LS

Benchmark scores (LLM Stats)

Category	Benchmark	Score	Note
Code	HumanEval	72.0%	Aut.
Communication	MT-Bench	0.77 / 100	Aut.
Communication	MM-MT-Bench	0.60 / 100	Aut.
Finance	MMLU	69.2%	Aut.
General	IFEval	61.3%	Aut.
General	MMMU	52.5%	Aut.
Image To Text	DocVQA	90.7%	Aut.
Image To Text	VQAv2	78.6%	Aut.
Math	MathVista	58.0%	Aut.
Math	MATH	48.1%	Aut.
Multimodal	ChartQA	81.8%	Aut.
Multimodal	MM IF-Eval	52.7%	Aut.

AA evaluation indices

No AA evaluation data available

LLM Stats category scores

[Chart; categories: Image To Text, Roleplay, Creativity, Language, Legal, Multimodal, Reasoning, Finance, Code, Communication, Vision, Instruction Following, Math, Structured Output, General, Healthcare]

Pricing

Input price: $0.15 / 1M tokens

Output price: $0.15 / 1M tokens

Blended price (3:1): $0.15 / 1M tokens
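As a rough illustration of how a 3:1 blended figure can be derived, the sketch below assumes the ratio means three input tokens for every output token and takes a weighted average of the two listed prices. That reading of "3:1" is an assumption, and the function names `blended_price` and `request_cost` are hypothetical helpers, not part of any provider's API.

```python
import math

# Listed prices in USD per 1M tokens (taken from the pricing section).
INPUT_PRICE = 0.15
OUTPUT_PRICE = 0.15


def blended_price(input_price: float, output_price: float,
                  input_weight: int = 3, output_weight: int = 1) -> float:
    """Weighted average price per 1M tokens.

    Assumption: the 3:1 ratio means 3 input tokens per 1 output token.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float, output_price: float) -> float:
    """Estimated cost in USD for one request, given per-1M-token prices."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    blended = blended_price(INPUT_PRICE, OUTPUT_PRICE)
    print(f"Blended price: ${blended:.2f} / 1M tokens")

    # Hypothetical workload: 30,000 input tokens and 10,000 output tokens.
    cost = request_cost(30_000, 10_000, INPUT_PRICE, OUTPUT_PRICE)
    print(f"Estimated request cost: ${cost:.4f}")

    # With equal input and output prices the blend equals either price.
    assert math.isclose(blended, INPUT_PRICE)
```

Because the input and output prices are identical here, any weighting gives the same blended value. The ratio only matters for models where the two prices differ.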

Speed

No speed data available

Provider price ranking

4 providers

Cheapest: Mistral AI · Most expensive: Scaleway

#	Provider	Input / Output
1	Mistral AI (primary)	$0.15
2	Mistral	$0.15
3	Vercel AI Gateway	$0.15
4	Scaleway	$0.20

Compare prices across different API providers for this model.

External sources

LLM Stats, Artificial Analysis