Mistral Large (Feb '24)

Mistral · Open Weight · Apache 2.0 · Commercial OK

Description

Mistral Large 3 (675B Instruct 2512) is a state-of-the-art, general-purpose multimodal granular Mixture-of-Experts model with 41B active parameters and 675B total parameters, trained from scratch on 3,000 H200s. This model is the instruct post-trained version in FP8, fine-tuned for instruction following, making it ideal for chat, agentic, and instruction-based use cases. The no-loss FP8 quantization reduces resource requirements, and the model can be deployed on a node of B200s or H200s. Designed for reliability and long-context comprehension, it is engineered for production-grade assistants, retrieval-augmented systems, scientific workloads, and complex enterprise workflows.

Release date
2024-02-26
Parameters
675.0B
Context length
128K
Modalities
image, text

Capability radar

general: 21
coding: 18
reasoning: 23
science (est.): 24
agents: 0
multimodal: 75

Science uses a reasoning proxy when dedicated science benchmarks are not available.

Rankings

Domain          | Rank | Score | Source
Code Ranking    | 355  | 19.0  | AA
General Ranking | 410  | 24.0  | AA
Math Reasoning  | 288  | 25.0  | AA
Science         | 404  | 23.0  | AA

Benchmark scores (LLM Stats)

Biology

GPQA: 43.9% (Aut.)

Code

LiveCodeBench: 34.4% (Aut.)

Factuality

SimpleQA: 23.8% (Aut.)

General

MMMLU: 85.5% (Aut.)

Math

AMC_2022_23: 52.0% (Aut.)

AA evaluation indices

Intelligence Index: 9.9
Math 500: 0.5
MMLU Pro: 0.5
GPQA: 0.4
SciCode: 0.2
LiveCodeBench: 0.2
HLE: 0.0
AIME: 0.0

LLM Stats category scores

Language: 90
Math: 70
General: 50
Reasoning: 50
Biology: 40
Chemistry: 40
Physics: 40
Code: 30
Factuality: 20

Pricing

Input price: $4 / 1M tokens
Output price: $12 / 1M tokens
Blended price (3:1): $6 / 1M tokens
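The blended price follows the usual 3:1 input-to-output token weighting. A minimal sketch of that arithmetic (the function name is illustrative, not from this page):

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: int = 3, output_weight: int = 1) -> float:
    """Weighted-average price per 1M tokens for a given input:output token mix."""
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight

# $4 input and $12 output at a 3:1 mix: (3*4 + 1*12) / 4 = $6 per 1M tokens
print(blended_price(4.0, 12.0))
```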

Speed

Tokens/sec: 0.0 tokens/s
Time to first token: 0.00s
Response time: 0.00s

Available providers

(LS internal units)
Provider   | Input price | Output price
Mistral AI | 500K        | 1.5M

External sources