跳轉到主要內容

Mistral Large 3

MistralMistralOpen WeightApache 2.0 · Commercial OK

描述

Mistral Large 3 (675B Instruct 2512 Eagle) is a state-of-the-art general-purpose Multimodal granular Mixture-of-Experts model with 41B active parameters and 675B total parameters trained from scratch with 3000 H200s. This model is the base pre-trained version, not fine-tuned for instruction or reasoning tasks, making it ideal for custom post-training processes. Designed for reliability and long-context comprehension - It is engineered for production-grade assistants, retrieval-augmented systems, scientific workloads, and complex enterprise workflows. This model is the Eagle speculator for Mistral Large 3 Instruct. Depending on the task, you can expect noticeable speed-ups on your generations.

發布日期
2025-12-02
參數規模
675.0B
上下文長度
支援模態
image, text

能力雷達圖

37
general
32
coding
43
reasoning
44
science估算
0
agents
85
multimodal

Science 在缺少專門科學評測時使用推理能力代理估算。

排行榜排名

領域#排名分數來源
代码能力榜204
40.0
AA
通用能力榜238
43.0
AA
数学推理230
38.0
AA
科学能力216
46.0
AA

基準測試分數 (LLM Stats)

Biology

GPQA43.9%自報

Code

LiveCodeBench34.4%自報

Factuality

SimpleQA23.8%自報

General

MMMLU85.5%自報

Math

AMC_2022_2352.0%自報

AA 評測指數

Math Index
38.0
Intelligence Index
22.8
Coding Index
22.7
Mmlu Pro
0.8
Gpqa
0.7
Livecodebench
0.5
Aime 25
0.4
Scicode
0.4
Ifbench
0.4
Lcr
0.3
Tau2
0.2
Terminalbench Hard
0.2
Hle
0.0

LLM Stats 分類評分

Language
90
Math
70
General
50
Reasoning
50
Biology
40
Chemistry
40
Physics
40
Code
30
Factuality
20

定價

輸入價格$0.5 / 1M tokens
輸出價格$1.5 / 1M tokens
混合價格(3:1)$0.75 / 1M tokens

速度

Tokens/秒56.5 tokens/s
首Token延遲0.63s
首回答延遲0.63s

可用提供商

(LS 內部計價單位)

暫無提供商資料

外部連結