Mistral Large (Feb '24)
Mistral · Open Weight · Apache 2.0 · Commercial OK
Description
Mistral Large 3 (675B Instruct 2512) is a state-of-the-art, general-purpose, multimodal, granular Mixture-of-Experts model with 41B active parameters and 675B total parameters, trained from scratch on 3000 H200s. This is the instruct post-trained version in FP8, fine-tuned for instruction following, which makes it well suited to chat, agentic, and instruction-based use cases. The FP8 version is described as lossless and reduces resource requirements; it can be deployed on a node of B200s or H200s. The model is designed for reliability and long-context comprehension, and is engineered for production-grade assistants, retrieval-augmented systems, scientific workloads, and complex enterprise workflows.
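As a rough illustration of why FP8 matters for deployment, the sketch below estimates the memory needed for the weights alone from the parameter counts quoted above. It assumes 1 byte per parameter for FP8 and 2 bytes for BF16, and ignores the KV cache, activations, and runtime overhead, so actual requirements will be higher. The function name `weight_memory_gb` is hypothetical.

```python
# Weights-only memory estimate. Assumptions (not from the model card):
# 1 byte per parameter in FP8, 2 bytes in BF16, decimal gigabytes (1e9 bytes).
BYTES_PER_PARAM = {"fp8": 1, "bf16": 2}


def weight_memory_gb(num_params: float, dtype: str) -> float:
    """Return the approximate size of the weights in GB for the given dtype."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


TOTAL_PARAMS = 675e9   # total parameters, from the description
ACTIVE_PARAMS = 41e9   # parameters active per token, from the description

fp8_total_gb = weight_memory_gb(TOTAL_PARAMS, "fp8")
bf16_total_gb = weight_memory_gb(TOTAL_PARAMS, "bf16")
fp8_active_gb = weight_memory_gb(ACTIVE_PARAMS, "fp8")

print(f"FP8 weights (all experts): {fp8_total_gb:.0f} GB")
print(f"BF16 weights (all experts): {bf16_total_gb:.0f} GB")
print(f"FP8 weights touched per token: {fp8_active_gb:.0f} GB")
```

In a typical Mixture-of-Experts deployment all experts stay resident in memory, so the total parameter count, not the active count, drives the hardware footprint.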
Release date
2024-02-26
Parameter count
675.0B
Context length
128K
Supported modalities
image, text
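Capability radar chart
general: 21
coding: 18
reasoning: 23
science (estimated): 24
agents: 0
multimodal: 75
Science is estimated from reasoning ability as a proxy when no dedicated science benchmark is available.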
Leaderboard rankings
Benchmark scores (LLM Stats)
| Category | Benchmark | Score (self-reported) |
|---|---|---|
| Biology | GPQA | 43.9% |
| Code | LiveCodeBench | 34.4% |
| Factuality | SimpleQA | 23.8% |
| General | MMMLU | 85.5% |
| Math | AMC_2022_23 | 52.0% |
AA evaluation index
Intelligence Index: 9.9
Math 500: 0.5
MMLU Pro: 0.5
GPQA: 0.4
SciCode: 0.2
LiveCodeBench: 0.2
HLE: 0.0
AIME: 0.0
LLM Stats category scores
Language: 90
Math: 70
General: 50
Reasoning: 50
Biology: 40
Chemistry: 40
Physics: 40
Code: 30
Factuality: 20
Pricing
Input price: $4 / 1M tokens
Output price: $12 / 1M tokens
Blended price (3:1): $6 / 1M tokens
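The blended figure is consistent with a token-weighted average of the input and output prices at a 3:1 input-to-output ratio. The sketch below assumes that interpretation; the page does not spell out the formula, and the function name `blended_price` is hypothetical.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    Assumes the "3:1" label means 3 input tokens for every 1 output token.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Prices listed above, in USD per 1M tokens.
price = blended_price(4.0, 12.0)
print(f"${price:g} / 1M tokens")  # (3 * 4 + 1 * 12) / 4 = 6
```

Changing the weights shows how sensitive the blended price is to the workload mix: an output-heavy workload moves the figure toward the $12 output price.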
Speed
Tokens per second: 0.0 tokens/s
Time to first token: 0.00s
Time to first answer: 0.00s
Available providers
(LS internal pricing units)
| Provider | Input price | Output price |
|---|---|---|
| Mistral AI | 500K | 1.5M |