Mercury 2
InceptionProprietary
描述
Mercury 2 is the fastest reasoning LLM, built on diffusion-based language model (dLLM) architecture. Instead of generating text token-by-token, it refines multiple text blocks simultaneously, achieving over 1,000 tokens per second on Nvidia Blackwell GPUs — 5x faster than leading speed-optimized LLMs. Supports tool usage and JSON output with 128K context window.
發布日期
2026-02-20
參數規模
—
上下文長度
128K
支援模態
text
能力雷達圖
29
general
32
coding
77
reasoning
51
science估算
50
agents
0
multimodal
Science 在缺少專門科學評測時使用推理能力代理估算。
排行榜排名
基準測試分數 (LLM Stats)
Biology
GPQA
74.0%自報
SciCode
38.0%自報
Code
LiveCodeBench
67.0%自報
Communication
Tau2 Airline
53.0%自報
General
IFBench
71.0%自報
Math
AIME 2025
91.1%自報
AA 評測指數
Intelligence Index32.8
Coding Index30.6
Gpqa0.8
Tau20.7
Ifbench0.7
Scicode0.4
Lcr0.4
Terminalbench Hard0.3
Hle0.2
LLM Stats 分類評分
General70
Instruction Following70
Biology60
Chemistry60
Math60
Physics60
Reasoning60
Tool Calling50
Code50
Communication50
定價
輸入價格$0.25 / 1M tokens
輸出價格$0.75 / 1M tokens
混合價格(3:1)$0.375 / 1M tokens
速度
Tokens/秒881.5 tokens/s
首Token延遲3.71s
首回答延遲3.71s
可用提供商
(LS 內部計價單位)| 提供商 | 輸入價格 | 輸出價格 |
|---|---|---|
| Inception | 250K | 750K |