MAI-Thinking-1
Description
MAI-Thinking-1 is Microsoft AI's first in-house reasoning model: a sparse Mixture of Experts model with 35B active and roughly 1T total parameters (base model MAI-Base-1), trained from scratch without distillation from third-party models. Built with Microsoft's Hill-Climbing Machine pipeline, it was pre-trained on 30T tokens of clean, commercially licensed, human-generated data, plus a further 3.55T mid-training tokens. It was then post-trained with reinforcement learning across STEM, agentic coding, and helpfulness/safety specialists, which were consolidated into a single model. It delivers strong mathematical reasoning and software-engineering performance for its size, reaching 97.0% on AIME 2025 and performing competitively with Claude Opus 4.6 on SWE-Bench Pro. It supports a 256k-token context window, function calling, and developer instructions, and is preferred over Claude Sonnet 4.6 in blind human side-by-side evaluations.
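To put the parameter and token figures in the description in proportion, the arithmetic can be sketched as below. This is only an illustration: the constants are the numbers quoted in the description, the "~1T" total is treated as exactly 1T (an assumption, so the ratio is approximate), and the function names are hypothetical helpers rather than part of any Microsoft tooling.

```python
# Figures quoted in the description. "~1T" is approximate, so the derived
# ratio below is a rough estimate, not an official number.
ACTIVE_PARAMS = 35e9        # parameters active per token
TOTAL_PARAMS = 1e12         # total parameters (assumed to be exactly 1T here)
PRETRAIN_TOKENS = 30e12     # pre-training tokens
MIDTRAIN_TOKENS = 3.55e12   # mid-training tokens


def active_fraction(active: float, total: float) -> float:
    """Share of the model's parameters used for any single token."""
    return active / total


def total_training_tokens(pretrain: float, midtrain: float) -> float:
    """Combined pre-training and mid-training token count."""
    return pretrain + midtrain


if __name__ == "__main__":
    frac = active_fraction(ACTIVE_PARAMS, TOTAL_PARAMS)
    tokens = total_training_tokens(PRETRAIN_TOKENS, MIDTRAIN_TOKENS)
    print(f"Active fraction: {frac:.1%}")                          # 3.5%
    print(f"Tokens before post-training: {tokens / 1e12:.2f}T")    # 33.55T
```

Under these assumptions, about 3.5% of the parameters are active for each token, and the model saw roughly 33.55T tokens before reinforcement-learning post-training.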
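Capability Radar Chart
When no dedicated science benchmark is available, the Science score is estimated using reasoning ability as a proxy.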
Leaderboard Rankings
| Domain | Rank | Score | Source |
|---|---|---|---|
| Agentic Capability Models Leaderboard | 45 | 60.0 | LS |
Benchmark Scores (LLM Stats)
Categories: Agents, Biology, Code, Communication, Factuality, Finance, General, Healthcare, Long Context, Math, Safety
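AA Evaluation Index
No AA evaluation data available yet
LLM Stats Category Scores
Pricing
No pricing data available yet
Speed
No speed data available yet
Provider Price Ranking
No provider data available yet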