MAI-Thinking-1
Description
MAI-Thinking-1 is Microsoft AI's first in-house reasoning model: a sparse Mixture-of-Experts model with 35B active and ~1T total parameters (base model: MAI-Base-1), trained from scratch without distillation from third-party models. Built with Microsoft's Hill-Climbing Machine pipeline, it was pre-trained on 30T tokens of clean, commercially licensed, human-generated data (plus 3.55T mid-training tokens). It was then post-trained via reinforcement learning across STEM, agentic coding, and helpfulness/safety specialists, which were consolidated into a single model. It delivers strong mathematical reasoning and software-engineering performance for its weight class, competing closely with Claude Opus 4.6 on SWE-Bench Pro and reaching 97.0% on AIME 2025. It supports a 256k-token context window, function calling, and developer instructions. In blind human side-by-side evaluations, it is preferred over Claude Sonnet 4.6.
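Capability Radar Chart
The Science score is estimated using reasoning ability as a proxy when no dedicated science evaluation is available.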
Leaderboard Rankings
| Domain | Rank | Score | Source |
|---|---|---|---|
| Agentic capability model leaderboard | 45 | 60.0 | LS |
Benchmark Scores (LLM Stats)
Categories: Agents, Biology, Code, Communication, Factuality, Finance, General, Healthcare, Long Context, Math, Safety
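AA Evaluation Index
No AA evaluation data available.
LLM Stats Category Scores
Pricing
No pricing data available.
Speed
No speed data available.
Provider Price Ranking
No provider data available.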