Sarvam 105B (high)
SarvamOpen WeightApache 2.0 · Commercial OK
Description
Sarvam-105B is Sarvam AI's flagship open-source Mixture-of-Experts reasoning model built for complex reasoning, coding, and agentic workflows. It uses 128 sparse experts with Multi-head Latent Attention for efficient long-context inference and was pre-trained on 12 trillion tokens spanning code, mathematics, multilingual, and web data.
Release Date
2026-03-06
Parameters
105.0B
Context Length
131K
Modalities
text
Capability Radar
12
general
26
coding
74
reasoning
44
scienceest.
50
agents
0
multimodal
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Agentic Capability | 87 | 50.0 | LS |
| Code Ranking | 498 | 1.0 | AA |
| General Ranking | 380 | 29.0 | AA |
| Science | 222 | 46.0 | AA |
Benchmark Scores (LLM Stats)
Agents
BrowseComp
49.5%SR
Biology
GPQA
78.7%SR
Code
SWE-Bench Verified
45.0%SR
Creativity
Arena-Hard v2
71.0%SR
Finance
MMLU
90.6%SR
MMLU-Pro
81.7%SR
General
IFEval
84.8%SR
LiveCodeBench v6
71.7%SR
Math
MATH-500
98.6%SR
AIME 2025
96.7%SR
HMMT25
85.8%SR
HMMT 2025
85.8%SR
Beyond AIME
69.1%SR
Humanity's Last Exam
11.2%SR
AA Evaluation Indices
Intelligence Index11.9
Gpqa0.7
Tau20.5
Ifbench0.3
Scicode0.3
Hle0.1
Terminalbench Hard0.0
Lcr0.0
LLM Stats Category Scores
Language90
Legal90
Finance90
Healthcare90
Instruction Following80
Math80
Physics80
Structured Output80
General80
Biology80
Chemistry80
Reasoning70
Creativity70
Writing70
Search50
Frontend Development50
Agents50
Code50
Vision10
Pricing
Input Price$0.042 / 1M tokens
Output Price$0.17 / 1M tokens
Blended Price (3:1)$0.074 / 1M tokens
Speed
Tokens/sec117.0
Time to First Token1.24s
Time to Answer18.33s
Provider Price Ranking
Provider Price Ranking
3 providers
Cheapest: FastRouterMost Expensive: NanoGPT
ProviderInputOutput
1FastRouterCheapest
$0.04
$0.16
2SarvamPRIMARY
$0.042
$0.17
3NanoGPT
$0.045
$0.177
Compare pricing across different API providers for this model.