Llama 3.1 Tulu3 405B
Allen Institute for AI
Release Date
2025-01-30
Parameters
—
Context Length
—
Modalities
—
Capability Radar
general: 30
coding: 29
reasoning: 40
science (est.): 34
agents: 0
multimodal: 0
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 255 | 32.0 | AA |
| General Ranking | 296 | 37.0 | AA |
| Math Reasoning | 199 | 46.0 | AA |
| Science | 313 | 35.0 | AA |
Benchmark Scores (LLM Stats)
No benchmark data available
AA Evaluation Indices
Intelligence Index: 14.1
MATH-500: 0.8
MMLU-Pro: 0.7
GPQA: 0.5
SciCode: 0.3
LiveCodeBench: 0.3
AIME: 0.1
HLE: 0.0
LLM Stats Category Scores
No category score data available
Pricing
Input Price: Free
Output Price: Free
Blended Price (3:1): Free
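The "3:1" label on the blended price is not defined on this page. A minimal sketch, assuming it means a weighted average of three parts input price to one part output price; the function name and default weights are hypothetical and not taken from the source:

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of two per-token prices.

    Assumption: "3:1" means input tokens are weighted three times as
    heavily as output tokens. The page itself does not confirm this.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# With both prices free (0.0), the blended price is also 0.0.
free_blend = blended_price(0.0, 0.0)
print(free_blend)
```

Under this assumption, a model listed as free for both input and output also has a free blended price, which matches the entry above.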
Speed
Tokens/sec: 0.0 tokens/s
Time to First Token: 0.00s
Time to Answer: 0.00s
Available Providers
(LS internal units)
No provider data available
External Sources
No external links available