Qwen2.5 Coder Instruct 7B
AlibabaQwenOpen WeightApache 2.0 · Commercial OK
Description
Qwen2.5-Coder is a specialized coding model trained on 5.5 trillion tokens of code data, supporting 92 programming languages with a 128K context window. It excels in code generation, completion, and repair while maintaining strong performance in math and general tasks. The model demonstrates exceptional capabilities in multi-programming language tasks and code reasoning.
Release Date
2024-09-19
Parameters
7.0B
Context Length
33K
Modalities
text
Capability Radar
20
general
13
coding
29
reasoning
21
scienceest.
0
agents
0
multimodal
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 399 | 14.0 | AA |
| General Ranking | 424 | 23.0 | AA |
| Math Reasoning | 250 | 35.0 | AA |
| Reasoning | 58 | 63.0 | LS |
| Science | 418 | 21.0 | AA |
Benchmark Scores (LLM Stats)
Code
HumanEval
88.4%SR
Aider
55.6%SR
LiveCodeBench
18.2%SR
Finance
MMLU-Base
68.0%SR
MMLU
67.6%SR
TruthfulQA
50.6%SR
MMLU-Pro
40.1%SR
TheoremQA
34.0%SR
General
MBPP
0.83 / 100SR
MMLU-Redux
66.6%SR
ARC-C
60.9%SR
BigCodeBench
41.0%SR
Language
Winogrande
72.9%SR
Math
GSM8k
83.9%SR
MATH
46.6%SR
STEM
34.0%SR
Reasoning
HellaSwag
76.8%SR
CRUXEval-Input-CoT
56.5%SR
CRUXEval-Output-CoT
56.0%SR
AA Evaluation Indices
Intelligence Index10.0
Math 5000.7
Mmlu Pro0.5
Gpqa0.3
Scicode0.1
Livecodebench0.1
Aime0.1
Hle0.0
LLM Stats Category Scores
General60
Language60
Math60
Reasoning60
Code50
Finance50
Healthcare50
Legal50
Physics30
Pricing
Input PriceFree
Output PriceFree
Blended Price (3:1)Free
Speed
Tokens/sec0.0 tokens/s
Time to First Token0.00s
Time to Answer0.00s
Available Providers
(LS internal units)No provider data available