Qwen2.5 Coder Instruct 7B

AlibabaQwenOpen WeightApache 2.0 · Commercial OK

Descripción

Qwen2.5-Coder is a specialized coding model trained on 5.5 trillion tokens of code data, supporting 92 programming languages with a 128K context window. It excels in code generation, completion, and repair while maintaining strong performance in math and general tasks. The model demonstrates exceptional capabilities in multi-programming language tasks and code reasoning.

Fecha de lanzamiento

2024-09-19

Parámetros

7.0B

Longitud del contexto

33K

Modalidades

text

Radar de capacidades

general

coding

reasoning

scienceest.

agents

multimodal

Science usa un proxy de razonamiento cuando los benchmarks científicos dedicados no están disponibles.

Rankings

Dominio	#Posición	Puntuación	Fuente
Code Ranking	399	14.0	AA
General Ranking	424	23.0	AA
Math Reasoning	250	35.0	AA
Reasoning	58	63.0	LS
Science	418	21.0	AA

Puntuaciones de benchmarks (LLM Stats)

Code

HumanEval

88.4%Aut.

Aider

55.6%Aut.

LiveCodeBench

18.2%Aut.

Finance

MMLU-Base

68.0%Aut.

MMLU

67.6%Aut.

TruthfulQA

50.6%Aut.

MMLU-Pro

40.1%Aut.

TheoremQA

34.0%Aut.

General

MBPP

0.83 / 100Aut.

MMLU-Redux

66.6%Aut.

ARC-C

60.9%Aut.

BigCodeBench

41.0%Aut.

Language

Winogrande

72.9%Aut.

Math

GSM8k

83.9%Aut.

MATH

46.6%Aut.

STEM

34.0%Aut.

Reasoning

HellaSwag

76.8%Aut.

CRUXEval-Input-CoT

56.5%Aut.

CRUXEval-Output-CoT

56.0%Aut.

Índices de evaluación AA

Intelligence Index

10.0

Math 500

0.7

Mmlu Pro

0.5

Gpqa

0.3

Scicode

0.1

Livecodebench

0.1

Aime

0.1

Hle

0.0

Puntuaciones por categoría LLM Stats

General

Language

Math

Reasoning

Code

Finance

Healthcare

Legal

Physics

Precios

Precio de entradaGratis

Precio de salidaGratis

Precio mixto (3:1)Gratis

Velocidad

Tokens/seg0.0 tokens/s

Retraso del primer token0.00s

Tiempo hasta la respuesta0.00s

Proveedores disponibles

(Unidades internas LS)

No hay datos de proveedores disponibles

Fuentes externas

LLM Stats