Alibaba
Open Weights

Qwen2.5 Coder Instruct 7B

Released Nov 2024

Intelligence: #390
Context: 131K
Parameters: 7B

Qwen2.5-Coder-7B-Instruct is a large language model developed by Alibaba Cloud and optimized for coding tasks. It belongs to the Qwen2.5-Coder series and is built on the Qwen2.5 architecture. The model is designed for code generation, completion, and debugging while remaining small enough to run in a range of development environments.

The model has 7.61 billion parameters and uses a decoder-only Transformer architecture. It was trained on 5.5 trillion tokens, including source code across 92 programming languages, technical documentation, and mathematical data. As an instruction-tuned model, it has undergone supervised fine-tuning and reinforcement learning from human feedback (RLHF) to improve its instruction following and conversational performance.

The context window is 128K tokens (131,072), which allows the model to process large codebases and long-form technical documentation. On coding benchmarks such as HumanEval and MBPP, it is reported to be competitive with considerably larger open-source and proprietary coding models.
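
The two headline numbers above can be turned into rough sizing figures. The sketch below is only a back-of-the-envelope estimate: it reads "128K" as 128 × 1024 tokens (which matches the 131K figure in the header) and estimates memory for the weights alone. The per-parameter byte widths and the helper name `weight_memory_gib` are illustrative assumptions, not values taken from the model's documentation, and the estimate ignores the KV cache and activations.

```python
# Back-of-the-envelope sizing for a 7.61B-parameter model.
# Only the parameter count and context length come from the description;
# the storage widths below are assumed, common formats.

PARAMS = 7.61e9               # parameter count stated in the description
CONTEXT_TOKENS = 128 * 1024   # "128K" read as 128 * 1024 tokens

# Assumed bytes per parameter for a few common weight formats.
BYTES_PER_PARAM = {"fp32": 4.0, "bf16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gib(params: float, bytes_per_param: float) -> float:
    """Memory for the weights alone, in GiB (excludes KV cache and activations)."""
    return params * bytes_per_param / 2**30


if __name__ == "__main__":
    print(f"context window: {CONTEXT_TOKENS:,} tokens")
    for name, width in BYTES_PER_PARAM.items():
        gib = weight_memory_gib(PARAMS, width)
        print(f"{name}: ~{gib:.1f} GiB for weights")
```

Actual memory use depends on the runtime, the quantization scheme, and how much of the context window is filled, so these numbers are best treated as lower bounds.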

Rankings & Comparison