Alibaba
Open Weights

Qwen3.5 9B (Non-reasoning)

Released Mar 2026

Intelligence
#138
Coding
#157
Context
262K
Parameters
9B

Qwen3.5 9B is a dense multimodal language model developed by Alibaba Cloud's Qwen team, released in March 2026 as the flagship of the Qwen3.5 "Small" series. Built to balance strong reasoning performance with computational efficiency, it uses a hybrid architecture that combines Gated Delta Networks (linear attention) with standard Gated Attention (full attention). This design enables high-throughput, low-latency inference while maintaining strong capabilities in language understanding, mathematics, and coding.
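
The throughput benefit of mixing linear attention with full attention can be illustrated with a rough memory count. The sketch below compares, for a single layer, the key/value cache that full attention grows with every token against the fixed-size recurrent state a linear-attention layer keeps. The head counts and dimensions are hypothetical placeholders chosen for illustration, not the published Qwen3.5 9B configuration.

```python
# Rough per-layer memory comparison: full attention vs. linear attention.
# All dimensions below are hypothetical, not the real Qwen3.5 9B config.

def kv_cache_elements(num_tokens: int, num_kv_heads: int, head_dim: int) -> int:
    """Full attention stores one key and one value vector per token per KV head."""
    return 2 * num_tokens * num_kv_heads * head_dim


def linear_state_elements(num_heads: int, key_dim: int, value_dim: int) -> int:
    """Linear attention keeps one key_dim x value_dim state matrix per head,
    independent of how many tokens have been processed."""
    return num_heads * key_dim * value_dim


if __name__ == "__main__":
    heads, dim = 8, 128  # hypothetical values
    for tokens in (1_024, 262_144):
        full = kv_cache_elements(tokens, heads, dim)
        linear = linear_state_elements(heads, dim, dim)
        print(f"{tokens:>7} tokens: full={full:,} elements, linear={linear:,} elements")
```

Under these assumed sizes, the full-attention cache grows in proportion to the sequence length while the linear-attention state stays constant, which is the general reason hybrid designs of this kind are cheaper to serve at long context.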

The model features a native context window of 262,144 tokens, which is extensible up to 1,010,000 tokens through scaling techniques like YaRN. As a natively multimodal foundation model, it employs early-fusion training to process text, images, and video within a single context. It supports 201 languages and dialects, providing broad linguistic coverage for global applications.
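
As a quick check on the numbers above, the scaling factor implied by going from the native window to the extended one is simply their ratio. The sketch below computes it and places it in a configuration dictionary modeled on the `rope_scaling` style used in Hugging Face Transformers configs. The exact keys and values this model expects are an assumption here, so treat the dictionary as illustrative only.

```python
# Scaling factor implied by the native and extended context lengths.
NATIVE_CONTEXT = 262_144      # native window stated above (2**18 tokens)
EXTENDED_CONTEXT = 1_010_000  # extended window stated above


def scaling_factor(target_tokens: int, native_tokens: int) -> float:
    """Ratio by which positions must be stretched to reach the target length."""
    return target_tokens / native_tokens


factor = scaling_factor(EXTENDED_CONTEXT, NATIVE_CONTEXT)

# Illustrative only: key names follow the common Transformers `rope_scaling`
# convention; whether this model uses exactly these keys is an assumption.
rope_scaling = {
    "rope_type": "yarn",
    "factor": round(factor, 2),
    "original_max_position_embeddings": NATIVE_CONTEXT,
}

if __name__ == "__main__":
    print(f"implied scaling factor: {factor:.4f}")
    print(rope_scaling)
```

The ratio comes out a little under 4, so a deployment that rounds the factor up to 4.0 would cover slightly more than the stated 1,010,000 tokens.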

While the Qwen3.5 family supports an explicit "Thinking Mode" for complex reasoning traces, this version is optimized for standard instruction-following and high-speed generation. In its non-reasoning (instruct) configuration, the model excels at tool use, agentic workflows, and structured data extraction. It is released under the Apache 2.0 license, making it available for both research and commercial use.

Rankings & Comparison