Alibaba logo
Alibaba

Qwen3.6 27B (Reasoning)

Released Apr 2026

Intelligence
#43
Coding
#71
Context262K
Parameters27B

Qwen3.6-27B is a dense multimodal language model developed by Alibaba's Qwen team, designed to prioritize agentic coding and complex reasoning. Unlike many contemporary large-scale models that rely on Mixture-of-Experts (MoE) architectures, Qwen3.6-27B utilizes a fully dense 27-billion-parameter design. Despite its smaller total parameter count, it has demonstrated the ability to outperform significantly larger models, such as the 397B-parameter Qwen3.5 MoE, across various coding and repository-level benchmarks like SWE-bench and Terminal-Bench 2.0.

The model introduces a unique hybrid architecture that combines Gated DeltaNet linear attention with traditional Gated Attention. This layout uses a repeating pattern where three out of every four sublayers employ linear attention, significantly improving computational efficiency and KV cache management. This design allows the model to support a native context window of 262,144 tokens, which can be extended up to 1,010,000 tokens using YaRN scaling, making it suitable for processing extensive codebases and long-form documents.

A key innovation in this release is Thinking Preservation, a mechanism that allows the model to retain reasoning traces across multiple turns of a conversation. This feature is particularly beneficial for agentic workflows, as it prevents the model from needing to re-evaluate its logic on every step of a multi-turn task. Qwen3.6-27B is natively multimodal, supporting text, image, and video inputs within a single unified checkpoint.

Capabilities and Usage

The model operates in both "thinking" and "non-thinking" modes. By default, Qwen3.6-27B generates a reasoning chain before providing its final response to enhance accuracy in complex tasks. Users can toggle these modes via API parameters, such as using preserve_thinking: True to maintain context in agents. For optimal results in coding and debugging, developers are advised to maintain a context length of at least 128K tokens to ensure the model's reasoning capabilities are fully preserved.

Rankings & Comparison