Kimi logo
Kimi
Open Weights

Kimi K2.6

Released Apr 2026

Intelligence
#9
Coding
#20
Arena AI
#26
Context256K
Parameters1T

Kimi K2.6 is a 1-trillion parameter Mixture-of-Experts (MoE) model developed by Moonshot AI, specifically optimized for agentic workflows, long-horizon coding, and multimodal understanding. Released in April 2026, it functions as a native multimodal system, integrating text, image, and video processing within a unified architecture. The model is released as an open-weight resource under a Modified MIT License, providing high-performance autonomous capabilities for complex reasoning tasks.

The architecture employs a sparse MoE configuration with 384 total experts, where 8 experts are routed per token in addition to one shared expert, resulting in 32 billion active parameters per forward pass. It utilizes Multi-head Latent Attention (MLA) and the SwiGLU activation function. To ensure stability at its 1-trillion-parameter scale, the model was trained using the proprietary Muon (MuonClip) optimizer. It features a native vision encoder, MoonViT, which handles visual inputs without requiring separate vision-language bridging modules.

A defining characteristic of Kimi K2.6 is its horizontal scaling through Agent Swarms, which can orchestrate up to 300 sub-agents across 4,000 coordinated steps. This allows the system to manage complex, multi-day engineering tasks autonomously, such as overhauling legacy codebases or managing persistent system operations. The model supports a context window of 262,144 tokens, enabling the processing of extensive documentation and long-running conversation histories in a single session.

The model is available in several variants, including Instant for speed-sensitive tasks and Thinking for deep reasoning. It has demonstrated high proficiency in software engineering benchmarks, including SWE-Bench Pro and HLE-Full, where it performs competitively with contemporary closed-source frontier models. Kimi K2.6 also includes integrated features for professional workflows, such as automatic context caching, JSON Mode, and native tool-use for internet search and code execution.

Rankings & Comparison