Kimi K2
Open-weight Mixture-of-Experts model from Moonshot AI (China), released July 11, 2025. It has roughly 1 trillion total parameters and targets agentic coding and long-horizon tool use. It is the base of the lineage for subsequent K2.x versions.
Architecture
- Total parameters: 1.04 trillion
- Active parameters per token: 32 billion
- Experts: 384 routed (8 selected per token) plus 1 shared expert applied to every token; up from 256 routed experts in DeepSeek-V3
- Attention: Multi-head Latent Attention (MLA)
- Hidden dimension: 7168 (model), 2048 (MoE expert)
- Layers: 61
- Context window: 128K tokens (K2 base); extended to 256K in K2.6
- License: Modified MIT (allows commercial use)
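The sparsity implied by the figures above can be checked with a few lines of arithmetic. This is a minimal sketch: the `MoEConfig` class and its field names are hypothetical, introduced only for illustration, and it assumes the 384 figure counts the routed pool with the shared expert on top.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MoEConfig:
    """Hypothetical container for the headline numbers listed above."""
    total_params: float      # total parameter count
    active_params: float     # parameters used per token
    routed_experts: int      # size of the routed expert pool (assumed to exclude the shared expert)
    selected_experts: int    # routed experts chosen per token
    shared_experts: int      # experts applied to every token

    @property
    def active_fraction(self) -> float:
        # Share of all parameters touched by a single token.
        return self.active_params / self.total_params

    @property
    def experts_per_token(self) -> int:
        # Routed selections plus the always-on shared expert.
        return self.selected_experts + self.shared_experts

    @property
    def routed_fraction(self) -> float:
        # Share of the routed pool that fires for a single token.
        return self.selected_experts / self.routed_experts


kimi_k2 = MoEConfig(
    total_params=1.04e12,
    active_params=32e9,
    routed_experts=384,
    selected_experts=8,
    shared_experts=1,
)

print(f"active parameter fraction: {kimi_k2.active_fraction:.1%}")
print(f"experts evaluated per token: {kimi_k2.experts_per_token}")
print(f"routed pool used per token: {kimi_k2.routed_fraction:.1%}")
```

Under these assumptions, each token touches about 3% of the parameters and about 2% of the routed experts.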
Benchmarks (Kimi K2, base release)
| Benchmark | Score |
|---|---|
| SWE-bench Verified | 65.8% |
| Tau2-Bench | 66.1 |
| ACEBench (En) | 76.5 |
| LiveCodeBench v6 | 53.7 |
| AIME 2025 | 49.5 |
| GPQA-Diamond | 75.1 |
On the Artificial Analysis Intelligence Index, the base K2 scores 26, above the median of 23 for open-weight non-reasoning models of comparable size at release.
K2.6 (April 2026)
K2.6 is the current production version. Key changes over the base K2:
- Context window extended to 256K tokens
- Native vision via a 400M-parameter MoonViT encoder (image and video input)
- Agent Swarm scales to 300 sub-agents and 4,000 coordinated steps
- Hallucination rate reduced from 65% (K2.5) to 39%
- Intelligence Index score: 54, the highest among open-weight models and behind only Anthropic, Google, and OpenAI (all at 57)
On SWE-bench Pro, K2.6 scores 58.6, ahead of GPT-5.4 (57.7), Gemini 3.1 Pro (54.2), and Claude Opus 4.6 at max effort (53.4).
K2.6 pricing: $0.95 per million input tokens and $4.00 per million output tokens.
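As a rough illustration of what the listed rates mean per request, the sketch below multiplies token counts by the two prices. The function name and the example token counts are hypothetical, and the calculation ignores any caching discounts or provider-specific pricing, which the source does not describe.

```python
# Listed K2.6 rates, in USD per million tokens.
INPUT_USD_PER_MTOK = 0.95
OUTPUT_USD_PER_MTOK = 4.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request at the listed rates (no discounts applied)."""
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


# Hypothetical agentic request: a large context read and a short response.
cost = estimate_cost_usd(input_tokens=200_000, output_tokens=10_000)
print(f"estimated cost: ${cost:.2f}")
```

Because output tokens cost roughly four times as much as input tokens, long generations dominate the bill faster than long prompts do.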
Comparison to Other Open-Weight Models
At the time of the base K2 release, Nemotron Ultra (NVIDIA) was the other prominent open-weight contender at the 1T scale. K2 led on agentic benchmarks (Tau2-Bench, ACEBench) and matched or exceeded it on coding. K2.6 now leads the open-weight category outright on the Artificial Analysis Intelligence Index.
The MoE design (large total parameter count, few active parameters per forward pass) follows the architecture pattern of DeepSeek-V3, with a larger expert pool (384 versus 256).
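The "large pool, few active experts" pattern can be sketched with a toy router. This is not Moonshot's implementation: the softmax gating over the selected experts, the scalar "hidden state", and the function names are assumptions made for illustration. Only the counts (384 routed experts, 8 selected, 1 shared) come from the figures above.

```python
import math
import random


def route_token(router_logits, k=8):
    """Pick the k highest-scoring experts and normalize their weights.

    Softmax over the selected logits is an assumption for this sketch;
    the real gating function may differ.
    """
    top = sorted(range(len(router_logits)),
                 key=lambda i: router_logits[i], reverse=True)[:k]
    peak = max(router_logits[i] for i in top)  # subtract max for numerical stability
    exps = [math.exp(router_logits[i] - peak) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]


def moe_output(x, router_logits, routed_experts, shared_expert, k=8):
    """Combine the always-on shared expert with the weighted top-k routed experts."""
    out = shared_expert(x)
    for idx, weight in route_token(router_logits, k):
        out += weight * routed_experts[idx](x)
    return out


random.seed(0)
NUM_ROUTED = 384
logits = [random.gauss(0.0, 1.0) for _ in range(NUM_ROUTED)]

# Toy experts: each one just scales its scalar input.
experts = [lambda x, s=1.0 + i / NUM_ROUTED: s * x for i in range(NUM_ROUTED)]
shared = lambda x: 0.5 * x

chosen = route_token(logits)
print("selected experts:", [i for i, _ in chosen])
print("output:", moe_output(1.0, logits, experts, shared))
```

Only the 8 selected experts and the shared expert are evaluated per token; the other 376 are skipped, which is why active parameters stay far below the total.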
Availability
Weights are available on Hugging Face under the Modified MIT License. K2.6 is available through Moonshot's API and third-party providers including DeepInfra, Novita, Baseten, Fireworks, and Parasail.
Related
deepseek-r1 · evals · agentic-workflows