Kuaishou's Flagship AI Coding Assistant Powered by Advanced Agentic Reinforcement Learning
KAT Coder is the flagship closed-source AI coding model developed by Kwaipilot, Kuaishou's AI research division specializing in AI for Software Engineering (AI4SE). As part of the KAT series (Kwaipilot-AutoThink), KAT Coder represents the pinnacle of agentic code generation technology.
Built on the Qwen model family with a Mixture-of-Experts (MoE) architecture featuring approximately 72 billion active parameters (out of over 1 trillion total), KAT Coder delivers state-of-the-art performance on complex software engineering tasks.
Kuaishou's elite AI4SE research team
~72B active params, 1T+ total
Top closed-source performance
Discover the advanced capabilities that make KAT Coder the ultimate AI coding assistant
KAT Coder leverages Mixture-of-Experts design with ~72B active parameters for efficient and powerful code generation
Comprehensive training pipeline: Mid-Training, SFT, RFT, and Agentic RL for robust KAT Coder performance
KAT Coder acts as an autonomous agent, orchestrating tools and completing full projects independently
Seamlessly integrates with debuggers, code executors, and development environments for comprehensive assistance
State-of-the-art RL training with prefix caching, entropy pruning, and SeamlessFlow distributed systems
KAT Coder excels at feature implementation, bug fixing, refactoring, testing, and more
Trained on real Git commits and pull requests for authentic development workflow understanding
Handles up to 128K tokens for comprehensive codebase understanding and multi-file reasoning
Compare KAT Coder with other models in the KAT series
Model | Parameters (Active) | Type / License | Key Tasks | SWE-Bench Verified |
---|---|---|---|---|
KAT-Dev-32B | ~32.8B | Open (Apache-2.0) | Code gen, debugging, refactoring | 62.4% |
KAT-Dev-72B-Exp | 72.7B | Open (Apache-2.0) | Experimental RL variant | 74.6% |
KAT Coder | ~72B (active) | Closed (Proprietary) | Flagship code agent model | 73.4% |
KAT-V1 (AutoThink) | 40B | Open (Apache-2.0) | General agentic coding assistant | Complex reasoning tasks |
KAT-Dev-32B and KAT-Dev-72B-Exp are available on HuggingFace under Apache-2.0 license for research and commercial use.
View on HuggingFaceKAT Coder is proprietary and accessible via StreamLake API, offering the highest performance for enterprise users.
Get API AccessUnderstanding the multi-stage training pipeline that powers KAT Coder
Pre-finetuning on agentic skills: tool use, multi-turn dialogues, Git data, reasoning
Curated trajectories across 8 task types and 8 programming domains
Bridge between SFT and RL using teacher trajectories for stable training
Large-scale self-play with prefix caching, entropy pruning, SeamlessFlow
KAT Coder supports up to 128,000 tokens for comprehensive codebase understanding
Trained on 8 task types across 8 programming domains for comprehensive expertise
MoE architecture with over 1 trillion total parameters, ~72B active at inference
Reuses common token probabilities to drastically reduce compute overhead
Focuses on informative sequences by filtering low-entropy trajectories
Custom distributed RL system for massive-scale training throughput
Multiple ways to access and integrate KAT Coder into your workflow
Access KAT-Dev-32B and KAT-Dev-72B-Exp directly from HuggingFace. These open-source variants of KAT Coder are available under Apache-2.0 license.
# Install transformers
pip install transformers torch
# Load KAT-Dev-32B
from transformers import AutoModelForCausalLM, AutoTokenizer
model_name = "Kwaipilot/KAT-Dev"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
model_name,
device_map="auto",
trust_remote_code=True
)
# Generate code
messages = [
{"role": "user", "content": "Write a Python function to sort a list"}
]
text = tokenizer.apply_chat_template(messages, tokenize=False)
inputs = tokenizer(text, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=512)
print(tokenizer.decode(outputs[0]))
Access the flagship KAT Coder model through Kuaishou's StreamLake (Wanqing) platform. Request an API key and integrate with tools like Claude Code.
Visit console.streamlake.com/wanqing and create an account
Apply for KAT Coder API access in the dashboard
Use Claude Code, vLLM, or custom integrations to leverage KAT Coder
Comprehensive documentation for integrating KAT Coder into CI/CD pipelines, IDEs, and development workflows.
Command-line tools and scripts
VSCode, JetBrains integrations
GitHub Actions, GitLab CI
Deploy KAT-Dev models locally
KAT Coder achieves industry-leading results on software engineering benchmarks
Model | Type | Parameters | SWE-Bench Score | Specialty |
---|---|---|---|---|
KAT Coder | Closed | ~72B active | 73.4% | Agentic code generation |
KAT-Dev-72B-Exp | Open | 72.7B | 74.6% | RL-optimized variant |
Claude Sonnet 4.5 | Closed | Unknown | ~75% | General-purpose + code |
DeepSeek-Coder-33B | Open | 33B | ~58% | Code generation |
Code Llama 34B | Open | 34B | ~55% | Code completion |
KAT Coder ranks among the best code generation models globally, rivaling top closed-source systems
Excels at practical software engineering tasks: bug fixes, feature implementation, refactoring, testing
Open KAT-Dev models for research, closed KAT Coder for production
Everything you need to know about KAT Coder