KAT Coder - Advanced AI Code Generation by Kwaipilot

What is KAT Coder?

KAT Coder is the flagship closed-source AI coding model developed by Kwaipilot, Kuaishou's AI research division specializing in AI for Software Engineering (AI4SE). As part of the KAT series (Kwaipilot-AutoThink), KAT Coder represents the pinnacle of agentic code generation technology.

Built on the Qwen model family with a Mixture-of-Experts (MoE) architecture featuring approximately 72 billion active parameters (out of over 1 trillion total), KAT Coder delivers state-of-the-art performance on complex software engineering tasks.

Developed by Kwaipilot

Kuaishou's elite AI4SE research team

MoE Architecture

~72B active params, 1T+ total

Industry Leading

Top closed-source performance

Features of KAT Coder

Discover the advanced capabilities that make KAT Coder the ultimate AI coding assistant

Advanced MoE Architecture

KAT Coder leverages Mixture-of-Experts design with ~72B active parameters for efficient and powerful code generation

Multi-Stage Training

Comprehensive training pipeline: Mid-Training, SFT, RFT, and Agentic RL for robust KAT Coder performance

Agentic Code Generation

KAT Coder acts as an autonomous agent, orchestrating tools and completing full projects independently

Tool-Use Capabilities

Seamlessly integrates with debuggers, code executors, and development environments for comprehensive assistance

Reinforcement Learning

State-of-the-art RL training with prefix caching, entropy pruning, and SeamlessFlow distributed systems

8 Task Types Mastery

KAT Coder excels at feature implementation, bug fixing, refactoring, testing, and more

Git & PR Integration

Trained on real Git commits and pull requests for authentic development workflow understanding

Long Context Support

Handles up to 128K tokens for comprehensive codebase understanding and multi-file reasoning

KAT Coder Model Comparison

Compare KAT Coder with other models in the KAT series

Model	Parameters (Active)	Type / License	Key Tasks	SWE-Bench Verified
KAT-Dev-32B	~32.8B	Open (Apache-2.0)	Code gen, debugging, refactoring	62.4%
KAT-Dev-72B-Exp	72.7B	Open (Apache-2.0)	Experimental RL variant	74.6%
KAT Coder	~72B (active)	Closed (Proprietary)	Flagship code agent model	73.4%
KAT-V1 (AutoThink)	40B	Open (Apache-2.0)	General agentic coding assistant	Complex reasoning tasks

Open-Source Models

KAT-Dev-32B and KAT-Dev-72B-Exp are available on HuggingFace under Apache-2.0 license for research and commercial use.

View on HuggingFace

Closed-Source Flagship

KAT Coder is proprietary and accessible via StreamLake API, offering the highest performance for enterprise users.

Get API Access

KAT Coder Architecture & Training

Understanding the multi-stage training pipeline that powers KAT Coder

Multi-Stage Training Pipeline

Mid-Training

Pre-finetuning on agentic skills: tool use, multi-turn dialogues, Git data, reasoning

Supervised Fine-Tuning

Curated trajectories across 8 task types and 8 programming domains

Reinforcement Fine-Tuning

Bridge between SFT and RL using teacher trajectories for stable training

Agentic RL

Large-scale self-play with prefix caching, entropy pruning, SeamlessFlow

128K

Context Length

KAT Coder supports up to 128,000 tokens for comprehensive codebase understanding

8×8

Task Coverage

Trained on 8 task types across 8 programming domains for comprehensive expertise

1T+

Total Parameters

MoE architecture with over 1 trillion total parameters, ~72B active at inference

KAT Coder RL Training Innovations

Prefix Caching

Reuses common token probabilities to drastically reduce compute overhead

Entropy Pruning

Focuses on informative sequences by filtering low-entropy trajectories

SeamlessFlow

Custom distributed RL system for massive-scale training throughput

How to Use KAT Coder

Multiple ways to access and integrate KAT Coder into your workflow

Open-Source Models (HuggingFace)

Access KAT-Dev-32B and KAT-Dev-72B-Exp directly from HuggingFace. These open-source variants of KAT Coder are available under Apache-2.0 license.

# Install transformers
pip install transformers torch

# Load KAT-Dev-32B
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "Kwaipilot/KAT-Dev"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    device_map="auto",
    trust_remote_code=True
)

# Generate code
messages = [
    {"role": "user", "content": "Write a Python function to sort a list"}
]
text = tokenizer.apply_chat_template(messages, tokenize=False)
inputs = tokenizer(text, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=512)
print(tokenizer.decode(outputs[0]))

View on HuggingFace

StreamLake API Access

Access the flagship KAT Coder model through Kuaishou's StreamLake (Wanqing) platform. Request an API key and integrate with tools like Claude Code.

Register on StreamLake

Visit console.streamlake.com/wanqing and create an account

Request API Key

Apply for KAT Coder API access in the dashboard

Integrate with Tools

Use Claude Code, vLLM, or custom integrations to leverage KAT Coder

Get API Access

Integration & Deployment Guides

Comprehensive documentation for integrating KAT Coder into CI/CD pipelines, IDEs, and development workflows.

CLI Integration

Command-line tools and scripts

IDE Plugins

VSCode, JetBrains integrations

CI/CD Pipelines

GitHub Actions, GitLab CI

Self-Hosted Deployment

Deploy KAT-Dev models locally

KAT Coder Performance Benchmarks

KAT Coder achieves industry-leading results on software engineering benchmarks

SWE-Bench Verified Performance

73.4%

KAT Coder

Closed-Source Flagship

74.6%

KAT-Dev-72B-Exp

Top Open Model

62.4%

KAT-Dev-32B

Efficient Open Model

KAT Coder vs. Other Leading Models

Model	Type	Parameters	SWE-Bench Score	Specialty
KAT Coder	Closed	~72B active	73.4%	Agentic code generation
KAT-Dev-72B-Exp	Open	72.7B	74.6%	RL-optimized variant
Claude Sonnet 4.5	Closed	Unknown	~75%	General-purpose + code
DeepSeek-Coder-33B	Open	33B	~58%	Code generation
Code Llama 34B	Open	34B	~55%	Code completion

Top Tier Performance

KAT Coder ranks among the best code generation models globally, rivaling top closed-source systems

Real-World Tasks

Excels at practical software engineering tasks: bug fixes, feature implementation, refactoring, testing

Best of Both Worlds

Open KAT-Dev models for research, closed KAT Coder for production

Frequently Asked Questions about KAT Coder

Everything you need to know about KAT Coder