KAT Coder

Kuaishou's Flagship AI Coding Assistant Powered by Advanced Agentic Reinforcement Learning

73.4%
SWE-Bench Verified Score

What is KAT Coder?

KAT Coder is the flagship closed-source AI coding model developed by Kwaipilot, Kuaishou's AI research division specializing in AI for Software Engineering (AI4SE). As part of the KAT series (Kwaipilot-AutoThink), KAT Coder represents the pinnacle of agentic code generation technology.

Built on the Qwen model family with a Mixture-of-Experts (MoE) architecture featuring approximately 72 billion active parameters (out of over 1 trillion total), KAT Coder delivers state-of-the-art performance on complex software engineering tasks.

Developed by Kwaipilot

Kuaishou's elite AI4SE research team

MoE Architecture

~72B active params, 1T+ total

Industry Leading

Top closed-source performance

Features of KAT Coder

Discover the advanced capabilities that make KAT Coder the ultimate AI coding assistant

Advanced MoE Architecture

KAT Coder leverages Mixture-of-Experts design with ~72B active parameters for efficient and powerful code generation

Multi-Stage Training

Comprehensive training pipeline: Mid-Training, SFT, RFT, and Agentic RL for robust KAT Coder performance

Agentic Code Generation

KAT Coder acts as an autonomous agent, orchestrating tools and completing full projects independently

Tool-Use Capabilities

Seamlessly integrates with debuggers, code executors, and development environments for comprehensive assistance

Reinforcement Learning

State-of-the-art RL training with prefix caching, entropy pruning, and SeamlessFlow distributed systems

8 Task Types Mastery

KAT Coder excels at feature implementation, bug fixing, refactoring, testing, and more

Git & PR Integration

Trained on real Git commits and pull requests for authentic development workflow understanding

Long Context Support

Handles up to 128K tokens for comprehensive codebase understanding and multi-file reasoning

KAT Coder Model Comparison

Compare KAT Coder with other models in the KAT series

Model Parameters (Active) Type / License Key Tasks SWE-Bench Verified
KAT-Dev-32B ~32.8B Open (Apache-2.0) Code gen, debugging, refactoring 62.4%
KAT-Dev-72B-Exp 72.7B Open (Apache-2.0) Experimental RL variant 74.6%
KAT Coder ~72B (active) Closed (Proprietary) Flagship code agent model 73.4%
KAT-V1 (AutoThink) 40B Open (Apache-2.0) General agentic coding assistant Complex reasoning tasks

Open-Source Models

KAT-Dev-32B and KAT-Dev-72B-Exp are available on HuggingFace under Apache-2.0 license for research and commercial use.

View on HuggingFace

Closed-Source Flagship

KAT Coder is proprietary and accessible via StreamLake API, offering the highest performance for enterprise users.

Get API Access

KAT Coder Architecture & Training

Understanding the multi-stage training pipeline that powers KAT Coder

Multi-Stage Training Pipeline

01

Mid-Training

Pre-finetuning on agentic skills: tool use, multi-turn dialogues, Git data, reasoning

02

Supervised Fine-Tuning

Curated trajectories across 8 task types and 8 programming domains

03

Reinforcement Fine-Tuning

Bridge between SFT and RL using teacher trajectories for stable training

04

Agentic RL

Large-scale self-play with prefix caching, entropy pruning, SeamlessFlow

128K

Context Length

KAT Coder supports up to 128,000 tokens for comprehensive codebase understanding

8×8

Task Coverage

Trained on 8 task types across 8 programming domains for comprehensive expertise

1T+

Total Parameters

MoE architecture with over 1 trillion total parameters, ~72B active at inference

KAT Coder RL Training Innovations

Prefix Caching

Reuses common token probabilities to drastically reduce compute overhead

Entropy Pruning

Focuses on informative sequences by filtering low-entropy trajectories

SeamlessFlow

Custom distributed RL system for massive-scale training throughput

How to Use KAT Coder

Multiple ways to access and integrate KAT Coder into your workflow

Open-Source Models (HuggingFace)

Access KAT-Dev-32B and KAT-Dev-72B-Exp directly from HuggingFace. These open-source variants of KAT Coder are available under Apache-2.0 license.

# Install transformers
pip install transformers torch

# Load KAT-Dev-32B
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "Kwaipilot/KAT-Dev"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    device_map="auto",
    trust_remote_code=True
)

# Generate code
messages = [
    {"role": "user", "content": "Write a Python function to sort a list"}
]
text = tokenizer.apply_chat_template(messages, tokenize=False)
inputs = tokenizer(text, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=512)
print(tokenizer.decode(outputs[0]))

StreamLake API Access

Access the flagship KAT Coder model through Kuaishou's StreamLake (Wanqing) platform. Request an API key and integrate with tools like Claude Code.

1

Register on StreamLake

Visit console.streamlake.com/wanqing and create an account

2

Request API Key

Apply for KAT Coder API access in the dashboard

3

Integrate with Tools

Use Claude Code, vLLM, or custom integrations to leverage KAT Coder

Integration & Deployment Guides

Comprehensive documentation for integrating KAT Coder into CI/CD pipelines, IDEs, and development workflows.

CLI Integration

Command-line tools and scripts

IDE Plugins

VSCode, JetBrains integrations

CI/CD Pipelines

GitHub Actions, GitLab CI

Self-Hosted Deployment

Deploy KAT-Dev models locally

KAT Coder Performance Benchmarks

KAT Coder achieves industry-leading results on software engineering benchmarks

SWE-Bench Verified Performance

73.4%
KAT Coder
Closed-Source Flagship
74.6%
KAT-Dev-72B-Exp
Top Open Model
62.4%
KAT-Dev-32B
Efficient Open Model

KAT Coder vs. Other Leading Models

Model Type Parameters SWE-Bench Score Specialty
KAT Coder Closed ~72B active 73.4% Agentic code generation
KAT-Dev-72B-Exp Open 72.7B 74.6% RL-optimized variant
Claude Sonnet 4.5 Closed Unknown ~75% General-purpose + code
DeepSeek-Coder-33B Open 33B ~58% Code generation
Code Llama 34B Open 34B ~55% Code completion

Top Tier Performance

KAT Coder ranks among the best code generation models globally, rivaling top closed-source systems

Real-World Tasks

Excels at practical software engineering tasks: bug fixes, feature implementation, refactoring, testing

Best of Both Worlds

Open KAT-Dev models for research, closed KAT Coder for production

Frequently Asked Questions about KAT Coder

Everything you need to know about KAT Coder