DeepSeek-V3-0324

DeepSeek ⚡ The Runner Released March 2025

DeepSeek's updated V3 model with improved post-training using R1 techniques

DeepSeek-V3-0324

DeepSeek • March 2025

Training Data

Up to March 2025

DeepSeek-V3-0324

March 2025

Parameters

671 billion (37B active)

Training Method

Advanced MoE with R1 RL techniques

Context Window

128,000 tokens

Knowledge Cutoff

March 2025

Key Features

Improved Post-training • R1-enhanced • Advanced Performance

Capabilities

Reasoning: Outstanding

Coding: Excellent

Efficiency: Outstanding

What's New in This Version

Enhanced post-training pipeline incorporating lessons from DeepSeek-R1

DeepSeek's updated V3 model with improved post-training using R1 techniques

What's New in This Version

Enhanced post-training pipeline incorporating lessons from DeepSeek-R1

Technical Specifications

Parameters 671 billion (37B active)

Context Window 128,000 tokens

Training Method Advanced MoE with R1 RL techniques

Knowledge Cutoff March 2025

Training Data Up to March 2025

Key Features

Improved Post-training R1-enhanced Advanced Performance

Capabilities

Reasoning: Outstanding

Coding: Excellent

Efficiency: Outstanding

Other DeepSeek Models

Explore more models from DeepSeek

DeepSeek-V4-Pro

DeepSeek's frontier MoE flagship closing the gap with leading proprietary models on reasoning and agentic coding

April 2026 1.6 trillion (49B active)

DeepSeek-V4-Flash

DeepSeek's smaller, fast variant of V4 — same architecture at a fraction of the cost and latency

April 2026 284 billion (13B active)

DeepSeek-V3.2

DeepSeek's latest flagship model matching GPT-5 performance with integrated tool-use thinking

December 2025 671 billion (37B active)

Official Documentation Compare with Other Models View Timeline All DeepSeek Models