LATEST MODEL

DeepSeek-V3-0324

DeepSeek Released March 2025

DeepSeek's updated V3 model with improved post-training using R1 techniques

DeepSeek-V3-0324

DeepSeekMarch 2025

Latest

Training Data

Up to March 2025

DeepSeek-V3-0324

March 2025

Parameters

671 billion (37B active)

Training Method

Advanced MoE with R1 RL techniques

Context Window

128,000 tokens

Knowledge Cutoff

March 2025

Key Features

Improved Post-training • R1-enhanced • Advanced Performance

Capabilities

Reasoning: Outstanding

Coding: Excellent

Efficiency: Outstanding

What's New in This Version

Enhanced post-training pipeline incorporating lessons from DeepSeek-R1

DeepSeek's updated V3 model with improved post-training using R1 techniques

What's New in This Version

Enhanced post-training pipeline incorporating lessons from DeepSeek-R1

Technical Specifications

Parameters 671 billion (37B active)
Context Window 128,000 tokens
Training Method Advanced MoE with R1 RL techniques
Knowledge Cutoff March 2025
Training Data Up to March 2025

Key Features

Improved Post-training R1-enhanced Advanced Performance

Capabilities

Reasoning: Outstanding
Coding: Excellent
Efficiency: Outstanding

Other DeepSeek Models

Explore more models from DeepSeek