DeepSeek-V4-Pro
About DeepSeek-V4-Pro
DeepSeek-V4-Pro is DeepSeek's flagship open-source MoE model with 1.6T total parameters and 49B activated, purpose-built for frontier-level reasoning, coding, and agentic tasks. Supporting a 1M-token context window and three reasoning effort modes up to Think Max, it achieves top-tier performance on coding benchmarks such as LiveCodeBench and Codeforces — rivaling leading closed-source models — and is released under the MIT License.
Available Serverless
Run queries immediately, pay only for usage
Input Price
$
1.74
/ M Tokens
Cache Read
$
0.145
/ M Tokens
Output Price
$
3.48
/ M Tokens
Metadata
Specification
State
Available
Architecture
Hybrid Attention MoE
Calibrated
Yes
Mixture of Experts
Yes
Total Parameters
862B
Activated Parameters
49B
Reasoning
No
Precision
FP8
Context length
1049K
Max Tokens
393K
Supported Functionality
Serverless
Supported
Serverless LoRA
Not supported
Fine-tuning
Not supported
Embeddings
Not supported
Rerankers
Not supported
Support image input
Not supported
JSON Mode
Supported
Structured Outputs
Not supported
Tools
Supported
Fim Completion
Not supported
Chat Prefix Completion
Supported
Compare with Other Models
See how this model stacks up against others.
DeepSeek
chat
DeepSeek-V4-Pro
Release on: Apr 24, 2026
Total Context:
1049K
Max output:
393K
Input:
$
1.74
/ M Tokens
Output:
$
3.48
/ M Tokens
DeepSeek
chat
DeepSeek-V4-Flash
Release on: Apr 24, 2026
Total Context:
1049K
Max output:
393K
Input:
$
0.14
/ M Tokens
Output:
$
0.28
/ M Tokens
DeepSeek
chat
DeepSeek-V3.2
Release on: Dec 4, 2025
Total Context:
164K
Max output:
164K
Input:
$
0.27
/ M Tokens
Output:
$
0.42
/ M Tokens
DeepSeek
chat
DeepSeek-V3.2-Exp
Release on: Oct 10, 2025
Total Context:
164K
Max output:
164K
Input:
$
0.27
/ M Tokens
Output:
$
0.41
/ M Tokens
DeepSeek
chat
DeepSeek-V3.1-Terminus
Release on: Sep 29, 2025
Total Context:
164K
Max output:
164K
Input:
$
0.27
/ M Tokens
Output:
$
1
/ M Tokens
DeepSeek
chat
DeepSeek-V3.1
Release on: Aug 25, 2025
Total Context:
164K
Max output:
164K
Input:
$
0.27
/ M Tokens
Output:
$
1
/ M Tokens
DeepSeek
chat
DeepSeek-V3
Release on: Dec 26, 2024
Total Context:
164K
Max output:
164K
Input:
$
0.25
/ M Tokens
Output:
$
1
/ M Tokens
DeepSeek
chat
DeepSeek-R1
Release on: May 28, 2025
Total Context:
164K
Max output:
164K
Input:
$
0.5
/ M Tokens
Output:
$
2.18
/ M Tokens
DeepSeek
chat
DeepSeek-R1-Distill-Qwen-32B
Release on: Jan 20, 2025
Total Context:
131K
Max output:
131K
Input:
$
0.18
/ M Tokens
Output:
$
0.18
/ M Tokens
