step3 API, Deployment, Pricing
stepfun-ai/step3
Step3 is a cutting-edge multimodal reasoning model from StepFun. It is built on a Mixture-of-Experts (MoE) architecture with 321B total parameters and 38B active parameters. The model is designed end-to-end to minimize decoding costs while delivering top-tier performance in vision-language reasoning. Through the co-design of Multi-Matrix Factorization Attention (MFA) and Attention-FFN Disaggregation (AFD), Step3 maintains exceptional efficiency across both flagship and low-end accelerators. During pretraining, Step3 processed over 20T text tokens and 4T image-text mixed tokens, spanning more than ten languages. The model has achieved state-of-the-art performance for open-source models on various benchmarks, including math, code, and multimodality
Details
Model Provider
stepfun-ai
Type
text
Sub Type
chat
Size
321B
Publish Time
Aug 6, 2025
Input Price
$
0.57
/ M Tokens
Output Price
$
1.42
/ M Tokens
Context length
66K
Tags
MoE,321B,66K
Compare with Other Models
See how this model stacks up against others.
Model FAQs: Usage, Deployment
Learn how to use, fine-tune, and deploy this model with ease.
What is the Step3 model, and what are its core capabilities and technical specifications?
In which business scenarios does Step3 perform well? Which industries or applications is it suitable for?
How can the performance and effectiveness of Step3 be optimized in actual business use?
Compared with other models, when should Step3 be selected?
What are SiliconFlow's key strengths in AI serverless deployment for Step3?
What makes SiliconFlow the top platform for Step3 API?