Transparent Pricing
High-performance inference at competitive prices. Pay only for what you use with no hidden fees or commitments.
Transparent Pricing
High-performance inference at competitive prices. Pay only for what you use with no hidden fees or commitments.
Transparent Pricing
High-performance inference at competitive prices. Pay only for what you use with no hidden fees or commitments.

Serverless Pricing
Flexible token pricing, high usage limits, and postpaid billing—plus $1 in free credits to get you started!
Serverless Pricing
Flexible token pricing, high usage limits, and postpaid billing—plus $1 in free credits to get you started!
Serverless Pricing
Flexible token pricing, high usage limits, and postpaid billing—plus $1 in free credits to get you started!
DeepSeek
DeepSeek released the first open‑weight model and has gained global attention for building highly capable, cost‑efficient LLMs. Models such as DeepSeek‑V3.2 and DeepSeek‑R1 are competitive with top international models, delivering remarkable performance in reasoning, coding, and mathematical problem‑solving.
DeepSeek released the first open‑weight model and has gained global attention for building highly capable, cost‑efficient LLMs. Models such as DeepSeek‑V3.2 and DeepSeek‑R1 are competitive with top international models, delivering remarkable performance in reasoning, coding, and mathematical problem‑solving.
Model Name
Context Length
Context Length
Input
Output
Actions
Model Name
DeepSeek-V3.2
164K
Context Length
Input (/M Tokens)
$
0.27
$
0.42
Output (/M Tokens)
Model Name
DeepSeek-V3.2-Exp
164K
Context Length
Input (/M Tokens)
$
0.27
$
0.41
Output (/M Tokens)
Model Name
DeepSeek-V3.1-Terminus
164K
Context Length
Input (/M Tokens)
$
0.27
$
1.0
Output (/M Tokens)
Model Name
DeepSeek-V3.1
164K
Context Length
Input (/M Tokens)
$
0.27
$
1.0
Output (/M Tokens)
Model Name
DeepSeek-R1
164K
Context Length
Input (/M Tokens)
$
0.5
$
2.18
Output (/M Tokens)
Prices shown are per 1 million tokens.
Prices shown are per 1 million tokens.
Prices shown are per 1 million tokens.

Qwen
Open-source AI model family built by Alibaba Cloud, ranging from sub-1B to 480B+ parameters, designed to scale to any use case, from deep reasoning and math to autonomous coding.
Open-source AI model family built by Alibaba Cloud, ranging from sub-1B to 480B+ parameters, designed to scale to any use case, from deep reasoning and math to autonomous coding.
Model Name
Model Name
Context Length
Context Length
Input
Output
Actions
Model Name
Qwen3-VL-32B-Instruct
262K
Context Length
Input (/M Tokens)
$
0.2
$
0.6
Output (/M Tokens)
Model Name
Qwen3-VL-32B-Thinking
262K
Context Length
Input (/M Tokens)
$
0.2
$
1.5
Output (/M Tokens)
Model Name
Qwen3-VL-8B-Thinking
262K
Context Length
Input (/M Tokens)
$
0.18
$
2.0
Output (/M Tokens)
Model Name
Qwen3-VL-8B-Instruct
262K
Context Length
Input (/M Tokens)
$
0.18
$
0.68
Output (/M Tokens)
Model Name
Qwen3-VL-30B-A3B-Instruct
262K
Context Length
Input (/M Tokens)
$
0.29
$
1.0
Output (/M Tokens)
Prices shown are per 1 million tokens.
Prices shown are per 1 million tokens.
Prices shown are per 1 million tokens.

Z.ai
Zhipu AI builds the ChatGLM family of LLMs, develops LLMs as Agents. The latest model, GLM-4.7, delivers frontier-level performance in coding, creative writing, and role-play scenarios.
Zhipu AI builds the ChatGLM family of LLMs, develops LLMs as Agents. The latest model, GLM-4.7, delivers frontier-level performance in coding, creative writing, and role-play scenarios.
Model Name
Model Name
Context Length
Context Length
Input
Output
Actions
Model Name
GLM-4-32B-0414
33K
Context Length
Input (/M Tokens)
$
0.27
$
0.27
Output (/M Tokens)
Model Name
GLM-4-9B-0414
33K
Context Length
Input (/M Tokens)
$
0.086
$
0.086
Output (/M Tokens)
Model Name
GLM-4.1V-9B-Thinking
66K
Context Length
Input (/M Tokens)
$
0.035
$
0.14
Output (/M Tokens)
Model Name
GLM-4.5-Air
131K
Context Length
Input (/M Tokens)
$
0.14
$
0.86
Output (/M Tokens)
Model Name
GLM-4.5V
66K
Context Length
Input (/M Tokens)
$
0.14
$
0.86
Output (/M Tokens)
Prices shown are per 1 million tokens.
Prices shown are per 1 million tokens.
Prices shown are per 1 million tokens.

Moonshot AI
Moonshot AI stands out for breakthroughs in long-context language models. Its flagship product, Kimi, is especially well suited for research, legal work, and complex information synthesis. The latest release, Kimi K2 Thinking, is a state-of-the-art thinking agent with deep reasoning and tool orchestration.
Moonshot AI stands out for breakthroughs in long-context language models. Its flagship product, Kimi, is especially well suited for research, legal work, and complex information synthesis. The latest release, Kimi K2 Thinking, is a state-of-the-art thinking agent with deep reasoning and tool orchestration.
Model Name
Model Name
Context Length
Context Length
Input
Output
Actions
Model Name
Kimi-Dev-72B
131K
Context Length
Input (/M Tokens)
$
0.29
$
1.15
Output (/M Tokens)
Model Name
Kimi-K2-Instruct
131K
Context Length
Input (/M Tokens)
$
0.58
$
2.29
Output (/M Tokens)
Model Name
Kimi-K2-Instruct-0905
262K
Context Length
Input (/M Tokens)
$
0.4
$
2.0
Output (/M Tokens)
Prices shown are per 1 million tokens.
Prices shown are per 1 million tokens.
Prices shown are per 1 million tokens.

MiniMaxAI
Specialized in multimodal capabilities, MiniMax develops advanced models that seamlessly integrate text, voice, and vision, with notable achievements in natural-sounding text-to-speech and voice cloning.
Specialized in multimodal capabilities, MiniMax develops advanced models that seamlessly integrate text, voice, and vision, with notable achievements in natural-sounding text-to-speech and voice cloning.
Model Name
Model Name
Context Length
Context Length
Input
Output
Actions
Model Name
MiniMax-M1-80k
131K
Context Length
Input (/M Tokens)
$
0.55
$
2.2
Output (/M Tokens)
Model Name
MiniMax-M2
197K
Context Length
Input (/M Tokens)
$
0.3
$
1.2
Output (/M Tokens)
Prices shown are per 1 million tokens.
Prices shown are per 1 million tokens.
Prices shown are per 1 million tokens.
OpenAI
OpenAI is a pioneering AI research organization that helped spark today's generative AI revolution. Its GPT series brought LLMs into the mainstream and is currently led by GPT-5.2 and o3, which set industry benchmarks for natural language understanding, generation, and reasoning.
OpenAI is a pioneering AI research organization that helped spark today's generative AI revolution. Its GPT series brought LLMs into the mainstream and is currently led by GPT-5.2 and o3, which set industry benchmarks for natural language understanding, generation, and reasoning.
Model Name
Model Name
Context Length
Context Length
Input
Output
Actions
Model Name
gpt-oss-120b
131K
Context Length
Input (/M Tokens)
$
0.05
$
0.45
Output (/M Tokens)
Prices shown are per 1 million tokens.
Prices shown are per 1 million tokens.
Prices shown are per 1 million tokens.
Others
Model Name
Model Name
Context Length
Context Length
Input
Output
Actions
Model Name
DeepSeek-V3.1-Nex-N1
131K
Context Length
Input (/M Tokens)
$
0.27
$
1.0
Output (/M Tokens)
Model Name
ERNIE-4.5-300B-A47B
131K
Context Length
Input (/M Tokens)
$
0.28
$
1.1
Output (/M Tokens)
Model Name
Hunyuan-A13B-Instruct
131K
Context Length
Input (/M Tokens)
$
0.14
$
0.57
Output (/M Tokens)
Model Name
Hunyuan-MT-7B
33K
Context Length
Input (/M Tokens)
$
0.0
$
0.0
Output (/M Tokens)
Model Name
Ling-flash-2.0
131K
Context Length
Input (/M Tokens)
$
0.14
$
0.57
Output (/M Tokens)
Prices shown are per 1 million tokens.
Prices shown are per 1 million tokens.
Prices shown are per 1 million tokens.
Image Generation
Generate high-quality images from text prompts with our state-of-the-art image generation models.
Model Name
Price (/image)
Actions
Model Name
FLUX 1.1 [pro] Ultra
$
0.06
Price (
)
/ Image
Model Name
FLUX.1 Kontext [max]
$
0.08
Price (
)
/ Image
Model Name
FLUX.1 Kontext [pro]
$
0.04
Price (
)
/ Image
Prices shown are per image generated or edited.
Prices shown are per image generated or edited.
Prices shown are per image generated or edited.
Video Generation
Create dynamic videos from text descriptions with our cutting-edge video generation models.
Model Name
Price (/video)
Actions
Wan2.1-I2V-14B-720P
Wan2.1-I2V-14B-720P
$
0.29
Price (
)
/ Video
Wan2.1-I2V-14B-720P (Turbo)
Wan2.1-I2V-14B-720P (Turbo)
$
0.21
Price (
)
/ Video
Wan2.1-T2V-14B
Wan2.1-T2V-14B
$
0.29
Price (
)
/ Video
Wan2.1-T2V-14B (Turbo)
Wan2.1-T2V-14B (Turbo)
$
0.21
Price (
)
/ Video
Wan2.2-I2V-A14B
Wan2.2-I2V-A14B
$
0.29
Price (
)
/ Video
Prices shown are per video generated.
Prices shown are per video generated.
Prices shown are per video generated.
Audio Models
Process and generate audio with our high-quality speech recognition and synthesis models.
Model Name
Output (/M UTF-8 bytes)
Actions
Model Name
Fish-Speech-1.5
$
15.0
Price (
)
/ M UTF-8 bytes
Model Name
FunAudioLLM/CosyVoice2-0.5B
$
7.15
Price (
)
/ M UTF-8 bytes
Prices for transcription and translation are per minute of audio. Text-to-Speech prices are per 1,000 characters.
Prices for transcription and translation are per minute of audio. Text-to-Speech prices are per 1,000 characters.
Prices for transcription and translation are per minute of audio. Text-to-Speech prices are per 1,000 characters.

Frequently Asked Questions
How does billing work?
You're billed based on your usage. For chat models, you're charged per token for both input and output. For image, video, and audio models, pricing varies based on the specific task and output quality.
Are there any minimum commitments?
No, there are no minimum commitments. You only pay for what you use, and you can start with $1 in free credits.
Can I set spending limits?
Yes, you can set monthly spending limits in your account dashboard to control costs and prevent unexpected charges.
Do you offer volume discounts?
Yes, we offer volume discounts for high-usage customers. If your usage is substantial, please contact our sales team who can create a custom pricing plan tailored to your needs.
How do I get started?
Sign up for an account, get your API key, and start using our models right away. We provide comprehensive documentation and code examples to help you integrate quickly.
Ready to accelerate your AI development?

Ready to accelerate your AI development?

Ready to accelerate your AI development?



