Serverless Pricing

Flexible token pricing, high usage limits, and postpaid billing—plus $1 in free credits to get you started!

Serverless Pricing

Flexible token pricing, high usage limits, and postpaid billing—plus $1 in free credits to get you started!

Serverless Pricing

Flexible token pricing, high usage limits, and postpaid billing—plus $1 in free credits to get you started!

DeepSeek

DeepSeek released the first open‑weight model and has gained global attention for building highly capable, cost‑efficient LLMs. Models such as DeepSeek‑V3.2 and DeepSeek‑R1 are competitive with top international models, delivering remarkable performance in reasoning, coding, and mathematical problem‑solving.

DeepSeek released the first open‑weight model and has gained global attention for building highly capable, cost‑efficient LLMs. Models such as DeepSeek‑V3.2 and DeepSeek‑R1 are competitive with top international models, delivering remarkable performance in reasoning, coding, and mathematical problem‑solving.

Model Name

Context Length

Context Length

Input

Output

Actions

DeepSeek-V3.2

164K

$

0.27

$

0.42

Model Name

DeepSeek-V3.2

164K

Context Length

Input (/M Tokens)

$

0.27

$

0.42

Output (/M Tokens)

DeepSeek-V3.2

164K

$

0.27

$

0.42

DeepSeek-V3.2-Exp

164K

$

0.27

$

0.41

Model Name

DeepSeek-V3.2-Exp

164K

Context Length

Input (/M Tokens)

$

0.27

$

0.41

Output (/M Tokens)

DeepSeek-V3.2-Exp

164K

$

0.27

$

0.41

DeepSeek-V3.1-Terminus

164K

$

0.27

$

1.0

Model Name

DeepSeek-V3.1-Terminus

164K

Context Length

Input (/M Tokens)

$

0.27

$

1.0

Output (/M Tokens)

DeepSeek-V3.1-Terminus

164K

$

0.27

$

1.0

DeepSeek-V3.1

164K

$

0.27

$

1.0

Model Name

DeepSeek-V3.1

164K

Context Length

Input (/M Tokens)

$

0.27

$

1.0

Output (/M Tokens)

DeepSeek-V3.1

164K

$

0.27

$

1.0

DeepSeek-R1

164K

$

0.5

$

2.18

Model Name

DeepSeek-R1

164K

Context Length

Input (/M Tokens)

$

0.5

$

2.18

Output (/M Tokens)

DeepSeek-R1

164K

$

0.5

$

2.18

Prices shown are per 1 million tokens.

Prices shown are per 1 million tokens.

Prices shown are per 1 million tokens.

Qwen

Open-source AI model family built by Alibaba Cloud, ranging from sub-1B to 480B+ parameters, designed to scale to any use case, from deep reasoning and math to autonomous coding.

Open-source AI model family built by Alibaba Cloud, ranging from sub-1B to 480B+ parameters, designed to scale to any use case, from deep reasoning and math to autonomous coding.

Model Name

Model Name

Context Length

Context Length

Input

Output

Actions

Qwen3-VL-32B-Instruct

262K

$

0.2

$

0.6

Model Name

Qwen3-VL-32B-Instruct

262K

Context Length

Input (/M Tokens)

$

0.2

$

0.6

Output (/M Tokens)

Qwen3-VL-32B-Instruct

262K

$

0.2

$

0.6

Qwen3-VL-32B-Thinking

262K

$

0.2

$

1.5

Model Name

Qwen3-VL-32B-Thinking

262K

Context Length

Input (/M Tokens)

$

0.2

$

1.5

Output (/M Tokens)

Qwen3-VL-32B-Thinking

262K

$

0.2

$

1.5

Qwen3-VL-8B-Thinking

262K

$

0.18

$

2.0

Model Name

Qwen3-VL-8B-Thinking

262K

Context Length

Input (/M Tokens)

$

0.18

$

2.0

Output (/M Tokens)

Qwen3-VL-8B-Thinking

262K

$

0.18

$

2.0

Qwen3-VL-8B-Instruct

262K

$

0.18

$

0.68

Model Name

Qwen3-VL-8B-Instruct

262K

Context Length

Input (/M Tokens)

$

0.18

$

0.68

Output (/M Tokens)

Qwen3-VL-8B-Instruct

262K

$

0.18

$

0.68

Qwen3-VL-30B-A3B-Instruct

262K

$

0.29

$

1.0

Model Name

Qwen3-VL-30B-A3B-Instruct

262K

Context Length

Input (/M Tokens)

$

0.29

$

1.0

Output (/M Tokens)

Qwen3-VL-30B-A3B-Instruct

262K

$

0.29

$

1.0

Prices shown are per 1 million tokens.

Prices shown are per 1 million tokens.

Prices shown are per 1 million tokens.

Z.ai

Zhipu AI builds the ChatGLM family of LLMs, develops LLMs as Agents. The latest model, GLM-4.7, delivers frontier-level performance in coding, creative writing, and role-play scenarios.

Zhipu AI builds the ChatGLM family of LLMs, develops LLMs as Agents. The latest model, GLM-4.7, delivers frontier-level performance in coding, creative writing, and role-play scenarios.

Model Name

Model Name

Context Length

Context Length

Input

Output

Actions

GLM-4-32B-0414

33K

$

0.27

$

0.27

Model Name

GLM-4-32B-0414

33K

Context Length

Input (/M Tokens)

$

0.27

$

0.27

Output (/M Tokens)

GLM-4-32B-0414

33K

$

0.27

$

0.27

GLM-4-9B-0414

33K

$

0.086

$

0.086

Model Name

GLM-4-9B-0414

33K

Context Length

Input (/M Tokens)

$

0.086

$

0.086

Output (/M Tokens)

GLM-4-9B-0414

33K

$

0.086

$

0.086

GLM-4.1V-9B-Thinking

66K

$

0.035

$

0.14

Model Name

GLM-4.1V-9B-Thinking

66K

Context Length

Input (/M Tokens)

$

0.035

$

0.14

Output (/M Tokens)

GLM-4.1V-9B-Thinking

66K

$

0.035

$

0.14

GLM-4.5-Air

131K

$

0.14

$

0.86

Model Name

GLM-4.5-Air

131K

Context Length

Input (/M Tokens)

$

0.14

$

0.86

Output (/M Tokens)

GLM-4.5-Air

131K

$

0.14

$

0.86

GLM-4.5V

66K

$

0.14

$

0.86

Model Name

GLM-4.5V

66K

Context Length

Input (/M Tokens)

$

0.14

$

0.86

Output (/M Tokens)

GLM-4.5V

66K

$

0.14

$

0.86

Prices shown are per 1 million tokens.

Prices shown are per 1 million tokens.

Prices shown are per 1 million tokens.

Moonshot AI

Moonshot AI stands out for breakthroughs in long-context language models. Its flagship product, Kimi, is especially well suited for research, legal work, and complex information synthesis. The latest release, Kimi K2 Thinking, is a state-of-the-art thinking agent with deep reasoning and tool orchestration.

Moonshot AI stands out for breakthroughs in long-context language models. Its flagship product, Kimi, is especially well suited for research, legal work, and complex information synthesis. The latest release, Kimi K2 Thinking, is a state-of-the-art thinking agent with deep reasoning and tool orchestration.

Model Name

Model Name

Context Length

Context Length

Input

Output

Actions

Kimi-Dev-72B

131K

$

0.29

$

1.15

Model Name

Kimi-Dev-72B

131K

Context Length

Input (/M Tokens)

$

0.29

$

1.15

Output (/M Tokens)

Kimi-Dev-72B

131K

$

0.29

$

1.15

Kimi-K2-Instruct

131K

$

0.58

$

2.29

Model Name

Kimi-K2-Instruct

131K

Context Length

Input (/M Tokens)

$

0.58

$

2.29

Output (/M Tokens)

Kimi-K2-Instruct

131K

$

0.58

$

2.29

Kimi-K2-Instruct-0905

262K

$

0.4

$

2.0

Model Name

Kimi-K2-Instruct-0905

262K

Context Length

Input (/M Tokens)

$

0.4

$

2.0

Output (/M Tokens)

Kimi-K2-Instruct-0905

262K

$

0.4

$

2.0

Kimi-K2-Thinking

262K

$

0.55

$

2.5

Model Name

Kimi-K2-Thinking

262K

Context Length

Input (/M Tokens)

$

0.55

$

2.5

Output (/M Tokens)

Kimi-K2-Thinking

262K

$

0.55

$

2.5

Prices shown are per 1 million tokens.

Prices shown are per 1 million tokens.

Prices shown are per 1 million tokens.

MiniMaxAI

Specialized in multimodal capabilities, MiniMax develops advanced models that seamlessly integrate text, voice, and vision, with notable achievements in natural-sounding text-to-speech and voice cloning.

Specialized in multimodal capabilities, MiniMax develops advanced models that seamlessly integrate text, voice, and vision, with notable achievements in natural-sounding text-to-speech and voice cloning.

Model Name

Model Name

Context Length

Context Length

Input

Output

Actions

MiniMax-M1-80k

131K

$

0.55

$

2.2

Model Name

MiniMax-M1-80k

131K

Context Length

Input (/M Tokens)

$

0.55

$

2.2

Output (/M Tokens)

MiniMax-M1-80k

131K

$

0.55

$

2.2

MiniMax-M2

197K

$

0.3

$

1.2

Model Name

MiniMax-M2

197K

Context Length

Input (/M Tokens)

$

0.3

$

1.2

Output (/M Tokens)

MiniMax-M2

197K

$

0.3

$

1.2

MiniMax-M2.1

197K

$

0.29

$

1.2

Model Name

MiniMax-M2.1

197K

Context Length

Input (/M Tokens)

$

0.29

$

1.2

Output (/M Tokens)

MiniMax-M2.1

197K

$

0.29

$

1.2

Prices shown are per 1 million tokens.

Prices shown are per 1 million tokens.

Prices shown are per 1 million tokens.

OpenAI

OpenAI is a pioneering AI research organization that helped spark today's generative AI revolution. Its GPT series brought LLMs into the mainstream and is currently led by GPT-5.2 and o3, which set industry benchmarks for natural language understanding, generation, and reasoning.

OpenAI is a pioneering AI research organization that helped spark today's generative AI revolution. Its GPT series brought LLMs into the mainstream and is currently led by GPT-5.2 and o3, which set industry benchmarks for natural language understanding, generation, and reasoning.

Model Name

Model Name

Context Length

Context Length

Input

Output

Actions

gpt-oss-120b

131K

$

0.05

$

0.45

Model Name

gpt-oss-120b

131K

Context Length

Input (/M Tokens)

$

0.05

$

0.45

Output (/M Tokens)

gpt-oss-120b

131K

$

0.05

$

0.45

gpt-oss-20b

131K

$

0.04

$

0.18

Model Name

gpt-oss-20b

131K

Context Length

Input (/M Tokens)

$

0.04

$

0.18

Output (/M Tokens)

gpt-oss-20b

131K

$

0.04

$

0.18

Prices shown are per 1 million tokens.

Prices shown are per 1 million tokens.

Prices shown are per 1 million tokens.

Others

Model Name

Model Name

Context Length

Context Length

Input

Output

Actions

DeepSeek-V3.1-Nex-N1

131K

$

0.27

$

1.0

Model Name

DeepSeek-V3.1-Nex-N1

131K

Context Length

Input (/M Tokens)

$

0.27

$

1.0

Output (/M Tokens)

DeepSeek-V3.1-Nex-N1

131K

$

0.27

$

1.0

ERNIE-4.5-300B-A47B

131K

$

0.28

$

1.1

Model Name

ERNIE-4.5-300B-A47B

131K

Context Length

Input (/M Tokens)

$

0.28

$

1.1

Output (/M Tokens)

ERNIE-4.5-300B-A47B

131K

$

0.28

$

1.1

Hunyuan-A13B-Instruct

131K

$

0.14

$

0.57

Model Name

Hunyuan-A13B-Instruct

131K

Context Length

Input (/M Tokens)

$

0.14

$

0.57

Output (/M Tokens)

Hunyuan-A13B-Instruct

131K

$

0.14

$

0.57

Hunyuan-MT-7B

33K

$

0.0

$

0.0

Model Name

Hunyuan-MT-7B

33K

Context Length

Input (/M Tokens)

$

0.0

$

0.0

Output (/M Tokens)

Hunyuan-MT-7B

33K

$

0.0

$

0.0

Ling-flash-2.0

131K

$

0.14

$

0.57

Model Name

Ling-flash-2.0

131K

Context Length

Input (/M Tokens)

$

0.14

$

0.57

Output (/M Tokens)

Ling-flash-2.0

131K

$

0.14

$

0.57

Prices shown are per 1 million tokens.

Prices shown are per 1 million tokens.

Prices shown are per 1 million tokens.

Image Generation

Generate high-quality images from text prompts with our state-of-the-art image generation models.

Model Name

Price (/image)

Actions

FLUX 1.1 [pro]

$

0.04

Model Name

FLUX 1.1 [pro]

$

0.04

Price (

)

/ Image

FLUX 1.1 [pro]

$

0.04

FLUX 1.1 [pro] Ultra

$

0.06

Model Name

FLUX 1.1 [pro] Ultra

$

0.06

Price (

)

/ Image

FLUX 1.1 [pro] Ultra

$

0.06

FLUX.1 Kontext [max]

$

0.08

Model Name

FLUX.1 Kontext [max]

$

0.08

Price (

)

/ Image

FLUX.1 Kontext [max]

$

0.08

FLUX.1 Kontext [pro]

$

0.04

Model Name

FLUX.1 Kontext [pro]

$

0.04

Price (

)

/ Image

FLUX.1 Kontext [pro]

$

0.04

FLUX.1-dev

$

0.014

Model Name

FLUX.1-dev

$

0.014

Price (

)

/ Image

FLUX.1-dev

$

0.014

Prices shown are per image generated or edited.

Prices shown are per image generated or edited.

Prices shown are per image generated or edited.

Video Generation

Create dynamic videos from text descriptions with our cutting-edge video generation models.

Model Name

Price (/video)

Actions

Wan2.1-I2V-14B-720P

$

0.29

Wan2.1-I2V-14B-720P

Wan2.1-I2V-14B-720P

$

0.29

Price (

)

/ Video

Wan2.1-I2V-14B-720P

$

0.29

Wan2.1-I2V-14B-720P (Turbo)

$

0.21

Wan2.1-I2V-14B-720P (Turbo)

Wan2.1-I2V-14B-720P (Turbo)

$

0.21

Price (

)

/ Video

Wan2.1-I2V-14B-720P (Turbo)

$

0.21

Wan2.1-T2V-14B

$

0.29

Wan2.1-T2V-14B

Wan2.1-T2V-14B

$

0.29

Price (

)

/ Video

Wan2.1-T2V-14B

$

0.29

Wan2.1-T2V-14B (Turbo)

$

0.21

Wan2.1-T2V-14B (Turbo)

Wan2.1-T2V-14B (Turbo)

$

0.21

Price (

)

/ Video

Wan2.1-T2V-14B (Turbo)

$

0.21

Wan2.2-I2V-A14B

$

0.29

Wan2.2-I2V-A14B

Wan2.2-I2V-A14B

$

0.29

Price (

)

/ Video

Wan2.2-I2V-A14B

$

0.29

Wan2.2-T2V-A14B

$

0.29

Wan2.2-T2V-A14B

Wan2.2-T2V-A14B

$

0.29

Price (

)

/ Video

Wan2.2-T2V-A14B

$

0.29

Prices shown are per video generated.

Prices shown are per video generated.

Prices shown are per video generated.

Audio Models

Process and generate audio with our high-quality speech recognition and synthesis models.

Model Name

Output (/M UTF-8 bytes)

Actions

Fish-Speech-1.5

$

15.0

Model Name

Fish-Speech-1.5

$

15.0

Price (

)

/ M UTF-8 bytes

Fish-Speech-1.5

$

15.0

FunAudioLLM/CosyVoice2-0.5B

$

7.15

Model Name

FunAudioLLM/CosyVoice2-0.5B

$

7.15

Price (

)

/ M UTF-8 bytes

FunAudioLLM/CosyVoice2-0.5B

$

7.15

IndexTTS-2

$

7.15

Model Name

IndexTTS-2

$

7.15

Price (

)

/ M UTF-8 bytes

IndexTTS-2

$

7.15

Prices for transcription and translation are per minute of audio. Text-to-Speech prices are per 1,000 characters.

Prices for transcription and translation are per minute of audio. Text-to-Speech prices are per 1,000 characters.

Prices for transcription and translation are per minute of audio. Text-to-Speech prices are per 1,000 characters.

Frequently Asked Questions

How does billing work?

You're billed based on your usage. For chat models, you're charged per token for both input and output. For image, video, and audio models, pricing varies based on the specific task and output quality.

Are there any minimum commitments?

No, there are no minimum commitments. You only pay for what you use, and you can start with $1 in free credits.

Can I set spending limits?

Yes, you can set monthly spending limits in your account dashboard to control costs and prevent unexpected charges.

Do you offer volume discounts?

Yes, we offer volume discounts for high-usage customers. If your usage is substantial, please contact our sales team who can create a custom pricing plan tailored to your needs.

How do I get started?

Sign up for an account, get your API key, and start using our models right away. We provide comprehensive documentation and code examples to help you integrate quickly.

Ready to accelerate your AI development?

Ready to accelerate your AI development?

Ready to accelerate your AI development?