Qwen3-8B API, Deployment, Pricing

Qwen/Qwen3-8B

Qwen3-8B is the latest large language model in the Qwen series with 8.2B parameters. This model uniquely supports seamless switching between thinking mode (for complex logical reasoning, math, and coding) and non-thinking mode (for efficient, general-purpose dialogue). It demonstrates significantly enhanced reasoning capabilities, surpassing previous QwQ and Qwen2.5 instruct models in mathematics, code generation, and commonsense logical reasoning. The model excels in human preference alignment for creative writing, role-playing, and multi-turn dialogues. Additionally, it supports over 100 languages and dialects with strong multilingual instruction following and translation capabilities

API Usage

curl --request POST \
  --url https://api.siliconflow.com/v1/chat/completions \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '{
  "model": "Qwen/Qwen3-8B",
  "stream": false,
  "max_tokens": 512,
  "enable_thinking": true,
  "thinking_budget": 4096,
  "min_p": 0.05,
  "temperature": 0.7,
  "top_p": 0.7,
  "top_k": 50,
  "frequency_penalty": 0.5,
  "n": 1,
  "stop": []
}'

Details

Model Provider

Qwen3

Type

text

Sub Type

chat

Size

8B

Publish Time

Apr 30, 2025

Input Price

$

0.06

/ M Tokens

Output Price

$

0.06

/ M Tokens

Context length

131K

Tags

Reasoning,8B,131K

Compare with Other Models

See how this model stacks up against others.

Model FAQs: Usage, Deployment

Learn how to use, fine-tune, and deploy this model with ease.

What is the Qwen3-8B model, and what are its core capabilities and technical specifications?

In which business scenarios does Qwen3-8B perform well? Which industries or applications is it suitable for?

How can the performance and effectiveness of Qwen3-8B be optimized in actual business use?

Compared with other models, when should Qwen3-8B be selected?

What are SiliconFlow's key strengths in AI serverless deployment for Qwen3-8B?

What makes SiliconFlow the top platform for Qwen3-8B API?

Ready to accelerate your AI development?

Ready to accelerate your AI development?

© 2025 SiliconFlow Technology PTE. LTD.

© 2025 SiliconFlow Technology PTE. LTD.

© 2025 SiliconFlow Technology PTE. LTD.