Models

Products

Pricing

Docs

Blog

About

Contact

Back to Models

Qwen3-8B

Qwen/Qwen3-8B

Qwen3-8B is the latest large language model in the Qwen series with 8.2B parameters. This model uniquely supports seamless switching between thinking mode (for complex logical reasoning, math, and coding) and non-thinking mode (for efficient, general-purpose dialogue). It demonstrates significantly enhanced reasoning capabilities, surpassing previous QwQ and Qwen2.5 instruct models in mathematics, code generation, and commonsense logical reasoning. The model excels in human preference alignment for creative writing, role-playing, and multi-turn dialogues. Additionally, it supports over 100 languages and dialects with strong multilingual instruction following and translation capabilities

API Usage

cURL

Python

JavaScript

curl --request POST \
  --url https://api.ap.siliconflow.com/v1/chat/completions \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '{
  "model": "Qwen/Qwen3-8B",
  "stream": false,
  "max_tokens": 512,
  "enable_thinking": true,
  "thinking_budget": 4096,
  "min_p": 0.05,
  "temperature": 0.7,
  "top_p": 0.7,
  "top_k": 50,
  "frequency_penalty": 0.5,
  "n": 1,
  "stop": []
}'

Details

Model Provider

Qwen3

Type

text

Sub Type

chat

Size

8

Publish Time

Apr 30, 2025

Input Price

$

0.06

/ M Tokens

Output Price

$

0.06

/ M Tokens

Context length

131072

Tags

Reasoning,8B,128K

Open in Playround

API Reference

Ready to accelerate your AI development?

Ready to accelerate your AI development?

PAGES

MODELS

PRODUCTS

© 2025 SiliconFlow Technology PTE. LTD.

·

PAGES

MODELS

PRODUCTS

© 2025 SiliconFlow Technology PTE. LTD.

·

PAGES

MODELS

PRODUCTS

© 2025 SiliconFlow Technology PTE. LTD.

·