Qwen3-Embedding-4B

Qwen/Qwen3-Embedding-4B

Qwen3-Embedding-4B is the latest proprietary model in the Qwen3 Embedding series, specifically designed for text embedding and ranking tasks. Built upon the dense foundational models of the Qwen3 series, this 4B parameter model supports context lengths up to 32K and can generate embeddings with dimensions up to 2560. The model inherits exceptional multilingual capabilities supporting over 100 languages, along with long-text understanding and reasoning skills. It achieves excellent performance on the MTEB multilingual leaderboard (score 69.45) and demonstrates outstanding results across various tasks including text retrieval, code retrieval, text classification, clustering, and bitext mining. The model offers flexible vector dimensions (32 to 2560) and instruction-aware capabilities for enhanced performance in specific tasks and scenarios, providing an optimal balance between efficiency and effectiveness

API Usage

curl --request POST \
  --url https://api.ap.siliconflow.com/v1/embeddings \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '{
  "model": "Qwen/Qwen3-Embedding-4B",
  "input": "Silicon flow embedding online: fast, affordable, and high-quality embedding services. come try it out!",
  "encoding_format": "float"
}'

Details

Model Provider

Qwen

Type

text

Sub Type

embedding

Size

0

Publish Time

Jun 6, 2025

Input Price

$

0.02

/ M Tokens

Context length

32768

Tags

2048 dim,32K

Ready to accelerate your AI development?

Ready to accelerate your AI development?

© 2025 SiliconFlow Technology PTE. LTD.

© 2025 SiliconFlow Technology PTE. LTD.

© 2025 SiliconFlow Technology PTE. LTD.