Qwen3-Reranker-4B
About Qwen3-Reranker-4B
Qwen3-Reranker-4B is a powerful text reranking model from the Qwen3 series, featuring 4 billion parameters. It is engineered to significantly improve the relevance of search results by re-ordering an initial list of documents based on a query. This model inherits the core strengths of its Qwen3 foundation, including exceptional understanding of long-text (up to 32k context length) and robust capabilities across more than 100 languages. According to benchmarks, the Qwen3-Reranker-4B model demonstrates superior performance in various text and code retrieval evaluations
Discover how Qwen3-Reranker-4B elevates information retrieval by intelligently re-ordering search results, ensuring users find the most relevant content quickly across diverse languages and long texts.
Enterprise Search
Improve internal document search relevance, boosting employee productivity and information discovery.
Use Case Example:
"Re-ordered 100 internal HR policy documents for a query on "remote work guidelines," placing the most current and relevant policies at the top, saving employees significant search time."
E-commerce Reranking
Refine product recommendations by re-ordering items based on user queries and preferences, increasing conversion rates.
Use Case Example:
"For a user searching "gaming laptop," re-ranked 50 initial product matches to prioritize those with high-end GPUs and positive reviews, leading to a 15% increase in click-through rate."
Developer Resource Search
Enhance code search and documentation discovery for developers, quickly surfacing the most relevant solutions.
Use Case Example:
"Re-ranked 20 potential JavaScript code snippets for a specific React component integration query, highlighting the most efficient and up-to-date example, reducing development time."
Legal Document Relevance
Accelerate legal research by re-ordering case law, statutes, or contracts to prioritize the most pertinent information.
Use Case Example:
"For a query on "GDPR compliance for data breaches," re-ranked 30 legal documents, bringing the most recent and directly applicable regulations to the top, significantly aiding legal counsel in their review."
Metadata
Specification
State
Deprecated
Architecture
Dense Transformer
Calibrated
Yes
Mixture of Experts
No
Total Parameters
4B
Activated Parameters
4B
Reasoning
No
Precision
FP8
Context length
33K
Max Tokens
Compare with Other Models
See how this model stacks up against others.

Qwen
chat
Qwen3-VL-32B-Instruct
Release on: Oct 21, 2025
Total Context:
262K
Max output:
262K
Input:
$
0.2
/ M Tokens
Output:
$
0.6
/ M Tokens

Qwen
chat
Qwen3-VL-32B-Thinking
Release on: Oct 21, 2025
Total Context:
262K
Max output:
262K
Input:
$
0.2
/ M Tokens
Output:
$
1.5
/ M Tokens

Qwen
chat
Qwen3-VL-8B-Instruct
Release on: Oct 15, 2025
Total Context:
262K
Max output:
262K
Input:
$
0.18
/ M Tokens
Output:
$
0.68
/ M Tokens

Qwen
chat
Qwen3-VL-8B-Thinking
Release on: Oct 15, 2025
Total Context:
262K
Max output:
262K
Input:
$
0.18
/ M Tokens
Output:
$
2
/ M Tokens

Qwen
chat
Qwen3-VL-235B-A22B-Instruct
Release on: Oct 4, 2025
Total Context:
262K
Max output:
262K
Input:
$
0.3
/ M Tokens
Output:
$
1.5
/ M Tokens

Qwen
chat
Qwen3-VL-235B-A22B-Thinking
Release on: Oct 4, 2025
Total Context:
262K
Max output:
262K
Input:
$
0.45
/ M Tokens
Output:
$
3.5
/ M Tokens

Qwen
chat
Qwen3-VL-30B-A3B-Instruct
Release on: Oct 5, 2025
Total Context:
262K
Max output:
262K
Input:
$
0.29
/ M Tokens
Output:
$
1
/ M Tokens

Qwen
chat
Qwen3-VL-30B-A3B-Thinking
Release on: Oct 11, 2025
Total Context:
262K
Max output:
262K
Input:
$
0.29
/ M Tokens
Output:
$
1
/ M Tokens

Qwen
image-to-video
Wan2.2-I2V-A14B
Release on: Aug 13, 2025
$
0.29
/ Video
