Qwen3-Next-80B-A3B-Instruct
About Qwen3-Next-80B-A3B-Instruct
Qwen3-Next-80B-A3B-Instruct is a next-generation foundation model released by Alibaba's Qwen team. It is built on the new Qwen3-Next architecture, designed for ultimate training and inference efficiency. The model incorporates innovative features such as a Hybrid Attention mechanism (Gated DeltaNet and Gated Attention), a High-Sparsity Mixture-of-Experts (MoE) structure, and various stability optimizations. As an 80-billion-parameter sparse model, it activates only about 3 billion parameters per token during inference, which significantly reduces computational costs and delivers over 10 times higher throughput than the Qwen3-32B model for long-context tasks exceeding 32K tokens. This is an instruction-tuned version optimized for general-purpose tasks and does not support 'thinking' mode. In terms of performance, it is comparable to Qwen's flagship model, Qwen3-235B, on certain benchmarks, showing significant advantages in ultra-long-context scenarios
Explore how Qwen3-Next-80B-A3B-Instruct's ultra-long context and efficient reasoning solve complex, large-scale problems.
Ultra-Long Document Synthesis
Process and synthesize insights from massive documents like legal briefs, research papers, or historical archives, leveraging its 1M token context.
Use Case Example:
"A legal team uses it to analyze 5000 pages of discovery documents, extracting key arguments and identifying relevant case law in minutes."
Large-Scale Codebase Analysis
Comprehend and optimize vast codebases by identifying architectural patterns, dependencies, and refactoring opportunities across millions of lines.
Use Case Example:
"An engineering firm employs it to refactor a legacy Python application, mapping module interactions and suggesting performance improvements for critical data pipelines."
Advanced Financial Market Intelligence
Analyze extensive real-time and historical financial data, news, and economic reports to predict market trends and formulate complex trading strategies.
Use Case Example:
"A financial analyst uses the model to process a decade of global market data and news articles, identifying subtle correlations for a new algorithmic trading strategy."
Comprehensive Regulatory Compliance
Automate auditing of complex regulatory frameworks and internal policies against operational data to ensure compliance and identify risks.
Use Case Example:
"A healthcare provider leverages it to cross-reference patient data handling with HIPAA regulations, flagging potential privacy violations and suggesting policy updates."
Scientific Discovery Acceleration
Accelerate research by analyzing vast scientific literature and experimental data to generate hypotheses, design experiments, and validate findings.
Use Case Example:
"A materials science researcher uses it to sift through thousands of journal articles and experimental results, proposing novel alloy compositions with desired properties."
Metadata
Specification
State
Deprecated
Architecture
Calibrated
No
Mixture of Experts
Yes
Total Parameters
80B
Activated Parameters
3B
Reasoning
No
Precision
FP8
Context length
262K
Max Tokens
262K
Compare with Other Models
See how this model stacks up against others.

Qwen
chat
Qwen3-VL-32B-Instruct
Release on: Oct 21, 2025
Total Context:
262K
Max output:
262K
Input:
$
0.2
/ M Tokens
Output:
$
0.6
/ M Tokens

Qwen
chat
Qwen3-VL-32B-Thinking
Release on: Oct 21, 2025
Total Context:
262K
Max output:
262K
Input:
$
0.2
/ M Tokens
Output:
$
1.5
/ M Tokens

Qwen
chat
Qwen3-VL-8B-Instruct
Release on: Oct 15, 2025
Total Context:
262K
Max output:
262K
Input:
$
0.18
/ M Tokens
Output:
$
0.68
/ M Tokens

Qwen
chat
Qwen3-VL-8B-Thinking
Release on: Oct 15, 2025
Total Context:
262K
Max output:
262K
Input:
$
0.18
/ M Tokens
Output:
$
2
/ M Tokens

Qwen
chat
Qwen3-VL-235B-A22B-Instruct
Release on: Oct 4, 2025
Total Context:
262K
Max output:
262K
Input:
$
0.3
/ M Tokens
Output:
$
1.5
/ M Tokens

Qwen
chat
Qwen3-VL-235B-A22B-Thinking
Release on: Oct 4, 2025
Total Context:
262K
Max output:
262K
Input:
$
0.45
/ M Tokens
Output:
$
3.5
/ M Tokens

Qwen
chat
Qwen3-VL-30B-A3B-Instruct
Release on: Oct 5, 2025
Total Context:
262K
Max output:
262K
Input:
$
0.29
/ M Tokens
Output:
$
1
/ M Tokens

Qwen
chat
Qwen3-VL-30B-A3B-Thinking
Release on: Oct 11, 2025
Total Context:
262K
Max output:
262K
Input:
$
0.29
/ M Tokens
Output:
$
1
/ M Tokens

Qwen
image-to-video
Wan2.2-I2V-A14B
Release on: Aug 13, 2025
$
0.29
/ Video
