Qwen: Qwen3 Next 80B A3B Thinking

qwen/qwen3-next-80b-a3b-thinking

Description

Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured thinking traces by default. It is designed for hard multi-step problems such as math proofs, code synthesis and debugging, logic, and agentic planning. It reports strong results across knowledge, reasoning, coding, alignment, and multilingual evaluations. Compared with prior Qwen3 variants, it emphasizes stability under long chains of thought and efficient scaling during inference, and it is tuned to follow complex instructions while reducing repetitive or off-task behavior. The model is suitable for agent frameworks and tool use (function calling), retrieval-heavy workflows, and standardized benchmarking where step-by-step solutions are required. It supports long, detailed completions and leverages throughput-oriented techniques such as multi-token prediction for faster generation. Note that it operates in thinking-only mode.

How this model compares

Overall covers the full catalog. By plan covers only models available on that tier (same rules as available models in your list). Position on min–average–max. Prices use the higher of prompt or completion per token, shown per 1M tokens.

Price (per 1M tokens)

Min
Max
This model
339 models in this groupPrice (per 1M tokens)
Min
$0.04
Avg
$12.395447
Max
$750.00
This model: $0.78 / 1M tokens

Context length (tokens)

Min
Max
This model
339 models in this groupContext length (tokens)
Min
4,095 tokens
Avg
379,884.782 tokens
Max
10,000,000 tokens
This model: 262,144 tokens

Capabilities

Text → TextContext: 131,072 tokens
Input:
Text
Output:
Text
    Qwen: Qwen3 Next 80B A3B Thinking