Qwen: Qwen3.5 397B A17B
qwen/qwen3.5-397b-a17b
Description
The Qwen3.5 series 397B-A17B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. It delivers state-of-the-art performance comparable to leading-edge models across a wide range of tasks, including language understanding, logical reasoning, code generation, agent-based tasks, image understanding, video understanding, and graphical user interface (GUI) interactions. With its robust code-generation and agent capabilities, the model exhibits strong generalization across diverse agent.
How this model compares
Overall covers the full catalog. By plan covers only models available on that tier (same rules as available models in your list). Position on min–average–max. Prices use the higher of prompt or completion per token, shown per 1M tokens.
Price (per 1M tokens)
Min
Max
This model
336 models in this groupPrice (per 1M tokens)
- Min
- $0.04
- Avg
- $12.381012
- Max
- $750.00
This model: $2.34 / 1M tokens
Context length (tokens)
Min
Max
This model
336 models in this groupContext length (tokens)
- Min
- 4,095 tokens
- Avg
- 382,115.467 tokens
- Max
- 10,000,000 tokens
This model: 262,144 tokens
Capabilities
text+image+video->textContext: 262,144 tokens
Input:
TextImageVideo
Output:
Text