Qwen: Qwen3.5 397B A17B

qwen/qwen3.5-397b-a17b

Description

The Qwen3.5 series 397B-A17B is a native vision-language model built on a hybrid architecture that combines a linear attention mechanism with a sparse mixture-of-experts model for higher inference efficiency. It delivers performance comparable to leading-edge models across a wide range of tasks, including language understanding, logical reasoning, code generation, agent-based tasks, image understanding, video understanding, and graphical user interface (GUI) interactions. With its robust code-generation and agent capabilities, the model generalizes well across diverse agent-based tasks.

How this model compares

"Overall" covers the full catalog. "By plan" covers only models available on that tier (the same rules that determine the available models in your list). Each chart shows this model's position on a min–average–max scale. Prices use the higher of the prompt or completion per-token price, shown per 1M tokens.
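The pricing rule above can be sketched as a small calculation. This is a minimal illustration, not the catalog's actual code: the function name and both per-token input prices are hypothetical, chosen only so that the result matches the $2.34 figure shown for this model.

```python
from decimal import Decimal

TOKENS_PER_MILLION = 1_000_000


def comparison_price_per_million(prompt_per_token: Decimal,
                                 completion_per_token: Decimal) -> Decimal:
    """Return the higher of the two per-token prices, scaled to 1M tokens."""
    return max(prompt_per_token, completion_per_token) * TOKENS_PER_MILLION


# Hypothetical per-token prices (assumed for illustration; the listing
# only shows the resulting per-1M figure, not the two inputs).
example = comparison_price_per_million(
    Decimal("0.00000039"),  # assumed prompt price per token
    Decimal("0.00000234"),  # assumed completion price per token
)
print(f"${example:.2f} / 1M tokens")
```

Decimal is used instead of float so that very small per-token prices scale to a per-1M figure without rounding noise.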

Price (per 1M tokens)

336 models in this group
Min: $0.04
Avg: $12.381012
Max: $750.00
This model: $2.34 / 1M tokens

Context length (tokens)

336 models in this group
Min: 4,095 tokens
Avg: 382,115.467 tokens
Max: 10,000,000 tokens
This model: 262,144 tokens
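One way to read the min–average–max figures is to compute where this model falls between the group minimum and maximum, and whether it sits above or below the group average. The sketch below assumes a simple linear scale; the listing does not say how its charts place the marker (a logarithmic scale is equally plausible), and the helper names are hypothetical.

```python
def relative_position(value: float, lo: float, hi: float) -> float:
    """Fraction of the way from lo to hi on a linear scale (0.0 = min, 1.0 = max)."""
    if hi <= lo:
        raise ValueError("hi must be greater than lo")
    return (value - lo) / (hi - lo)


def versus_average(value: float, avg: float) -> str:
    """Describe a value relative to the group average."""
    if value < avg:
        return "below average"
    if value > avg:
        return "above average"
    return "at average"


# Figures taken from the listing above.
price_pos = relative_position(2.34, 0.04, 750.00)
context_pos = relative_position(262_144, 4_095, 10_000_000)

print(f"price: {price_pos:.2%} of the range, {versus_average(2.34, 12.381012)}")
print(f"context: {context_pos:.2%} of the range, {versus_average(262_144, 382_115.467)}")
```

Under this linear assumption the model sits near the low end of both ranges (roughly 0.3% of the price range and 2.6% of the context range), and below the group average on both measures. Both averages are pulled upward by a few very large maximum values.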

Capabilities

text+image+video->text
Context: 262,144 tokens
Input: Text, Image, Video
Output: Text