MiniMax: MiniMax-01

minimax/minimax-01

Description

MiniMax-01 combines MiniMax-Text-01 for text generation and MiniMax-VL-01 for image understanding. It has 456 billion parameters in total, of which 45.9 billion are activated per token, and can handle a context of up to 4 million tokens. The text model uses a hybrid architecture that combines Lightning Attention, Softmax Attention, and Mixture-of-Experts (MoE). The image model follows the “ViT-MLP-LLM” framework and is trained on top of the text model.
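To put the two parameter counts in the description side by side, the short sketch below computes the share of the model that is active at once. The constant and function names are illustrative only; the two figures are the ones quoted above.

```python
# Figures quoted in the description above.
TOTAL_PARAMS = 456e9     # total parameters
ACTIVE_PARAMS = 45.9e9   # parameters activated at a time


def active_fraction(total: float, active: float) -> float:
    """Return the share of parameters that is active (0.0 to 1.0)."""
    if total <= 0:
        raise ValueError("total must be positive")
    return active / total


fraction = active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS)
print(f"Active share: {fraction:.1%}")  # prints "Active share: 10.1%"
```

Roughly one tenth of the weights take part in any single forward step, which is the usual trade-off a Mixture-of-Experts design aims for: large total capacity with a much smaller compute cost per step.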

How this model compares

"Overall" covers the full catalog. "By plan" covers only models available on that tier (the same rules as the available models in your list). Each chart shows this model's position on a min-average-max scale. Prices use the higher of the prompt or completion price per token, shown per 1M tokens.

Price (per 1M tokens)

336 models in this group

Min: $0.04
Avg: $12.574888
Max: $750.00

This model: $1.10 / 1M tokens
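The comparison rule stated earlier (take the higher of the prompt or completion price per token, then express it per 1M tokens) can be sketched as follows. This is a minimal illustration, not the catalog's actual implementation: the function names are hypothetical, the prompt price used in the example is a made-up placeholder, and the linear min-to-max scale in `position` is an assumption, since the listing does not say how the chart is scaled.

```python
def comparison_price_per_million(prompt_per_token: float,
                                 completion_per_token: float) -> float:
    """Higher of the two per-token prices, scaled to 1M tokens."""
    return max(prompt_per_token, completion_per_token) * 1_000_000


def position(value: float, lo: float, hi: float) -> float:
    """Relative position of value between lo and hi (0.0 = min, 1.0 = max).

    Assumes a linear scale; the real chart may use a different one.
    """
    if hi <= lo:
        raise ValueError("hi must be greater than lo")
    return (value - lo) / (hi - lo)


# Hypothetical per-token prices; only the $1.10 result appears in the listing.
price = comparison_price_per_million(prompt_per_token=0.2e-6,
                                     completion_per_token=1.1e-6)

# Group minimum and maximum taken from the listing above.
price_position = position(price, lo=0.04, hi=750.00)
print(f"${price:.2f} per 1M tokens, position {price_position:.4f}")
```

On a linear scale the price sits very close to the minimum, because the $750.00 maximum stretches the range far beyond most models in the group.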

Context length (tokens)

336 models in this group

Min: 4,095 tokens
Avg: 398,336.839 tokens
Max: 2,000,000 tokens

This model: 1,000,192 tokens

Capabilities

Text + Image → Text

Context: 1,000,192 tokens

Input: Text, Image

Output: Text
