Z.ai: GLM 4.7 Flash

z-ai/glm-4.7-flash

Description

GLM-4.7-Flash is a 30B-class state-of-the-art model that balances performance and efficiency. It is further optimized for agentic coding use cases, with stronger coding capabilities, long-horizon task planning, and tool collaboration, and it leads open-source models of the same size on several current public benchmark leaderboards.

How this model compares

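"Overall" compares this model against the full catalog. "By plan" compares it only against models available on that tier (the same rules that determine the available models in your list). Each comparison shows this model's position relative to the group's minimum, average, and maximum. Prices use the higher of the prompt or completion per-token price, shown per 1M tokens.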
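The pricing rule above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names and the per-token prices are hypothetical (the page lists only the resulting $0.40 per 1M tokens), and the linear position calculation is a guess at how a min-to-max scale could be computed, not the catalog's confirmed method.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)


def comparison_price_per_million(prompt_per_token, completion_per_token):
    """Take the higher of the two per-token prices and scale it to 1M tokens."""
    higher = max(Decimal(prompt_per_token), Decimal(completion_per_token))
    return higher * TOKENS_PER_MILLION


def relative_position(value, minimum, maximum):
    """Linear position of value between minimum (0) and maximum (1).

    Assumption: a linear scale. The catalog may use a different scale.
    """
    span = Decimal(maximum) - Decimal(minimum)
    if span == 0:
        return Decimal(0)
    return (Decimal(value) - Decimal(minimum)) / span


# Hypothetical per-token prices, chosen so the higher one maps to $0.40 per 1M.
price = comparison_price_per_million("0.00000006", "0.0000004")
print(f"${price:.2f} per 1M tokens")

# Position of that price between the group's listed min ($0.04) and max ($750.00).
print(relative_position(price, "0.04", "750.00"))
```

Decimal arithmetic is used here so that very small per-token prices scale to per-million figures without floating-point rounding noise.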

Price (per 1M tokens)

336 models in this group

Min: $0.04
Avg: $12.574888
Max: $750.00

This model: $0.40 / 1M tokens

Context length (tokens)

336 models in this group

Min: 4,095 tokens
Avg: 398,336.839 tokens
Max: 2,000,000 tokens

This model: 202,752 tokens

Capabilities

Text → Text

Context: 202,752 tokens

Input: Text

Output: Text
