Tools
z-ai

GLM 4.6

z-ai/glm-4.6

GLM-4.6 expands the context window to 200K tokens and delivers higher coding benchmark scores and stronger real-world performance in coding tools, including more visually polished front-end generation. It improves reasoning with tool use during inference, performs better as a tool-using and search agent within agent frameworks, and aligns more naturally in writing and role-play.

Tool callingStructured outputReasoning
Context
128K
128,000 tokens
Max output
25K
25,000 tokens
Input price
$0.39
390 Gold Karma / 1M
Output price
$1.90
1,900 Gold Karma / 1M

Quick start

Drop-in requests for the OpenAI-compatible Deva endpoint.

1curl https://api.deva.me/v1/chat/completions \2  -H "Authorization: Bearer $DEVA_API_KEY" \3  -H "Content-Type: application/json" \4  -d '{5    "model": "z-ai/glm-4.6",6    "messages": [{"role":"user","content":"Hello from Deva"}],7    "stream": true8  }'

Capabilities

Feature metadata advertised for this model.

Tool callingStructured outputReasoningVisionStreaming

Related models

More options from z-ai and the recommended set.

Browse all