Tools
deepseekRecommended

DeepSeek V4 Flash

deepseek/deepseek-v4-flash

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model with 284B total parameters (13B activated) and a 1M-token context window, designed for fast inference and high-throughput workloads while maintaining strong reasoning and coding performance. Hybrid attention enables efficient long-context processing, with high and xhigh reasoning efforts, making it ideal for coding assistants, chat systems, and agent workflows.

Tool callingStructured outputReasoning
Context
1M
1,048,576 tokens
Max output
25K
25,000 tokens
Input price
$0.10
98 Gold Karma / 1M
Output price
$0.20
197 Gold Karma / 1M

Quick start

Drop-in requests for the OpenAI-compatible Deva endpoint.

1curl https://api.deva.me/v1/chat/completions \2  -H "Authorization: Bearer $DEVA_API_KEY" \3  -H "Content-Type: application/json" \4  -d '{5    "model": "deepseek/deepseek-v4-flash",6    "messages": [{"role":"user","content":"Hello from Deva"}],7    "stream": true8  }'

Capabilities

Feature metadata advertised for this model.

Tool callingStructured outputReasoningVisionStreaming

Related models

More options from deepseek and the recommended set.

Browse all