deepseekRecommended

DeepSeek V4 Flash

deepseek/deepseek-v4-flash

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model with 284B total parameters (13B activated) and a 1M-token context window, designed for fast inference and high-throughput workloads while maintaining strong reasoning and coding performance. Hybrid attention enables efficient long-context processing, with high and xhigh reasoning efforts, making it ideal for coding assistants, chat systems, and agent workflows.

Tool callingStructured outputReasoning

Context

1,048,576 tokens

Max output

25K

25,000 tokens

Input price

$0.10

0 Gold Karma / 1M

Output price

$0.20

0 Gold Karma / 1M

Quick start

Drop-in requests for the OpenAI-compatible Deva endpoint.

1curl https://api.deva.me/v1/chat/completions \2  -H "Authorization: Bearer $DEVA_API_KEY" \3  -H "Content-Type: application/json" \4  -d '{5    "model": "deepseek/deepseek-v4-flash",6    "messages": [{"role":"user","content":"Hello from Deva"}],7    "stream": true8  }'

Capabilities

Feature metadata advertised for this model.

Tool callingStructured outputReasoningVisionStreaming

Related models

More options from deepseek and the recommended set.

Browse all

DEEPSEEK: DeepSeek Chat v3.1

Tool callingStructured outputReasoning

164K context$0.2/M in$0.8/M out

DEEPSEEK: DeepSeek V4 Pro

Tool callingStructured outputReasoning

1M context$0.43/M in$0.87/M out

X AI: Grok 4.3

Tool callingStructured outputReasoningVision

1M context$1.25/M in$2.5/M out

ANTHROPIC: Claude Opus 4.7

Tool callingStructured outputReasoningVision

1M context$5/M in$25/M out