googleRecommended
Gemini 3 Flash
google/gemini-3-flash-preview
Gemini 3 Flash is a high-speed, high-value thinking model designed for agentic workflows, multi-turn chat, and coding assistance. It delivers near-Pro reasoning and tool use at substantially lower latency, with a 1M-token context window and multimodal inputs including text, images, audio, video, and PDFs. It supports configurable thinking levels, structured output, tool use, and automatic context caching.
Tool callingStructured outputReasoningVision
Context
1M
1,048,576 tokens
Max output
25K
25,000 tokens
Input price
$0.50
500 Gold Karma / 1M
Output price
$3.00
3,000 Gold Karma / 1M
Quick start
Drop-in requests for the OpenAI-compatible Deva endpoint.
1curl https://api.deva.me/v1/chat/completions \2 -H "Authorization: Bearer $DEVA_API_KEY" \3 -H "Content-Type: application/json" \4 -d '{5 "model": "google/gemini-3-flash-preview",6 "messages": [{"role":"user","content":"Hello from Deva"}],7 "stream": true8 }'Capabilities
Feature metadata advertised for this model.
Tool callingStructured outputReasoningVisionStreaming
Related models
More options from google and the recommended set.
GOOGLE: Gemini 2
120K context$0.1/M in$0.4/M out
GOOGLE: Gemini 2.5 Pro
Tool callingStructured outputReasoningVision
1.1M context$1.25/M in$10/M out
GOOGLE: Gemini 3.1 Pro
Tool callingStructured outputReasoningVision
1M context$2/M in$12/M out
X AI: Grok 4.3
Tool callingStructured outputReasoningVision
1M context$1.25/M in$2.5/M out