google, Recommended

Gemini 3 Flash

google/gemini-3-flash-preview

Gemini 3 Flash is a high-speed, high-value thinking model designed for agentic workflows, multi-turn chat, and coding assistance. It delivers near-Pro reasoning and tool use at substantially lower latency, with a 1M-token context window and multimodal inputs including text, images, audio, video, and PDFs. It supports configurable thinking levels, structured output, tool use, and automatic context caching.

Tool calling, Structured output, Reasoning, Vision

Context

1,048,576 tokens

Max output

25,000 tokens

Input price

$0.50

1 Gold Karma / 1M

Output price

$3.00

3 Gold Karma / 1M
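
As a rough illustration of how the listed rates translate into a per-request charge, the sketch below multiplies token counts by the dollar prices shown above. It assumes both prices are per 1M tokens and that billing is strictly linear, with no discount for cached context; neither assumption is confirmed on this page, and the function name is hypothetical.

```python
from decimal import Decimal

# Listed prices, assumed to be USD per 1M tokens.
INPUT_USD_PER_M = Decimal("0.50")
OUTPUT_USD_PER_M = Decimal("3.00")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes linear pricing and ignores any caching discount.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_M
        + Decimal(output_tokens) * OUTPUT_USD_PER_M
    )
    return total / TOKENS_PER_M
```

Under these assumptions, a request with 200,000 input tokens and 25,000 output tokens comes to $0.175.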

Quick start

Drop-in requests for the OpenAI-compatible Deva endpoint.

curl https://api.deva.me/v1/chat/completions \
  -H "Authorization: Bearer $DEVA_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "google/gemini-3-flash-preview",
    "messages": [{"role":"user","content":"Hello from Deva"}],
    "stream": true
  }'
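
For callers who prefer Python over curl, the same request might be sketched with the standard library alone. The URL, model ID, headers, and message come from the snippet above; the helper names are hypothetical, `stream` defaults to false here to keep response handling simple, and the response shape (`choices[0].message.content`) is an assumption based on the endpoint being described as OpenAI-compatible.

```python
import json
import urllib.request

API_URL = "https://api.deva.me/v1/chat/completions"
MODEL = "google/gemini-3-flash-preview"


def build_request(prompt: str, api_key: str, stream: bool = False) -> urllib.request.Request:
    """Build, but do not send, a chat completion request (hypothetical helper)."""
    payload = {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
        "stream": stream,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send_request(req: urllib.request.Request) -> str:
    """Send a non-streaming request and return the reply text.

    Assumes an OpenAI-style response body; this page does not confirm it.
    """
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]
```

A call would look like `send_request(build_request("Hello from Deva", os.environ["DEVA_API_KEY"]))`, with `os` imported. Keeping construction separate from sending lets the payload be inspected without touching the network.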

Capabilities

Feature metadata advertised for this model.

Tool calling, Structured output, Reasoning, Vision, Streaming

Related models

More options from google and the recommended set.

GOOGLE: Gemini 2

120K context, $0.1/M in, $0.4/M out

GOOGLE: Gemini 2.5 Pro

Tool calling, Structured output, Reasoning, Vision

1.1M context, $1.25/M in, $10/M out

GOOGLE: Gemini 3.1 Pro

Tool calling, Structured output, Reasoning, Vision

1M context, $2/M in, $12/M out

GOOGLE: Gemini 3.5 Flash-Lite

Reasoning

1M context, $0.3/M in, $2.5/M out