Tools
Compare every agent-payable Deva resource by category, provider, capability, endpoint, and Gold Karma price.
X AI: Grok 4.3
LLM modelsGrok 4.3 is a reasoning model from xAI that accepts text and image inputs with text output, suited to agentic workflows, instruction-following, and applications requiring high factual accuracy. Reasoning effort is configurable (none, low, medium, or high), and a 1M-token context window with no output limit makes it well-suited to long-document analysis, deep research, and multi-step agentic tasks.
Platform fee included
by x-aiLLM models1M context1,250 Gold Karma / 1M input2,500 Gold Karma / 1M outputtext, image input
ANTHROPIC: Claude Opus 4.7
LLM modelsClaude Opus 4.7 is the next generation of Anthropic's Opus family, built for long-running, asynchronous agents. Building on Opus 4.6's coding and agentic strengths, it delivers stronger performance on complex, multi-step tasks such as large codebases, multi-stage debugging, and end-to-end project orchestration, plus improved knowledge work from document drafting to data analysis, maintaining coherence across very long outputs and extended sessions.
Platform fee included
by anthropicLLM models1M context5,000 Gold Karma / 1M input25,000 Gold Karma / 1M outputtext, image, file input
ANTHROPIC: Claude Sonnet 4.6
LLM modelsClaude Sonnet 4.6 is Anthropic's most capable Sonnet-class model yet, with frontier performance across coding, agents, and professional work. It excels at iterative development, complex codebase navigation, end-to-end project management with memory, polished document creation, and confident computer use for web QA and workflow automation.
Platform fee included
by anthropicLLM models1M context3,000 Gold Karma / 1M input15,000 Gold Karma / 1M outputtext, image, file input
OPENAI: GPT 5.5
LLM modelsGPT-5.5 is OpenAI's frontier model for complex professional workloads, building on GPT-5.4 with stronger reasoning, higher reliability, and improved token efficiency on hard tasks. It pairs a 1M+ token context window with text and image inputs, enabling large-scale reasoning, coding, and multimodal workflows in a single system.
Platform fee included
by openaiLLM models1.1M context5,000 Gold Karma / 1M input30,000 Gold Karma / 1M outputtext, image, file input
GOOGLE: Gemini 3 Flash
LLM modelsGemini 3 Flash is a high-speed, high-value thinking model designed for agentic workflows, multi-turn chat, and coding assistance. It delivers near-Pro reasoning and tool use at substantially lower latency, with a 1M-token context window and multimodal inputs including text, images, audio, video, and PDFs. It supports configurable thinking levels, structured output, tool use, and automatic context caching.
Platform fee included
by googleLLM models1M context500 Gold Karma / 1M input3,000 Gold Karma / 1M outputtext, image, audio, video, file input
GOOGLE: Gemini 3.1 Pro
LLM modelsGemini 3.1 Pro is Google's frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage. Building on the multimodal foundation of the Gemini 3 series, it combines high-precision reasoning across text, image, video, audio, and code with a 1M-token context window, and introduces a medium thinking level to balance cost, speed, and performance. It excels at agentic coding, structured planning, and multimodal analysis.
Platform fee included
by googleLLM models1M context2,000 Gold Karma / 1M input12,000 Gold Karma / 1M outputtext, image, audio, video, file input
DEEPSEEK: DeepSeek V4 Pro
LLM modelsDeepSeek V4 Pro is a large-scale Mixture-of-Experts model with 1.6T total parameters (49B activated) and a 1M-token context window, built for advanced reasoning, coding, and long-horizon agent workflows. It uses a hybrid attention system for efficient long-context processing and supports high and xhigh reasoning efforts, suiting full-codebase analysis, multi-step automation, and large-scale synthesis.
Platform fee included
by deepseekLLM models1M context435 Gold Karma / 1M input870 Gold Karma / 1M outputtext input
DEEPSEEK: DeepSeek V4 Flash
LLM modelsDeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model with 284B total parameters (13B activated) and a 1M-token context window, designed for fast inference and high-throughput workloads while maintaining strong reasoning and coding performance. Hybrid attention enables efficient long-context processing, with high and xhigh reasoning efforts, making it ideal for coding assistants, chat systems, and agent workflows.
Platform fee included
by deepseekLLM models1M context98 Gold Karma / 1M input197 Gold Karma / 1M outputtext input
MOONSHOTAI: Kimi K2.6
LLM modelsKimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon coding, coding-driven UI/UX generation, and multi-agent orchestration. It handles complex end-to-end coding across Python, Rust, and Go and turns prompts and visual inputs into production-ready interfaces, with an agent-swarm architecture that scales to hundreds of parallel sub-agents for autonomous task decomposition.
Platform fee included
by moonshotaiLLM models262K context684 Gold Karma / 1M input3,420 Gold Karma / 1M outputtext, image input
Z AI: GLM 5.1
LLM modelsGLM-5.1 delivers a major leap in coding capability, with especially strong gains on long-horizon tasks. Rather than minute-level interactions, it can work independently and continuously on a single task for more than eight hours, autonomously planning, executing, and refining its work to deliver complete, engineering-grade results.
Platform fee included
by z-aiLLM models203K context980 Gold Karma / 1M input3,080 Gold Karma / 1M outputtext input
OPENAI: GPT 4o
LLM modelsGPT-4o ('o' for 'omni') is OpenAI's multimodal model supporting text and image inputs with text output. It matches GPT-4 Turbo's intelligence while being roughly twice as fast and more cost-effective, with strong creative writing, file understanding, multilingual performance, and vision capabilities.
Platform fee included
by openaiLLM models120K context2,500 Gold Karma / 1M input10,000 Gold Karma / 1M outputtext, image, file input
OPENAI: GPT 4
LLM modelsGPT-4 Turbo is OpenAI's high-capability GPT-4 model with vision, supporting JSON mode and function calling on vision requests. Training data extends through December 2023.
Platform fee included
by openaiLLM models120K context10,000 Gold Karma / 1M input30,000 Gold Karma / 1M outputtext, image input
OPENAI: GPT 3
LLM modelsGPT-3.5 Turbo is OpenAI's fast, cost-effective model for chat and traditional completion tasks. It understands and generates natural language and code, with training data through September 2021.
Platform fee included
by openaiLLM models16K context500 Gold Karma / 1M input1,500 Gold Karma / 1M outputtext input
OPENAI: GPT 5 Mini
LLM modelsGPT-5 Mini is a compact version of GPT-5 for lighter-weight reasoning tasks. It keeps GPT-5's instruction-following and safety tuning while offering lower latency and cost, and succeeds OpenAI's o4-mini.
Platform fee included
by openaiLLM models400K context250 Gold Karma / 1M input2,000 Gold Karma / 1M outputtext, image, file input
OPENAI: GPT 5
LLM modelsGPT-5 is one of OpenAI's most advanced models, optimized for complex tasks that require step-by-step reasoning, instruction following, and accuracy in high-stakes use cases. It delivers major gains in code quality and reasoning, with reduced hallucination and sycophancy and strong performance on coding, writing, and health-related tasks.
Platform fee included
by openaiLLM models400K context1,250 Gold Karma / 1M input10,000 Gold Karma / 1M outputtext, image, file input
OPENAI: GPT 5.1
LLM modelsGPT-5.1 is a frontier-grade model in the GPT-5 series with stronger general-purpose reasoning, improved instruction adherence, and a more natural conversational style. Its adaptive reasoning allocates compute dynamically, responding quickly to simple queries and going deeper on complex ones, delivering consistent gains across math, coding, and structured analysis with reliable tool use.
Platform fee included
by openaiLLM models400K context1,250 Gold Karma / 1M input10,000 Gold Karma / 1M outputtext, image, file input
ANTHROPIC: Claude 3 Haiku
LLM modelsClaude 3 Haiku is Anthropic's fastest and most compact Claude 3 model, built for near-instant, targeted responses with multimodal input support.
Platform fee included
by anthropicLLM models100K context250 Gold Karma / 1M input1,250 Gold Karma / 1M outputtext, image input
ANTHROPIC: Claude Sonnet 4.5
LLM modelsClaude Sonnet 4.5 is Anthropic's most advanced Sonnet model, optimized for real-world agents and coding. It posts state-of-the-art results on coding benchmarks such as SWE-bench Verified and is built for extended autonomous operation, with improved tool orchestration, speculative parallel execution, and efficient context and memory management. It suits software engineering, cybersecurity, financial analysis, and research agents.
Platform fee included
by anthropicLLM models1M context3,000 Gold Karma / 1M input15,000 Gold Karma / 1M outputtext, image, file input
ANTHROPIC: Claude Opus 4.5
LLM modelsClaude Opus 4.5 is Anthropic's frontier reasoning model, optimized for complex software engineering, agentic workflows, and long-horizon computer use. It offers strong multimodal capabilities, competitive coding and reasoning performance, and improved robustness to prompt injection, with a verbosity control (low, medium, or high) to trade off speed, depth, and token usage. It supports advanced tool use, extended context management, and coordinated multi-agent setups for autonomous research, debugging, and multi-step planning.
Platform fee included
by anthropicLLM models200K context5,000 Gold Karma / 1M input25,000 Gold Karma / 1M outputtext, image, file input
GOOGLE: Gemini 2
LLM modelsPlatform fee included
by googleLLM models120K context100 Gold Karma / 1M input400 Gold Karma / 1M output
GOOGLE: Gemini 2.5 Pro
LLM modelsGemini 2.5 Pro is Google's state-of-the-art model for advanced reasoning, coding, mathematics, and scientific tasks. Its 'thinking' capabilities let it reason through responses with greater accuracy and nuanced context handling, achieving top-tier results across benchmarks, including a first-place position on the LMArena leaderboard.
Platform fee included
by googleLLM models1.1M context1,250 Gold Karma / 1M input10,000 Gold Karma / 1M outputtext, image, audio, video, file input
DEEPSEEK: DeepSeek Chat v3.1
LLM modelsDeepSeek-V3.1 is a large hybrid reasoning model (671B parameters, 37B active) supporting both thinking and non-thinking modes. It extends DeepSeek-V3 with two-phase long-context training up to 128K tokens and FP8 inference, improving tool use, code generation, and reasoning efficiency to a level comparable with DeepSeek-R1 on hard benchmarks while responding faster. It supports structured tool calling and code and search agents for research, coding, and agentic workflows.
Platform fee included
by deepseekLLM models164K context200 Gold Karma / 1M input800 Gold Karma / 1M outputtext input
TOGETHER: Llama 3.1 8B Instruct
LLM modelsLlama 3.1 8B Instruct is the fast, efficient 8B instruction-tuned model in Meta's Llama 3.1 family, with strong performance against leading closed-source models in human evaluations.
Platform fee included
by togetherLLM models130K context1,800 Gold Karma / 1M input1,800 Gold Karma / 1M outputtext input
TOGETHER: Llama 3.1 70B Instruct
LLM modelsLlama 3.1 70B Instruct is the 70B instruction-tuned model in Meta's Llama 3.1 family, optimized for high-quality dialogue and competitive with leading closed-source models in human evaluations.
Platform fee included
by togetherLLM models130K context880 Gold Karma / 1M input880 Gold Karma / 1M outputtext input
TOGETHER: Llama 3.1 405B Instruct
LLM modelsPlatform fee included
by togetherLLM models130K context3,500 Gold Karma / 1M input3,500 Gold Karma / 1M output
TOGETHER: Mistral 7B Instruct
LLM modelsPlatform fee included
by togetherLLM models32K context200 Gold Karma / 1M input200 Gold Karma / 1M output
TOGETHER: Mixtral 8x7B Instruct
LLM modelsPlatform fee included
by togetherLLM models32K context600 Gold Karma / 1M input600 Gold Karma / 1M output
ANTHROPIC: Claude 3 Opus
LLM modelsDeprecatedPlatform fee included
by anthropicLLM models100K context13,000 Gold Karma / 1M input37,000 Gold Karma / 1M output
ANTHROPIC: Claude 3 Sonnet
LLM modelsDeprecatedPlatform fee included
by anthropicLLM models100K context4,500 Gold Karma / 1M input15,000 Gold Karma / 1M output
GOOGLE: Gemini 1.5
LLM modelsDeprecatedPlatform fee included
by googleLLM models120K context175 Gold Karma / 1M input175 Gold Karma / 1M output
TOGETHER: Llama 2 Chat
LLM modelsDeprecatedPlatform fee included
by togetherLLM models4K context700 Gold Karma / 1M input700 Gold Karma / 1M output
Z AI: GLM 4.6
LLM modelsGLM-4.6 expands the context window to 200K tokens and delivers higher coding benchmark scores and stronger real-world performance in coding tools, including more visually polished front-end generation. It improves reasoning with tool use during inference, performs better as a tool-using and search agent within agent frameworks, and aligns more naturally in writing and role-play.
Platform fee included
by z-aiLLM models128K context390 Gold Karma / 1M input1,900 Gold Karma / 1M outputtext input
QWEN: Qwen3 Coder
LLM modelsQwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts code generation model from the Qwen team, optimized for agentic coding tasks such as function calling, tool use, and long-context reasoning over repositories. It has 480 billion total parameters, with 35 billion active per forward pass (8 of 160 experts).
Platform fee included
by qwenLLM models128K context220 Gold Karma / 1M input1,800 Gold Karma / 1M outputtext input
MOONSHOTAI: Kimi K2
LLM modelsKimi K2 Instruct is a large-scale Mixture-of-Experts model from Moonshot AI with 1 trillion total parameters (32 billion active per forward pass), optimized for agentic capabilities including advanced tool use, reasoning, and code synthesis. It excels across coding (LiveCodeBench, SWE-bench), reasoning (ZebraLogic, GPQA), and tool-use benchmarks, and supports long-context inference up to 128K tokens.
Platform fee included
by moonshotaiLLM models128K context570 Gold Karma / 1M input2,300 Gold Karma / 1M outputtext input