Tools

Grok 4.3 is a reasoning model from xAI that accepts text and image inputs with text output, suited to agentic workflows, instruction-following, and applications requiring high factual accuracy. Reasoning effort is configurable (none, low, medium, or high), and a 1M-token context window with no output limit makes it well-suited to long-document analysis, deep research, and multi-step agentic tasks.

Platform fee included

by x-aiLLM models1M context1 Gold Karma / 1M input3 Gold Karma / 1M outputtext, image input

ANTHROPIC: Claude Opus 4.7

Claude Opus 4.7 is the next generation of Anthropic's Opus family, built for long-running, asynchronous agents. Building on Opus 4.6's coding and agentic strengths, it delivers stronger performance on complex, multi-step tasks such as large codebases, multi-stage debugging, and end-to-end project orchestration, plus improved knowledge work from document drafting to data analysis, maintaining coherence across very long outputs and extended sessions.

Platform fee included

by anthropicLLM models1M context5 Gold Karma / 1M input25 Gold Karma / 1M outputtext, image, file input

ANTHROPIC: Claude Sonnet 4.6

Claude Sonnet 4.6 is Anthropic's most capable Sonnet-class model yet, with frontier performance across coding, agents, and professional work. It excels at iterative development, complex codebase navigation, end-to-end project management with memory, polished document creation, and confident computer use for web QA and workflow automation.

Platform fee included

by anthropicLLM models1M context3 Gold Karma / 1M input15 Gold Karma / 1M outputtext, image, file input

OPENAI: GPT 5.5

GPT-5.5 is OpenAI's frontier model for complex professional workloads, building on GPT-5.4 with stronger reasoning, higher reliability, and improved token efficiency on hard tasks. It pairs a 1M+ token context window with text and image inputs, enabling large-scale reasoning, coding, and multimodal workflows in a single system.

Platform fee included

by openaiLLM models1.1M context5 Gold Karma / 1M input30 Gold Karma / 1M outputtext, image, file input

GOOGLE: Gemini 3 Flash

Gemini 3 Flash is a high-speed, high-value thinking model designed for agentic workflows, multi-turn chat, and coding assistance. It delivers near-Pro reasoning and tool use at substantially lower latency, with a 1M-token context window and multimodal inputs including text, images, audio, video, and PDFs. It supports configurable thinking levels, structured output, tool use, and automatic context caching.

Platform fee included

by googleLLM models1M context1 Gold Karma / 1M input3 Gold Karma / 1M outputtext, image, audio, video, file input

GOOGLE: Gemini 3.1 Pro

Gemini 3.1 Pro is Google's frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage. Building on the multimodal foundation of the Gemini 3 series, it combines high-precision reasoning across text, image, video, audio, and code with a 1M-token context window, and introduces a medium thinking level to balance cost, speed, and performance. It excels at agentic coding, structured planning, and multimodal analysis.

Platform fee included

by googleLLM models1M context2 Gold Karma / 1M input12 Gold Karma / 1M outputtext, image, audio, video, file input

DEEPSEEK: DeepSeek V4 Pro

DeepSeek V4 Pro is a large-scale Mixture-of-Experts model with 1.6T total parameters (49B activated) and a 1M-token context window, built for advanced reasoning, coding, and long-horizon agent workflows. It uses a hybrid attention system for efficient long-context processing and supports high and xhigh reasoning efforts, suiting full-codebase analysis, multi-step automation, and large-scale synthesis.

Platform fee included

by deepseekLLM models1M context0 Gold Karma / 1M input1 Gold Karma / 1M outputtext input

DEEPSEEK: DeepSeek V4 Flash

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model with 284B total parameters (13B activated) and a 1M-token context window, designed for fast inference and high-throughput workloads while maintaining strong reasoning and coding performance. Hybrid attention enables efficient long-context processing, with high and xhigh reasoning efforts, making it ideal for coding assistants, chat systems, and agent workflows.

Platform fee included

by deepseekLLM models1M context0 Gold Karma / 1M input0 Gold Karma / 1M outputtext input

MOONSHOTAI: Kimi K2.6

Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon coding, coding-driven UI/UX generation, and multi-agent orchestration. It handles complex end-to-end coding across Python, Rust, and Go and turns prompts and visual inputs into production-ready interfaces, with an agent-swarm architecture that scales to hundreds of parallel sub-agents for autonomous task decomposition.

Platform fee included

by moonshotaiLLM models262K context1 Gold Karma / 1M input3 Gold Karma / 1M outputtext, image input

Z AI: GLM 5.1

GLM-5.1 delivers a major leap in coding capability, with especially strong gains on long-horizon tasks. Rather than minute-level interactions, it can work independently and continuously on a single task for more than eight hours, autonomously planning, executing, and refining its work to deliver complete, engineering-grade results.

Platform fee included

by z-aiLLM models203K context1 Gold Karma / 1M input3 Gold Karma / 1M outputtext input

Z AI: GLM 5.2

Platform fee included

by z-aiLLM models1M context1 Gold Karma / 1M input4 Gold Karma / 1M output

ANTHROPIC: Claude Sonnet 5

Platform fee included

by anthropicLLM models1M context2 Gold Karma / 1M input10 Gold Karma / 1M output

ANTHROPIC: Claude Fable 5

Platform fee included

by anthropicLLM models1M context10 Gold Karma / 1M input50 Gold Karma / 1M output

MOONSHOTAI: Kimi K3

Platform fee included

by moonshotaiLLM models1M context3 Gold Karma / 1M input15 Gold Karma / 1M output

GOOGLE: Gemini 3.5 Flash-Lite

Platform fee included

by googleLLM models1M context0 Gold Karma / 1M input3 Gold Karma / 1M output

GOOGLE: Gemini 3.6 Flash

Platform fee included

by googleLLM models1M context2 Gold Karma / 1M input8 Gold Karma / 1M output

ANTHROPIC: Claude Opus 5

Platform fee included

by anthropicLLM models1M context5 Gold Karma / 1M input25 Gold Karma / 1M output

ANTHROPIC: Claude Opus 5 (Fast)

Platform fee included

by anthropicLLM models1M context10 Gold Karma / 1M input50 Gold Karma / 1M output

QWEN: Qwen3.7 Flash

Platform fee included

by qwenLLM models1M context0 Gold Karma / 1M input0 Gold Karma / 1M output

OPENAI: GPT 4o

Tool callingStructured outputVision

GPT-4o ('o' for 'omni') is OpenAI's multimodal model supporting text and image inputs with text output. It matches GPT-4 Turbo's intelligence while being roughly twice as fast and more cost-effective, with strong creative writing, file understanding, multilingual performance, and vision capabilities.

Platform fee included

by openaiLLM models120K context3 Gold Karma / 1M input10 Gold Karma / 1M outputtext, image, file input

OPENAI: GPT 4

Tool callingStructured outputVision

GPT-4 Turbo is OpenAI's high-capability GPT-4 model with vision, supporting JSON mode and function calling on vision requests. Training data extends through December 2023.

Platform fee included

by openaiLLM models120K context10 Gold Karma / 1M input30 Gold Karma / 1M outputtext, image input

OPENAI: GPT 3

GPT-3.5 Turbo is OpenAI's fast, cost-effective model for chat and traditional completion tasks. It understands and generates natural language and code, with training data through September 2021.

Platform fee included

by openaiLLM models16K context1 Gold Karma / 1M input2 Gold Karma / 1M outputtext input

OPENAI: GPT 5 Mini

GPT-5 Mini is a compact version of GPT-5 for lighter-weight reasoning tasks. It keeps GPT-5's instruction-following and safety tuning while offering lower latency and cost, and succeeds OpenAI's o4-mini.

Platform fee included

by openaiLLM models400K context0 Gold Karma / 1M input2 Gold Karma / 1M outputtext, image, file input

OPENAI: GPT 5

GPT-5 is one of OpenAI's most advanced models, optimized for complex tasks that require step-by-step reasoning, instruction following, and accuracy in high-stakes use cases. It delivers major gains in code quality and reasoning, with reduced hallucination and sycophancy and strong performance on coding, writing, and health-related tasks.

Platform fee included

by openaiLLM models400K context1 Gold Karma / 1M input10 Gold Karma / 1M outputtext, image, file input

OPENAI: GPT 5.1

GPT-5.1 is a frontier-grade model in the GPT-5 series with stronger general-purpose reasoning, improved instruction adherence, and a more natural conversational style. Its adaptive reasoning allocates compute dynamically, responding quickly to simple queries and going deeper on complex ones, delivering consistent gains across math, coding, and structured analysis with reliable tool use.

Platform fee included

by openaiLLM models400K context1 Gold Karma / 1M input10 Gold Karma / 1M outputtext, image, file input

ANTHROPIC: Claude 3 Haiku

Tool callingVision

Claude 3 Haiku is Anthropic's fastest and most compact Claude 3 model, built for near-instant, targeted responses with multimodal input support.

Platform fee included

by anthropicLLM models100K context0 Gold Karma / 1M input1 Gold Karma / 1M outputtext, image input

ANTHROPIC: Claude Sonnet 4.5

Claude Sonnet 4.5 is Anthropic's most advanced Sonnet model, optimized for real-world agents and coding. It posts state-of-the-art results on coding benchmarks such as SWE-bench Verified and is built for extended autonomous operation, with improved tool orchestration, speculative parallel execution, and efficient context and memory management. It suits software engineering, cybersecurity, financial analysis, and research agents.

Platform fee included

by anthropicLLM models1M context3 Gold Karma / 1M input15 Gold Karma / 1M outputtext, image, file input

ANTHROPIC: Claude Opus 4.5

Claude Opus 4.5 is Anthropic's frontier reasoning model, optimized for complex software engineering, agentic workflows, and long-horizon computer use. It offers strong multimodal capabilities, competitive coding and reasoning performance, and improved robustness to prompt injection, with a verbosity control (low, medium, or high) to trade off speed, depth, and token usage. It supports advanced tool use, extended context management, and coordinated multi-agent setups for autonomous research, debugging, and multi-step planning.

Platform fee included

by anthropicLLM models200K context5 Gold Karma / 1M input25 Gold Karma / 1M outputtext, image, file input

GOOGLE: Gemini 2

Platform fee included

by googleLLM models120K context0 Gold Karma / 1M input0 Gold Karma / 1M output

GOOGLE: Gemini 2.5 Pro

Gemini 2.5 Pro is Google's state-of-the-art model for advanced reasoning, coding, mathematics, and scientific tasks. Its 'thinking' capabilities let it reason through responses with greater accuracy and nuanced context handling, achieving top-tier results across benchmarks, including a first-place position on the LMArena leaderboard.

Platform fee included

by googleLLM models1.1M context1 Gold Karma / 1M input10 Gold Karma / 1M outputtext, image, audio, video, file input

DEEPSEEK: DeepSeek Chat v3.1

DeepSeek-V3.1 is a large hybrid reasoning model (671B parameters, 37B active) supporting both thinking and non-thinking modes. It extends DeepSeek-V3 with two-phase long-context training up to 128K tokens and FP8 inference, improving tool use, code generation, and reasoning efficiency to a level comparable with DeepSeek-R1 on hard benchmarks while responding faster. It supports structured tool calling and code and search agents for research, coding, and agentic workflows.

Platform fee included

by deepseekLLM models164K context0 Gold Karma / 1M input1 Gold Karma / 1M outputtext input

TOGETHER: Llama 3.1 8B Instruct

Llama 3.1 8B Instruct is the fast, efficient 8B instruction-tuned model in Meta's Llama 3.1 family, with strong performance against leading closed-source models in human evaluations.

Platform fee included

by togetherLLM models130K context2 Gold Karma / 1M input2 Gold Karma / 1M outputtext input

TOGETHER: Llama 3.1 70B Instruct

Llama 3.1 70B Instruct is the 70B instruction-tuned model in Meta's Llama 3.1 family, optimized for high-quality dialogue and competitive with leading closed-source models in human evaluations.

Platform fee included

by togetherLLM models130K context1 Gold Karma / 1M input1 Gold Karma / 1M outputtext input

TOGETHER: Llama 3.1 405B Instruct

Platform fee included

by togetherLLM models130K context4 Gold Karma / 1M input4 Gold Karma / 1M output

TOGETHER: Mistral 7B Instruct

Platform fee included

by togetherLLM models32K context0 Gold Karma / 1M input0 Gold Karma / 1M output

TOGETHER: Mixtral 8x7B Instruct

Platform fee included

by togetherLLM models32K context1 Gold Karma / 1M input1 Gold Karma / 1M output

ANTHROPIC: Claude 3 Opus

Platform fee included

by anthropicLLM models100K context13 Gold Karma / 1M input37 Gold Karma / 1M output

ANTHROPIC: Claude 3 Sonnet

Platform fee included

by anthropicLLM models100K context5 Gold Karma / 1M input15 Gold Karma / 1M output

GOOGLE: Gemini 1.5

Platform fee included

by googleLLM models120K context0 Gold Karma / 1M input0 Gold Karma / 1M output

TOGETHER: Llama 2 Chat

Platform fee included

by togetherLLM models4K context1 Gold Karma / 1M input1 Gold Karma / 1M output

Z AI: GLM 4.6

GLM-4.6 expands the context window to 200K tokens and delivers higher coding benchmark scores and stronger real-world performance in coding tools, including more visually polished front-end generation. It improves reasoning with tool use during inference, performs better as a tool-using and search agent within agent frameworks, and aligns more naturally in writing and role-play.

Platform fee included

by z-aiLLM models128K context0 Gold Karma / 1M input2 Gold Karma / 1M outputtext input

QWEN: Qwen3 Coder

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts code generation model from the Qwen team, optimized for agentic coding tasks such as function calling, tool use, and long-context reasoning over repositories. It has 480 billion total parameters, with 35 billion active per forward pass (8 of 160 experts).

Platform fee included

by qwenLLM models128K context0 Gold Karma / 1M input2 Gold Karma / 1M outputtext input

MOONSHOTAI: Kimi K2

Tool calling

Kimi K2 Instruct is a large-scale Mixture-of-Experts model from Moonshot AI with 1 trillion total parameters (32 billion active per forward pass), optimized for agentic capabilities including advanced tool use, reasoning, and code synthesis. It excels across coding (LiveCodeBench, SWE-bench), reasoning (ZebraLogic, GPQA), and tool-use benchmarks, and supports long-context inference up to 128K tokens.

Platform fee included

by moonshotaiLLM models128K context1 Gold Karma / 1M input2 Gold Karma / 1M outputtext input

ANTHROPIC: Claude Opus 4.8