Coding API

Specialized AI models for coding tasks.

📍 🌍 Global•33 models available•Visit Website →
33
Available Models
$1.88
Avg Input Price/M
$0.3
Cheapest Model
coding/gemini-2.5-flash@us-east5
$15.00
Most Expensive
coding/claude-opus-4-20250514

Features Overview

33
Vision Support
33
Advanced Reasoning
33
Caching Support
22
Computer Use

Privacy & Data Policy

Data Retention

No data retention

Location

🌍 Global

All Coding API Models

View All Providers →
Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Claude Sonnet 4 significantly improves on Sonnet 3.7's industry-leading capabilities, excelling in coding with a state-of-the-art 72.7% on SWE-bench. The model balances performance and efficiency for internal and external use cases, with enhanced steerability for greater control over implementations.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
32K tokens
Input
$15.00/M tokens
Output
$75.00/M tokens

Claude Opus 4 is Anthropic's most powerful model yet and the best coding model in the world, leading on SWE-bench (72.5%) and Terminal-bench (43.2%). It delivers sustained performance on long-running tasks that require focused effort and thousands of steps, with the ability to work continuously for several hours—dramatically outperforming all Sonnet models and significantly expanding what AI agents can accomplish.

Coding API

gemini-2.5-pro

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Ready to use Coding API models?

Access all Coding API models through Requesty's unified API with intelligent routing, caching, and cost optimization.

Coding API AI Models - Pricing & Features | Requesty