Coding API AI Models - Pricing & Features

Coding API

claude-3-7-sonnet-20250219

Vision

Caching

Reasoning

Computer Use

Context Window

200K tokens

Max Output

64K tokens

Input

$3.00/M tokens

Output

$15.00/M tokens

At release, Anthropic billed it as its most intelligent model and the first hybrid reasoning model on the market, with toggleable extended thinking. It posts top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.
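The per-million-token rates listed above translate into a per-request cost with simple arithmetic. The following is a minimal sketch, assuming billing is linear in token count and ignoring any caching discounts; the function name `estimate_cost_usd` and the example token counts are illustrative and not part of the Coding API.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens, output_tokens, input_rate, output_rate):
    """Estimate the USD cost of one request.

    Rates are USD per million tokens, passed as strings so that
    Decimal can represent them exactly (no float rounding).
    """
    input_cost = Decimal(input_tokens) * Decimal(input_rate) / TOKENS_PER_MILLION
    output_cost = Decimal(output_tokens) * Decimal(output_rate) / TOKENS_PER_MILLION
    return input_cost + output_cost


# Listed rates for claude-3-7-sonnet-20250219: $3.00 input, $15.00 output per 1M tokens.
# The token counts below are hypothetical.
cost = estimate_cost_usd(10_000, 2_000, "3.00", "15.00")
print(f"${cost:.4f}")
```

The same function applies to any card on this page by substituting that card's Input and Output rates.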

Coding API

gemini-2.5-pro (us-south1)

Vision

Caching

Reasoning

Computer Use

Context Window

1.0M tokens

Max Output

66K tokens

Input

$1.25/M tokens

Output

$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Coding API

gemini-2.5-pro (europe-north1)

Vision

Caching

Reasoning

Computer Use

Context Window

1.0M tokens

Max Output

66K tokens

Input

$1.25/M tokens

Output

$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Coding API

claude-3-7-sonnet-20250219:max

Vision

Caching

Reasoning

Computer Use

Context Window

200K tokens

Max Output

64K tokens

Input

$3.00/M tokens

Output

$15.00/M tokens

At release, Anthropic billed it as its most intelligent model and the first hybrid reasoning model on the market, with toggleable extended thinking. It posts top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Coding API

gemini-2.5-pro (europe-west1)

Vision

Caching

Reasoning

Computer Use

Context Window

1.0M tokens

Max Output

66K tokens

Input

$1.25/M tokens

Output

$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Coding API

gemini-2.5-pro (europe-west8)

Vision

Caching

Reasoning

Computer Use

Context Window

1.0M tokens

Max Output

66K tokens

Input

$1.25/M tokens

Output

$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Coding API

claude-3-7-sonnet-20250219:high

Vision

Caching

Reasoning

Computer Use

Context Window

200K tokens

Max Output

64K tokens

Input

$3.00/M tokens

Output

$15.00/M tokens

At release, Anthropic billed it as its most intelligent model and the first hybrid reasoning model on the market, with toggleable extended thinking. It posts top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Coding API

claude-sonnet-4-20250514

Vision

Caching

Reasoning

Computer Use

Context Window

200K tokens

Max Output

64K tokens

Input

$3.00/M tokens

Output

$15.00/M tokens

Claude Sonnet 4 significantly improves on Sonnet 3.7's industry-leading capabilities, excelling in coding with a state-of-the-art 72.7% on SWE-bench. The model balances performance and efficiency for internal and external use cases, with enhanced steerability for greater control over implementations.

Coding API

gemini-2.5-pro (us-west1)

Vision

Caching

Reasoning

Computer Use

Context Window

1.0M tokens

Max Output

66K tokens

Input

$1.25/M tokens

Output

$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Coding API

gemini-2.5-pro (europe-west4)

Vision

Caching

Reasoning

Computer Use

Context Window

1.0M tokens

Max Output

66K tokens

Input

$1.25/M tokens

Output

$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Coding API

gemini-2.5-flash (us-central1)

Vision

Caching

Reasoning

Context Window

1.0M tokens

Max Output

66K tokens

Input

$0.30/M tokens

Output

$2.50/M tokens

Google's first hybrid reasoning model, with a 1M-token context window and thinking budgets. It is the most balanced Gemini model and is optimized for low-latency use cases.

Coding API

gemini-2.5-flash (us-south1)

Vision

Caching

Reasoning

Context Window

1.0M tokens

Max Output

66K tokens

Input

$0.30/M tokens

Output

$2.50/M tokens

Google's first hybrid reasoning model, with a 1M-token context window and thinking budgets. It is the most balanced Gemini model and is optimized for low-latency use cases.

Coding API

gemini-2.5-flash (europe-west1)

Vision

Caching

Reasoning

Context Window

1.0M tokens

Max Output

66K tokens

Input

$0.30/M tokens

Output

$2.50/M tokens

Google's first hybrid reasoning model, with a 1M-token context window and thinking budgets. It is the most balanced Gemini model and is optimized for low-latency use cases.

Coding API

claude-opus-4-20250514

Vision

Caching

Reasoning

Computer Use

Context Window

200K tokens

Max Output

32K tokens

Input

$15.00/M tokens

Output

$75.00/M tokens

Claude Opus 4 is Anthropic's most powerful model yet and the best coding model in the world, leading on SWE-bench (72.5%) and Terminal-bench (43.2%). It delivers sustained performance on long-running tasks that require focused effort and thousands of steps, with the ability to work continuously for several hours—dramatically outperforming all Sonnet models and significantly expanding what AI agents can accomplish.

Coding API

gemini-2.5-flash (us-east5)

Vision

Caching

Reasoning

Context Window

1.0M tokens

Max Output

66K tokens

Input

$0.30/M tokens

Output

$2.50/M tokens

Google's first hybrid reasoning model, with a 1M-token context window and thinking budgets. It is the most balanced Gemini model and is optimized for low-latency use cases.

Coding API

claude-3-7-sonnet-20250219:16384

Vision

Caching

Reasoning

Computer Use

Context Window

200K tokens

Max Output

64K tokens

Input

$3.00/M tokens

Output

$15.00/M tokens

At release, Anthropic billed it as its most intelligent model and the first hybrid reasoning model on the market, with toggleable extended thinking. It posts top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Coding API

claude-3-7-sonnet-20250219:64000

Vision

Caching

Reasoning

Computer Use

Context Window

200K tokens

Max Output

64K tokens

Input

$3.00/M tokens

Output

$15.00/M tokens

At release, Anthropic billed it as its most intelligent model and the first hybrid reasoning model on the market, with toggleable extended thinking. It posts top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Coding API

claude-3-7-sonnet-20250219:medium

Vision

Caching

Reasoning

Computer Use

Context Window

200K tokens

Max Output

64K tokens

Input

$3.00/M tokens

Output

$15.00/M tokens

At release, Anthropic billed it as its most intelligent model and the first hybrid reasoning model on the market, with toggleable extended thinking. It posts top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Coding API

gemini-2.5-pro (us-east5)

Vision

Caching

Reasoning

Computer Use

Context Window

1.0M tokens

Max Output

66K tokens

Input

$1.25/M tokens

Output

$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Coding API

gemini-2.5-flash

Vision

Caching

Reasoning

Context Window

1.0M tokens

Max Output

66K tokens

Input

$0.30/M tokens

Output

$2.50/M tokens

Google's first hybrid reasoning model, with a 1M-token context window and thinking budgets. It is the most balanced Gemini model and is optimized for low-latency use cases.

Coding API

gemini-2.5-flash (us-west1)

Vision

Caching

Reasoning

Context Window

1.0M tokens

Max Output

66K tokens

Input

$0.30/M tokens

Output

$2.50/M tokens

Google's first hybrid reasoning model, with a 1M-token context window and thinking budgets. It is the most balanced Gemini model and is optimized for low-latency use cases.

Coding API

gemini-2.5-flash (europe-central2)

Vision

Caching

Reasoning

Context Window

1.0M tokens

Max Output

66K tokens

Input

$0.30/M tokens

Output

$2.50/M tokens

Google's first hybrid reasoning model, with a 1M-token context window and thinking budgets. It is the most balanced Gemini model and is optimized for low-latency use cases.

Coding API

gemini-2.5-flash (europe-west4)

Vision

Caching

Reasoning

Context Window

1.0M tokens

Max Output

66K tokens

Input

$0.30/M tokens

Output

$2.50/M tokens

Google's first hybrid reasoning model, with a 1M-token context window and thinking budgets. It is the most balanced Gemini model and is optimized for low-latency use cases.

Coding API

claude-3-7-sonnet-20250219:1024

Vision

Caching

Reasoning

Computer Use

Context Window

200K tokens

Max Output

64K tokens

Input

$3.00/M tokens

Output

$15.00/M tokens

At release, Anthropic billed it as its most intelligent model and the first hybrid reasoning model on the market, with toggleable extended thinking. It posts top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Coding API

claude-3-7-sonnet-20250219:8192

Vision

Caching

Reasoning

Computer Use

Context Window

200K tokens

Max Output

64K tokens

Input

$3.00/M tokens

Output

$15.00/M tokens

At release, Anthropic billed it as its most intelligent model and the first hybrid reasoning model on the market, with toggleable extended thinking. It posts top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Coding API

gemini-2.5-pro

Vision

Caching

Reasoning

Computer Use

Context Window

1.0M tokens

Max Output

66K tokens

Input

$1.25/M tokens

Output

$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Coding API

gemini-2.5-pro (us-central1)

Vision

Caching

Reasoning

Computer Use

Context Window

1.0M tokens

Max Output

66K tokens

Input

$1.25/M tokens

Output

$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Coding API

gemini-2.5-pro (us-east1)

Vision

Caching

Reasoning

Computer Use

Context Window

1.0M tokens

Max Output

66K tokens

Input

$1.25/M tokens

Output

$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Coding API

gemini-2.5-pro (europe-central2)

Vision

Caching

Reasoning

Computer Use

Context Window

1.0M tokens

Max Output

66K tokens

Input

$1.25/M tokens

Output

$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Coding API

gemini-2.5-flash (europe-north1)

Vision

Caching

Reasoning

Context Window

1.0M tokens

Max Output

66K tokens

Input

$0.30/M tokens

Output

$2.50/M tokens

Google's first hybrid reasoning model, with a 1M-token context window and thinking budgets. It is the most balanced Gemini model and is optimized for low-latency use cases.

Coding API

claude-3-7-sonnet-20250219:low

Vision

Caching

Reasoning

Computer Use

Context Window

200K tokens

Max Output

64K tokens

Input

$3.00/M tokens

Output

$15.00/M tokens

At release, Anthropic billed it as its most intelligent model and the first hybrid reasoning model on the market, with toggleable extended thinking. It posts top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Coding API

gemini-2.5-flash (us-east1)

Vision

Caching

Reasoning

Context Window

1.0M tokens

Max Output

66K tokens

Input

$0.30/M tokens

Output

$2.50/M tokens

Google's first hybrid reasoning model, with a 1M-token context window and thinking budgets. It is the most balanced Gemini model and is optimized for low-latency use cases.

Coding API

gemini-2.5-flash (europe-west8)

Vision

Caching

Reasoning

Context Window

1.0M tokens

Max Output

66K tokens

Input

$0.30/M tokens

Output

$2.50/M tokens

Google's first hybrid reasoning model, with a 1M-token context window and thinking budgets. It is the most balanced Gemini model and is optimized for low-latency use cases.

All Coding API Models

claude-3-7-sonnet-20250219

gemini-2.5-pro (us-south1)

gemini-2.5-pro (europe-north1)

claude-3-7-sonnet-20250219:max

gemini-2.5-pro (europe-west1)

gemini-2.5-pro (europe-west8)

claude-3-7-sonnet-20250219:high

claude-sonnet-4-20250514

gemini-2.5-pro (us-west1)

gemini-2.5-pro (europe-west4)

gemini-2.5-flash (us-central1)

gemini-2.5-flash (us-south1)

gemini-2.5-flash (europe-west1)

claude-opus-4-20250514

gemini-2.5-flash (us-east5)

claude-3-7-sonnet-20250219:16384

claude-3-7-sonnet-20250219:64000

claude-3-7-sonnet-20250219:medium

gemini-2.5-pro (us-east5)

gemini-2.5-flash

gemini-2.5-flash (us-west1)

gemini-2.5-flash (europe-central2)

gemini-2.5-flash (europe-west4)

claude-3-7-sonnet-20250219:1024

claude-3-7-sonnet-20250219:8192

gemini-2.5-pro

gemini-2.5-pro (us-central1)

gemini-2.5-pro (us-east1)

gemini-2.5-pro (europe-central2)

gemini-2.5-flash (europe-north1)

claude-3-7-sonnet-20250219:low

gemini-2.5-flash (us-east1)

gemini-2.5-flash (europe-west8)
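The identifiers in this list follow two visible patterns: an optional region in parentheses (for example `gemini-2.5-pro (us-south1)`) and an optional suffix after a colon (for example `claude-3-7-sonnet-20250219:max`). The following is a minimal sketch for splitting an entry into those parts. The names `Listing` and `parse_listing` are hypothetical, and because this page does not state what the colon suffix means, the code treats it as an opaque variant label.

```python
import re
from typing import NamedTuple, Optional


class Listing(NamedTuple):
    model: str              # base model identifier, e.g. "gemini-2.5-pro"
    variant: Optional[str]  # text after ":", meaning not documented on this page
    region: Optional[str]   # text inside "(...)", e.g. "us-south1"


# model, then optional ":variant", then optional " (region)"
_PATTERN = re.compile(
    r"^(?P<model>[^:\s()]+)"
    r"(?::(?P<variant>[^\s()]+))?"
    r"(?:\s+\((?P<region>[^()]+)\))?$"
)


def parse_listing(entry: str) -> Listing:
    """Split one catalog entry into model, variant, and region."""
    match = _PATTERN.match(entry.strip())
    if match is None:
        raise ValueError(f"unrecognized listing entry: {entry!r}")
    return Listing(match["model"], match["variant"], match["region"])


for entry in (
    "gemini-2.5-pro (us-south1)",
    "claude-3-7-sonnet-20250219:max",
    "gemini-2.5-flash",
):
    print(parse_listing(entry))
```

Grouping the parsed entries by `model` gives a quick view of which regions and variants are listed for each base model.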
