OpenAI Inc.

gpt-4.1-nano-2025-04-14

For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT‑4.1 series. It delivers exceptional performance at a small size with its 1-million-token context window, and scores 80.1% on MMLU, 50.3% on GPQA, and 9.8% on Aider polyglot coding, even higher than GPT‑4o mini. It’s ideal for tasks like classification or autocompletion.

Vision Support

Caching Support

Pricing

Input tokens per million

$0.10

Output tokens per million

$0.40

Caching Pricing

Cache write per million

$0.10

Cache read per million

$0.02

Technical Specifications

Context Window

1.0M tokens

Max Output Tokens

33K tokens
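
The two limits above can be turned into a simple pre-flight check before sending a request. The sketch below is illustrative only: the function name is hypothetical, the constants use the rounded figures from this listing (the exact limits may differ slightly), and it assumes that prompt and output tokens share the same context window.

```python
# Rounded limits taken from the listing above; exact values may differ.
CONTEXT_WINDOW_TOKENS = 1_000_000
MAX_OUTPUT_TOKENS = 33_000


def fits_budget(prompt_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if a request stays within both listed limits.

    Assumption: the prompt and the generated output together must fit
    inside the context window.
    """
    if prompt_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if requested_output_tokens > MAX_OUTPUT_TOKENS:
        return False
    return prompt_tokens + requested_output_tokens <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    print(fits_budget(900_000, 33_000))  # within both limits
    print(fits_budget(990_000, 33_000))  # prompt plus output exceeds the window
```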

Global Availability

Last Updated

N/A

Provider

OpenAI Inc.

Location

🇺🇸 US

Privacy & Data

Data Retention

Used for Training
