Microsoft Azure AI

openai-responses/gpt-4.1-nano

For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT‑4.1 series. Despite its small size, it offers a 1 million token context window and scores 80.1% on MMLU, 50.3% on GPQA, and 9.8% on Aider polyglot coding, even higher than GPT‑4o mini. It’s ideal for tasks like classification or autocompletion.

👁 Vision · 🔧 Tool calling · ⚡ Caching

Pricing per 1M tokens

Input: $0.10

Output: $0.40

Cache write: $0.10

Cache read: $0.02
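To make the per-million rates concrete, here is a minimal sketch of a cost estimate for a single request. The function name `estimate_cost` and its parameters are hypothetical, and the sketch assumes each token is billed in exactly one of the four categories (for example, that tokens read from cache are not also charged at the input rate); actual billing rules may differ.

```python
# Prices in USD per 1M tokens, copied from the pricing list above.
PRICE_PER_MILLION = {
    "input": 0.10,
    "output": 0.40,
    "cache_write": 0.10,
    "cache_read": 0.02,
}


def estimate_cost(
    input_tokens: int = 0,
    output_tokens: int = 0,
    cache_write_tokens: int = 0,
    cache_read_tokens: int = 0,
) -> float:
    """Return an estimated cost in USD for one request.

    Assumption: every token falls into exactly one billing category.
    """
    counts = {
        "input": input_tokens,
        "output": output_tokens,
        "cache_write": cache_write_tokens,
        "cache_read": cache_read_tokens,
    }
    for name, count in counts.items():
        if count < 0:
            raise ValueError(f"{name} token count must be non-negative")
    # Each category costs (tokens / 1,000,000) * price per million.
    return sum(
        counts[name] / 1_000_000 * PRICE_PER_MILLION[name] for name in counts
    )


if __name__ == "__main__":
    # 200K uncached input tokens, 800K tokens read from cache, 50K output tokens.
    cost = estimate_cost(
        input_tokens=200_000,
        cache_read_tokens=800_000,
        output_tokens=50_000,
    )
    print(f"Estimated cost: ${cost:.4f}")
```

Under these assumptions, a prompt served mostly from cache costs noticeably less than the same prompt billed entirely at the input rate, since the cache read price is one fifth of the input price.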

Specifications

Context window: 1.0M tokens

Max output: 33K tokens

API type: chat

Added: Apr 14, 2025

Model ID: azure/openai-responses/gpt-4.1-nano
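The context window and max output figures can be turned into a simple pre-flight check. The sketch below is illustrative only: `fits_budget` is a hypothetical helper, the limits use the rounded values as listed (1.0M and 33K), and it assumes that prompt and output tokens share the same context window, which this page does not state.

```python
# Limits as listed in the specifications (rounded values, not exact).
CONTEXT_WINDOW_TOKENS = 1_000_000
MAX_OUTPUT_TOKENS = 33_000


def fits_budget(prompt_tokens: int, requested_output_tokens: int) -> bool:
    """Check whether a request stays within the listed limits.

    Assumption: prompt and output tokens both count against the
    context window.
    """
    if prompt_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # The requested completion may not exceed the output cap.
    if requested_output_tokens > MAX_OUTPUT_TOKENS:
        return False
    # Prompt plus completion must fit in the context window.
    return prompt_tokens + requested_output_tokens <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    print(fits_budget(900_000, 30_000))
    print(fits_budget(990_000, 20_000))
```

Token counts would come from whatever tokenizer the caller uses; the check itself only compares numbers.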

Privacy & data

Data retention: No

Used for training: No

Provider location: 🇺🇸 US / 🇪🇺 EU

Privacy policy: Microsoft Privacy Statement
