Microsoft Azure AI
openai-responses/gpt-4.1-nano
For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1-million-token context window, and scores 80.1% on MMLU, 50.3% on GPQA, and 9.8% on Aider polyglot coding – even higher than GPT‑4o mini. It’s ideal for tasks like classification or autocompletion.
👁 Vision · 🔧 Tool calling · ⚡ Caching
Pricing per 1M tokens
Input: $0.10
Output: $0.40
Cache write: $0.10
Cache read: $0.02
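As a rough illustration of how the per-1M-token rates above turn into a request cost, here is a minimal Python sketch. The function name `estimate_cost_usd` and its parameters are hypothetical, and the sketch assumes each token is billed in exactly one of the four categories; actual billing rules for cached tokens may differ.

```python
from decimal import Decimal

# Rates in USD per 1M tokens, copied from the pricing list above.
PRICE_PER_1M_USD = {
    "input": Decimal("0.10"),
    "output": Decimal("0.40"),
    "cache_write": Decimal("0.10"),
    "cache_read": Decimal("0.02"),
}


def estimate_cost_usd(input_tokens=0, output_tokens=0,
                      cache_write_tokens=0, cache_read_tokens=0):
    """Estimate cost in USD (hypothetical helper, not a provider API).

    Assumption: every token is counted in exactly one category.
    """
    counts = {
        "input": input_tokens,
        "output": output_tokens,
        "cache_write": cache_write_tokens,
        "cache_read": cache_read_tokens,
    }
    total = sum(
        (PRICE_PER_1M_USD[kind] * n for kind, n in counts.items()),
        Decimal(0),
    )
    return total / Decimal(1_000_000)


if __name__ == "__main__":
    # 200K uncached input tokens, 800K cache-read tokens, 50K output tokens.
    cost = estimate_cost_usd(input_tokens=200_000,
                             output_tokens=50_000,
                             cache_read_tokens=800_000)
    print(f"${cost}")
```

`Decimal` is used instead of floats so that small per-token amounts add up without binary rounding noise.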
Specifications
Context window: 1.0M tokens
Max output: 33K tokens
API type: chat
Added: Apr 14, 2025
Model ID: azure/openai-responses/gpt-4.1-nano
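The context window and max output figures above suggest a simple pre-flight check before sending a request. The following Python sketch is illustrative only: `clamp_output_budget` is a hypothetical helper, the constants are the rounded values shown in this listing (not exact limits), and it assumes prompt and output tokens share the same context window.

```python
# Rounded limits as displayed in the specifications above (assumed, not exact).
CONTEXT_WINDOW_TOKENS = 1_000_000  # "1.0M tokens"
MAX_OUTPUT_TOKENS = 33_000         # "33K tokens"


def clamp_output_budget(prompt_tokens, requested_output_tokens):
    """Return the largest output budget that fits the listed limits.

    Assumption: prompt tokens and output tokens both count against
    the context window.
    """
    if prompt_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    remaining = CONTEXT_WINDOW_TOKENS - prompt_tokens
    if remaining <= 0:
        raise ValueError("prompt does not fit in the context window")
    return min(requested_output_tokens, MAX_OUTPUT_TOKENS, remaining)


if __name__ == "__main__":
    print(clamp_output_budget(10_000, 50_000))   # capped by max output
    print(clamp_output_budget(990_000, 20_000))  # capped by remaining context
```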
Privacy & data
Data retention: No
Used for training: No
Provider location: 🇺🇸 US / 🇪🇺 EU
Privacy policy: Microsoft Privacy Statement
