Microsoft Azure AI
gpt-4.1-nano (uksouth)
For tasks that demand low latency, GPT-4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million token context window, and scores 80.1% on MMLU, 50.3% on GPQA, and 9.8% on Aider polyglot coding, even higher than GPT-4o mini. It's ideal for tasks like classification or autocompletion.
Vision Support
Caching Support
Pricing
Input tokens per million
$0.10
Output tokens per million
$0.40
Caching Pricing
Cache write per million
$0.10
Cache read per million
$0.02
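As a rough illustration of how the per-million rates above translate into the cost of a single request, here is a minimal Python sketch. The function name `estimate_cost` and its parameters are hypothetical and not part of any Azure SDK. It also assumes that each token is billed in exactly one category (for example, tokens read from cache are charged at the cache read rate rather than the input rate); the actual billing rules may differ, so check them with the provider.

```python
from decimal import Decimal

# USD per one million tokens, taken from the pricing listed above.
PRICE_PER_MILLION = {
    "input": Decimal("0.1"),
    "output": Decimal("0.4"),
    "cache_write": Decimal("0.1"),
    "cache_read": Decimal("0.02"),
}

ONE_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens=0, output_tokens=0,
                  cache_write_tokens=0, cache_read_tokens=0):
    """Return an estimated request cost in USD as a Decimal.

    Assumption: every token falls into exactly one billing category.
    """
    counts = {
        "input": input_tokens,
        "output": output_tokens,
        "cache_write": cache_write_tokens,
        "cache_read": cache_read_tokens,
    }
    total = Decimal(0)
    for category, tokens in counts.items():
        if tokens < 0:
            raise ValueError(f"{category} token count must be non-negative")
        # Scale the per-million rate down to the actual token count.
        total += PRICE_PER_MILLION[category] * Decimal(tokens) / ONE_MILLION
    return total


if __name__ == "__main__":
    # Hypothetical request: 20,000 uncached input tokens, 500 output tokens.
    print(estimate_cost(input_tokens=20_000, output_tokens=500))
```

Decimal is used instead of float so that very small per-request amounts add up without binary rounding drift when summed over many requests.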
Technical Specifications
Context Window
1.0M tokens
Max Output Tokens
33K tokens
Global Availability
Last Updated
N/A