Cheapest AI models by price per million tokens
Ranked by combined input + output price per million tokens (excluding free-tier models). These are production-ready models that punch well above their price point — great defaults when cost matters and you can test model quality on your own workload.
- 🥇
meta-llama/llama-3.2-1b-instructNovita AI·$0.02 in / $0.02 out$0.02 avg$0.02 avg - 🥈
meta-llama/Meta-Llama-3.1-8B-Instruct-TurboDeepInfra Inc.·$0.02 in / $0.05 out$0.03 avg$0.03 avg - 🥉
meta-llama/llama-3.2-3b-instructNovita AI·$0.03 in / $0.05 out$0.04 avg$0.04 avg - 4
meta-llama/llama-3-8b-instructNovita AI·$0.04 in / $0.04 out$0.04 avg$0.04 avg - 5
meta-llama/llama-3.1-8b-instructNovita AI·$0.05 in / $0.05 out$0.05 avg$0.05 avg - 6
sao10k/l3-8b-lunarisNovita AI·$0.05 in / $0.05 out$0.05 avg$0.05 avg - 7
Sao10K/L3-8B-Stheno-v3.2Novita AI·$0.05 in / $0.05 out$0.05 avg$0.05 avg - 8
meta-llama/Llama-3.2-3B-Instruct-TurboTogether AI Inc.·$0.06 in / $0.06 out$0.06 avg$0.06 avg - 9
gryphe/mythomax-l2-13bNovita AI·$0.09 in / $0.09 out$0.09 avg$0.09 avg - 10
meta-llama/Meta-Llama-3-8B-Instruct-LiteTogether AI Inc.·$0.10 in / $0.10 out$0.10 avg$0.10 avg - 11
phi-4DeepInfra Inc.·$0.07 in / $0.14 out$0.10 avg$0.10 avg - 12
gpt-5-nano:flexOpenAI Inc.·$0.02 in / $0.20 out$0.11 avg$0.11 avg - 13
Qwen/Qwen2.5-Coder-32B-InstructDeepInfra Inc.·$0.07 in / $0.16 out$0.12 avg$0.12 avg - 14
qwen-turboAlibaba Cloud·$0.05 in / $0.20 out$0.13 avg$0.13 avg - 15
nousresearch/hermes-2-pro-llama-3-8bNovita AI·$0.14 in / $0.14 out$0.14 avg$0.14 avg - 16
deepseek-r1-distill-qwen-14bNovita AI·$0.15 in / $0.15 out$0.15 avg$0.15 avg - 17
mistralai/mistral-nemoNovita AI·$0.17 in / $0.17 out$0.17 avg$0.17 avg - 18
meta-llama/Meta-Llama-3.1-8B-Instruct-TurboTogether AI Inc.·$0.18 in / $0.18 out$0.18 avg$0.18 avg - 19
mistral-small-2503Mistral AI SAS·$0.10 in / $0.30 out$0.20 avg$0.20 avg - 20
devstral-small-2507Mistral AI SAS·$0.10 in / $0.30 out$0.20 avg$0.20 avg - 21
devstral-small-latestMistral AI SAS·$0.10 in / $0.30 out$0.20 avg$0.20 avg - 22
meta-llama/LlamaGuard-2-8bTogether AI Inc.·$0.20 in / $0.20 out$0.20 avg$0.20 avg - 23
Qwen/Qwen3-32BDeepInfra Inc.·$0.10 in / $0.30 out$0.20 avg$0.20 avg - 24
deepseek-reasonerDeepSeek·$0.14 in / $0.28 out$0.21 avg$0.21 avg - 25
deepseek-v4-flashDeepSeek·$0.14 in / $0.28 out$0.21 avg$0.21 avg - 26
deepseek-chatDeepSeek·$0.14 in / $0.28 out$0.21 avg$0.21 avg - 27
meta-llama/Llama-3.3-70B-Instruct-TurboDeepInfra Inc.·$0.12 in / $0.30 out$0.21 avg$0.21 avg - 28
gpt-5-nano@eastus2Microsoft Azure AI·$0.05 in / $0.40 out$0.22 avg$0.22 avg - 29
gpt-5-nano@swedencentralMicrosoft Azure AI·$0.05 in / $0.40 out$0.22 avg$0.22 avg - 30
gpt-5-nanoMicrosoft Azure AI·$0.05 in / $0.40 out$0.22 avg$0.22 avg
Explore other rankings
How we rank
Ranked by combined input + output price per million tokens. Models with a $0 tier are excluded so this list reflects production-priced options you can deploy against real traffic. Pricing is synced in real-time from upstream providers and Requesty charges no markup.
One API for every model on this list
Requesty is OpenAI-compatible and routes to 400+ models. Switch between any of the models above by changing one parameter in your code.
Get started free