Requesty

Cheapest AI models by price per million tokens

Ranked by combined input + output price per million tokens (excluding free-tier models). These are production-ready models that punch well above their price point — great defaults when cost matters and you can test model quality on your own workload.

  1. 🥇
    Novita AI logo
    meta-llama/llama-3.2-1b-instruct
    Novita AI·$0.02 in / $0.02 out
    $0.02 avg
  2. 🥈
    DeepInfra Inc. logo
    meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
    DeepInfra Inc.·$0.02 in / $0.05 out
    $0.03 avg
  3. 🥉
    Novita AI logo
    meta-llama/llama-3.2-3b-instruct
    Novita AI·$0.03 in / $0.05 out
    $0.04 avg
  4. 4
    Novita AI logo
    meta-llama/llama-3-8b-instruct
    Novita AI·$0.04 in / $0.04 out
    $0.04 avg
  5. 5
    Novita AI logo
    meta-llama/llama-3.1-8b-instruct
    Novita AI·$0.05 in / $0.05 out
    $0.05 avg
  6. 6
    Novita AI logo
    sao10k/l3-8b-lunaris
    Novita AI·$0.05 in / $0.05 out
    $0.05 avg
  7. 7
    Novita AI logo
    Sao10K/L3-8B-Stheno-v3.2
    Novita AI·$0.05 in / $0.05 out
    $0.05 avg
  8. 8
    Together AI Inc. logo
    meta-llama/Llama-3.2-3B-Instruct-Turbo
    Together AI Inc.·$0.06 in / $0.06 out
    $0.06 avg
  9. 9
    Novita AI logo
    gryphe/mythomax-l2-13b
    Novita AI·$0.09 in / $0.09 out
    $0.09 avg
  10. 10
    Together AI Inc. logo
    meta-llama/Meta-Llama-3-8B-Instruct-Lite
    Together AI Inc.·$0.10 in / $0.10 out
    $0.10 avg
  11. 11
    DeepInfra Inc. logo
    phi-4
    DeepInfra Inc.·$0.07 in / $0.14 out
    $0.10 avg
  12. 12
    OpenAI Inc. logo
    gpt-5-nano:flex
    OpenAI Inc.·$0.02 in / $0.20 out
    $0.11 avg
  13. 13
    DeepInfra Inc. logo
    Qwen/Qwen2.5-Coder-32B-Instruct
    DeepInfra Inc.·$0.07 in / $0.16 out
    $0.12 avg
  14. 14
    Alibaba Cloud logo
    qwen-turbo
    Alibaba Cloud·$0.05 in / $0.20 out
    $0.13 avg
  15. 15
    Novita AI logo
    nousresearch/hermes-2-pro-llama-3-8b
    Novita AI·$0.14 in / $0.14 out
    $0.14 avg
  16. 16
    Novita AI logo
    deepseek-r1-distill-qwen-14b
    Novita AI·$0.15 in / $0.15 out
    $0.15 avg
  17. 17
    Novita AI logo
    mistralai/mistral-nemo
    Novita AI·$0.17 in / $0.17 out
    $0.17 avg
  18. 18
    Together AI Inc. logo
    meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
    Together AI Inc.·$0.18 in / $0.18 out
    $0.18 avg
  19. 19
    Mistral AI SAS logo
    mistral-small-2503
    Mistral AI SAS·$0.10 in / $0.30 out
    $0.20 avg
  20. 20
    Mistral AI SAS logo
    devstral-small-2507
    Mistral AI SAS·$0.10 in / $0.30 out
    $0.20 avg
  21. 21
    Mistral AI SAS logo
    devstral-small-latest
    Mistral AI SAS·$0.10 in / $0.30 out
    $0.20 avg
  22. 22
    Together AI Inc. logo
    meta-llama/LlamaGuard-2-8b
    Together AI Inc.·$0.20 in / $0.20 out
    $0.20 avg
  23. 23
    DeepInfra Inc. logo
    Qwen/Qwen3-32B
    DeepInfra Inc.·$0.10 in / $0.30 out
    $0.20 avg
  24. 24
    DeepSeek logo
    deepseek-reasoner
    DeepSeek·$0.14 in / $0.28 out
    $0.21 avg
  25. 25
    DeepSeek logo
    deepseek-v4-flash
    DeepSeek·$0.14 in / $0.28 out
    $0.21 avg
  26. 26
    DeepSeek logo
    deepseek-chat
    DeepSeek·$0.14 in / $0.28 out
    $0.21 avg
  27. 27
    DeepInfra Inc. logo
    meta-llama/Llama-3.3-70B-Instruct-Turbo
    DeepInfra Inc.·$0.12 in / $0.30 out
    $0.21 avg
  28. 28
    Microsoft Azure AI logo
    gpt-5-nano@eastus2
    Microsoft Azure AI·$0.05 in / $0.40 out
    $0.22 avg
  29. 29
    Microsoft Azure AI logo
    gpt-5-nano@swedencentral
    Microsoft Azure AI·$0.05 in / $0.40 out
    $0.22 avg
  30. 30
    Microsoft Azure AI logo
    gpt-5-nano
    Microsoft Azure AI·$0.05 in / $0.40 out
    $0.22 avg

How we rank

Ranked by combined input + output price per million tokens. Models with a $0 tier are excluded so this list reflects production-priced options you can deploy against real traffic. Pricing is synced in real-time from upstream providers and Requesty charges no markup.

One API for every model on this list

Requesty is OpenAI-compatible and routes to 400+ models. Switch between any of the models above by changing one parameter in your code.

Get started free