Requesty

Best AI models for coding

SWE-Bench Verified measures how often a model can resolve real GitHub issues from 12 popular Python repositories. It is the most realistic coding benchmark available — scores here translate to the model's ability to ship actual pull requests, not just pass unit tests on toy problems.

  1. 🥇
    OpenAI Responses logo
    gpt-5.2-codex
    OpenAI Responses·$1.75 / $14.00 per 1M
    84.7%
  2. 🥈
    OpenAI Inc. logo
    gpt-5.4
    OpenAI Inc.·$2.50 / $15.00 per 1M
    82.1%
  3. 🥉
    OpenAI Inc. logo
    gpt-5.2
    OpenAI Inc.·$1.75 / $14.00 per 1M
    79.5%
  4. 4
    Anthropic PBC logo
    claude-opus-4-7
    Anthropic PBC·$5.00 / $25.00 per 1M
    78.6%
  5. 5
    Anthropic PBC logo
    claude-sonnet-4-6
    Anthropic PBC·$3.00 / $15.00 per 1M
    77.2%
  6. 6
    OpenAI Inc. logo
    gpt-5.1-chat
    OpenAI Inc.·$1.25 / $10.00 per 1M
    76.8%
  7. 7
    OpenAI Inc. logo
    gpt-5
    OpenAI Inc.·$1.25 / $10.00 per 1M
    74.9%
  8. 8
    Anthropic PBC logo
    claude-opus-4-6
    Anthropic PBC·$5.00 / $25.00 per 1M
    74.5%
  9. 9
    grok-4
    xAI Corp.·$3.00 / $15.00 per 1M
    72.5%
  10. 10
    OpenAI Inc. logo
    o3
    OpenAI Inc.·$2.00 / $8.00 per 1M
    71.7%
  11. 11
    Anthropic PBC logo
    claude-opus-4-5
    Anthropic PBC·$5.00 / $25.00 per 1M
    71.3%
  12. 12
    Anthropic PBC logo
    claude-sonnet-4-5
    Anthropic PBC·$3.00 / $15.00 per 1M
    70.8%
  13. 13
    MiniMax-M2
    MiniMax·$0.30 / $1.20 per 1M
    69.3%
  14. 14
    OpenAI Inc. logo
    gpt-4.1
    OpenAI Inc.·$2.00 / $8.00 per 1M
    68.1%
  15. 15
    Google LLC (Vertex AI) logo
    kimi-k2
    Google LLC (Vertex AI)·$0.60 / $2.50 per 1M
    65.8%
  16. 16
    Anthropic PBC logo
    claude-sonnet-4
    Anthropic PBC·$3.00 / $15.00 per 1M
    65.2%
  17. 17
    Google LLC (Gemini API) logo
    gemini-2.5-pro
    Google LLC (Gemini API)·$1.25 / $10.00 per 1M
    63.8%
  18. 18
    Google LLC (Vertex AI) logo
    claude-3-7-sonnet
    Google LLC (Vertex AI)·$3.00 / $15.00 per 1M
    62.3%
  19. 19
    OpenAI Inc. logo
    o3-mini
    OpenAI Inc.·$1.10 / $4.40 per 1M
    61.0%
  20. 20
    grok-3
    xAI Corp.·$5.00 / $25.00 per 1M
    58.3%
  21. 21
    OpenAI Inc. logo
    gpt-4.1-mini
    OpenAI Inc.·$0.40 / $1.60 per 1M
    55.1%
  22. 22
    Anthropic PBC logo
    claude-haiku-4-5
    Anthropic PBC·$1.00 / $5.00 per 1M
    54.2%
  23. 23
    Google LLC (Gemini API) logo
    gemini-2.5-flash
    Google LLC (Gemini API)·$0.30 / $2.50 per 1M
    53.2%
  24. 24
    Together AI Inc. logo
    deepseek-ai/DeepSeek-R1
    Together AI Inc.·$3.00 / $7.00 per 1M
    49.2%
  25. 25
    OpenAI Inc. logo
    o1
    OpenAI Inc.·$15.00 / $60.00 per 1M
    48.9%
  26. 26
    OpenAI Inc. logo
    gpt-4.1-nano
    OpenAI Inc.·$0.10 / $0.40 per 1M
    42.5%
  27. 27
    Together AI Inc. logo
    deepseek-ai/DeepSeek-V3
    Together AI Inc.·$1.25 / $1.25 per 1M
    42.0%
  28. 28
    OpenAI Inc. logo
    gpt-4o
    OpenAI Inc.·$2.50 / $10.00 per 1M
    38.0%
  29. 29
    Novita AI logo
    meta-llama/llama-3.3-70b-instruct
    Novita AI·$0.39 / $0.39 per 1M
    23.3%

How we rank

Scores for SWE-Bench Verified are sourced from official model cards, Artificial Analysis, and public leaderboards. When a model is available through multiple providers (e.g. Anthropic direct, AWS Bedrock, Google Vertex), we show one canonical entry per model family so the ranking isn't polluted by duplicates. Benchmarks measure specific skills — always validate on your own workload before committing.

One API for every model on this list

Requesty is OpenAI-compatible and routes to 400+ models. Switch between any of the models above by changing one parameter in your code.

Get started free