Operational metrics per provider, April 2026
Operational metrics per provider, April 2026
Switch metrics. Hover any row to see all three at once.
How reliable is each LLM provider in production? In April 2026 the top eight providers on the Requesty gateway (OpenAI, Anthropic, Vertex (Gemini), Bedrock, DeepSeek, Novita, xAI) sat at 95-99% success rate. Azure trailed at 78%, Vertex (Claude) at 84%, Mistral at 86%, and Moonshot at 6%, a real reliability outlier. Streaming adoption is bimodal too: Azure 68%, Anthropic 57%, everyone else under 30%.
Why it mattersProvider success rate translates directly into user-visible failures unless an application has a managed fallback chain. The 95-99% top tier is comfortably reliable; Vertex (Claude) and Azure visibly failing roughly 1 in 5 calls demands either a routing policy or active provider switching at the application layer to avoid sustained user pain.
Key findings
- 01Success is bimodal: top tier at 95 to 99%, Vertex (Claude) 84%, Azure 78%, Mistral 86%, Moonshot 6%.
- 02Streaming adoption is bimodal: Azure 68% and Anthropic 57%. Vertex (Claude) at 28%. Everyone else <10%.
- 03Cache hit rate ranges from Anthropic-direct 77% to Vertex (Claude) 24% (same model family, 3x spread).
Data
| Provider | Success rate(percent) | Streaming(percent) | Cache hit(percent) |
|---|---|---|---|
| xAI | 99.30% | 1.30% | 35.70% |
| DeepSeek | 98.30% | 2.80% | 48.30% |
| OpenAI | 98.00% | 7.20% | 36.40% |
| Novita | 97.20% | 2.30% | 31.90% |
| Anthropic | 96.00% | 56.90% | 77.50% |
| Vertex (Gemini) | 95.90% | 3.70% | 9.60% |
| Bedrock | 95.60% | 9.70% | 56.90% |
| Mistral | 86.30% | 8.00% | 4.10% |
| Vertex (Claude) | 84.40% | 27.60% | 23.50% |
| Azure | 78.00% | 68.30% | 41.00% |
| Moonshot | 6.20% | 4.80% | 88.20% |
Cite as
Cited in
- What the gateway saw in April 2026/blog/provider-trends-april-2026-agentic-share-latency
