Intermittent 500 Errors on Serverless/GenAI Inference API
Timeline · 3 updates
- investigating Jun 16, 2026, 10:52 PM UTC
We are actively investigating an issue causing elevated HTTP 500 error rates for customers utilizing our Serverless/GenAI Inference API. Customer Impact: Customers making calls to the inference API—specifically targeting /v1/* endpoints—will experience intermittent HTTP 500 errors and failed requests.
- monitoring Jun 16, 2026, 11:29 PM UTC
Our engineering teams have successfully begun implementing mitigation steps to resolve the connectivity issues affecting the inference API. We will provide another update once the API has fully recovered and error rates return to normal.
- resolved Jun 17, 2026, 12:29 AM UTC
The connectivity issues affecting our Serverless/GenAI Inference API have been fully resolved. Our engineering teams successfully completed the connectivity restoration. All systems should be functioning normally, and the endpoint should be fully operational.