Replicate incident

Prediction and Training status updates delayed

Replicate is currently experiencing a major incident affecting the Streaming API, the HTTP API, and 1 more component, which began 1h ago. The vendor's full update timeline is below.

Started: May 21, 2026, 09:38 PM UTC
Resolved: Ongoing
Duration: 1h 25m
Detected by Pingoru: May 21, 2026, 09:38 PM UTC

Affected components

Streaming API, HTTP API, CPU Hardware, A100 Hardware, Playground, Home Page, L40S Hardware, H100 Hardware, T4 Hardware

Update timeline

  1. identified May 21, 2026, 09:38 PM UTC

    Status: Identified

    Our message queues for prediction and training status updates are hitting capacity limits, which is causing connection failures for queue consumers. We are in the process of bringing additional capacity online.

    Affected components: Playground (Degraded performance), H100 Hardware (Degraded performance), HTTP API (Degraded performance), T4 Hardware (Degraded performance), CPU Hardware (Degraded performance), Home Page (Degraded performance), L40S Hardware (Degraded performance), A100 Hardware (Degraded performance), Streaming API (Degraded performance)