Scale AI incident

Platform Outage

Critical Resolved View vendor source →

Scale AI experienced a critical incident on January 29, 2022 affecting API and Platform and 1 more component, lasting 37m. The incident has been resolved; the full update timeline is below.

Started
Jan 29, 2022, 12:20 AM UTC
Resolved
Jan 29, 2022, 12:57 AM UTC
Duration
37m
Detected by Pingoru
Jan 29, 2022, 12:20 AM UTC

Affected components

APIPlatformWeb ApplicationQuality AssuranceDocument AINucleus

Update timeline

  1. investigating Jan 29, 2022, 12:20 AM UTC

    We are currently experiencing a platform outage. All services are impacted, including the Scale Dashboard, API, and associated services. We are currently reviewing this issue.

  2. monitoring Jan 29, 2022, 12:24 AM UTC

    The platform has recovered. We are actively monitoring and root causing the issue.

  3. investigating Jan 29, 2022, 12:41 AM UTC

    The platform is experiencing a degraded service level and we are continuing to investigate. API and Website error rates are elevated.

  4. resolved Jan 29, 2022, 12:57 AM UTC

    We have identified an issue related to database index performance. We have completed deploying a fix for this issue. The platform is up and running with non-elevated error rates.