DataRobot incident

The custom workload is failing to start

Major Resolved View vendor source →

DataRobot experienced a major incident on January 8, 2025 affecting AI Apps and MLOps, lasting 2h 24m. The incident has been resolved; the full update timeline is below.

Started
Jan 08, 2025, 03:34 PM UTC
Resolved
Jan 08, 2025, 05:58 PM UTC
Duration
2h 24m
Detected by Pingoru
Jan 08, 2025, 03:34 PM UTC

Affected components

AI AppsMLOps

Update timeline

  1. investigating Jan 08, 2025, 03:34 PM UTC

    The Custom Workload is failing to start in the US MTSaaS environment. Existing running Custom Apps aren't affected. The Engineering team is investigating the root cause.

  2. resolved Jan 08, 2025, 05:58 PM UTC

    Engineering has identified the root cause and fixed the issue. The Custom Workloads are starting normally on the US MTSaaS environment, and the incident is resolved.