DataRobot incident

Custom model services issue

Minor Resolved View vendor source →

DataRobot experienced a minor incident on October 15, 2024 affecting MLOps, lasting 3h 29m. The incident has been resolved; the full update timeline is below.

Started
Oct 15, 2024, 12:09 PM UTC
Resolved
Oct 15, 2024, 03:39 PM UTC
Duration
3h 29m
Detected by Pingoru
Oct 15, 2024, 12:09 PM UTC

Affected components

MLOps

Update timeline

  1. investigating Oct 15, 2024, 12:09 PM UTC

    We are observing an issue with the Custom model services on the US production environment causing Custom Models, Custom Jobs and Custom Apps to stop working. Engineering is currently investigating the issue.

  2. identified Oct 15, 2024, 12:41 PM UTC

    Engineering has identified the root cause and is currently working on fixing the issue.

  3. identified Oct 15, 2024, 02:49 PM UTC

    We are continuing to work on a fix for this issue.

  4. resolved Oct 15, 2024, 03:39 PM UTC

    The engineering has resolved the issue. This incident is now contained.