DataRobot Outage History

DataRobot is up right now

There were 10 DataRobot outages since February 13, 2026 totaling 175h 50m of downtime. Each is summarised below — incident details, duration, and resolution information.

Source: https://status.datarobot.com

Minor April 10, 2026

Delay in processing actual messages

Detected by Pingoru
Apr 10, 2026, 10:24 AM UTC
Resolved
Apr 10, 2026, 11:01 AM UTC
Duration
36m
Affected: MLOps
Timeline · 2 updates
  1. monitoring Apr 10, 2026, 10:24 AM UTC

    Processing actual messages on JP MTS is delayed due to autoscaling malfunction. Engineering scaled up the deployment to alleviate the issue. Root cause mitigation in progress

  2. resolved Apr 10, 2026, 11:01 AM UTC

    Engineering has applied the required infrastructure configuration changes. The service is operating normally and no further user impact is observed. Engineering will continue monitoring cluster health to ensure stability. The incident is now marked as Contained.

Read the full incident report →

Major April 9, 2026

Elevated Errors on Managed AI Cloud

Detected by Pingoru
Apr 09, 2026, 09:56 PM UTC
Resolved
Apr 10, 2026, 01:04 PM UTC
Duration
15h 7m
Affected: AI AppsNotebooks
Timeline · 4 updates
  1. investigating Apr 09, 2026, 09:56 PM UTC

    We're experiencing an elevated level of errors and are currently looking into the issue.

  2. monitoring Apr 09, 2026, 10:56 PM UTC

    A fix has been implemented and we are monitoring the results.

  3. monitoring Apr 10, 2026, 11:31 AM UTC

    Engineering has applied changes to mitigate the elevated error rates. Services are now operating normally. We are continuing to monitor the system while investigating the cause of the issue.

  4. resolved Apr 10, 2026, 01:04 PM UTC

    Engineering has implemented the required fixes to resolve the elevated error rates. Services are now operating normally, and no further user impact has been observed. The team will continue to monitor the system to ensure stability. The incident is now considered contained.

Read the full incident report →

Minor March 30, 2026

Degraded Performance on DataRobot MTS due to Quay outage

Detected by Pingoru
Mar 30, 2026, 08:43 PM UTC
Resolved
Mar 31, 2026, 08:53 AM UTC
Duration
12h 10m
Affected: WebsiteWebsiteWebsiteAPIAPIAPIPredictionsPredictionsPredictionAutoMLAutoMLAutoMLAI Catalog and Data IngestAI Catalog and Data IngestAI Catalog and Data IngestAI AppsAI AppsAI AppsMLOpsMLOpsMLOpsPipelineGenerative AI LLM PlaygroundNotebooksGenerative AI VDB BuilderGenerative AI LLM PlaygroundNotebooksGenerative AI VDB BuilderGenerative AI LLM PlaygroundGenerative AI VDB Builder
Timeline · 3 updates
  1. identified Mar 30, 2026, 08:43 PM UTC

    Our engineering team has found the the Quay outage currently happening is causing degraded performance across the DataRobot platform. Engineering is currently monitoring the situation.

  2. identified Mar 30, 2026, 08:44 PM UTC

    We are continuing to work on a fix for this issue.

  3. resolved Mar 31, 2026, 08:53 AM UTC

    Quay.io functionality has been restored and DataRobot environments are fully stabilized.

Read the full incident report →

Minor March 13, 2026

Performance Degradation on Managed AI Cloud

Detected by Pingoru
Mar 13, 2026, 05:49 PM UTC
Resolved
Mar 13, 2026, 06:43 PM UTC
Duration
54m
Affected: API
Timeline · 3 updates
  1. investigating Mar 13, 2026, 05:49 PM UTC

    We are experiencing performance degradation on Managed AI Cloud.

  2. monitoring Mar 13, 2026, 06:31 PM UTC

    A fix has been implemented and we are monitoring the results.

  3. resolved Mar 13, 2026, 06:43 PM UTC

    This incident has been resolved.

Read the full incident report →

Minor March 11, 2026

Intermittent UI disruptions on Managed AI Cloud

Detected by Pingoru
Mar 11, 2026, 08:05 PM UTC
Resolved
Mar 17, 2026, 06:32 PM UTC
Duration
5d 22h
Affected: WebsiteAPI
Timeline · 4 updates
  1. investigating Mar 11, 2026, 08:05 PM UTC

    We are currently investigating this issue.

  2. monitoring Mar 11, 2026, 08:22 PM UTC

    A fix has been implemented and we are monitoring the results.

  3. monitoring Mar 17, 2026, 06:31 PM UTC

    We are continuing to monitor for any further issues.

  4. resolved Mar 17, 2026, 06:32 PM UTC

    This incident has been resolved.

Read the full incident report →

Major March 11, 2026

Network issue related to Kubernetes in US cluster

Detected by Pingoru
Mar 11, 2026, 02:36 PM UTC
Resolved
Mar 11, 2026, 04:41 PM UTC
Duration
2h 5m
Affected: PredictionsMLOpsPipeline
Timeline · 4 updates
  1. investigating Mar 11, 2026, 02:36 PM UTC

    DataRobot is experiencing network issue related to Kubernetes in US Cluster. This will have impact on model deployment and predictions. Engineering is investigating the root cause.

  2. identified Mar 11, 2026, 03:06 PM UTC

    Engineering has identified the root cause of the problem and a mitigation is put in place.

  3. monitoring Mar 11, 2026, 03:19 PM UTC

    The mitigation implemented by Engineering has improved the network issue. The team is continuing to monitor the environment to ensure full recovery.

  4. resolved Mar 11, 2026, 04:41 PM UTC

    The mitigation implemented by Engineering has resolved the Kubernetes network issue, and the incident is now contained.

Read the full incident report →

Minor February 18, 2026

Degraded Performance on the DataRobot MTS due to Quay outage

Detected by Pingoru
Feb 18, 2026, 09:04 PM UTC
Resolved
Feb 18, 2026, 09:19 PM UTC
Duration
15m
Affected: WebsiteWebsiteWebsiteAPIAPIAPIPredictionsPredictionsPredictionAutoMLAutoMLAutoMLAI Catalog and Data IngestAI Catalog and Data IngestAI Catalog and Data IngestAI AppsAI AppsAI AppsMLOpsMLOpsMLOpsPipelineGenerative AI LLM PlaygroundNotebooksGenerative AI VDB BuilderGenerative AI LLM PlaygroundNotebooksGenerative AI VDB BuilderGenerative AI LLM PlaygroundGenerative AI VDB Builder
Timeline · 2 updates
  1. investigating Feb 18, 2026, 09:04 PM UTC

    Our engineering team has found the the Quay outage currently happening is causing degraded performance across the DataRobot platform.

  2. resolved Feb 18, 2026, 09:19 PM UTC

    This incident is now resolved.

Read the full incident report →

Major February 17, 2026

LLM blueprints deployments cannot be created

Detected by Pingoru
Feb 17, 2026, 03:48 PM UTC
Resolved
Feb 17, 2026, 03:55 PM UTC
Duration
7m
Affected: Generative AI LLM Playground
Timeline · 2 updates
  1. identified Feb 17, 2026, 03:48 PM UTC

    LLM blueprints deployments can not be created in JP MTS environment. Engineering is rolling back JP cluster to previous version to mitigate the issue.

  2. resolved Feb 17, 2026, 03:55 PM UTC

    Rollback of the JP cluster to the previous version is complete and the problem has been mitigated.

Read the full incident report →

Minor February 16, 2026

Agent Application Template Impacted After Moderations Library Upgrade.

Detected by Pingoru
Feb 16, 2026, 11:35 AM UTC
Resolved
Feb 16, 2026, 12:49 PM UTC
Duration
1h 13m
Affected: AI AppsAI AppsAI Apps
Timeline · 2 updates
  1. identified Feb 16, 2026, 11:35 AM UTC

    Agent application template is affected with the recent moderations library upgrade, fix is identified and mitigation is in progress.

  2. resolved Feb 16, 2026, 12:49 PM UTC

    New version of Agentic application template is released, the issue is resolved

Read the full incident report →

Minor February 13, 2026

Degraded Performance on the DataRobot US MTS

Detected by Pingoru
Feb 13, 2026, 08:39 PM UTC
Resolved
Feb 13, 2026, 09:31 PM UTC
Duration
52m
Affected: APIAI Catalog and Data Ingest
Timeline · 2 updates
  1. investigating Feb 13, 2026, 08:39 PM UTC

    We are observing issues on DataRobot US MTS environment. Users may experience degraded performance using APIs and data ingest services. The engineering team is currently investigating the root cause.

  2. resolved Feb 13, 2026, 09:31 PM UTC

    The incident has now been resolved. All services are now operational.

Read the full incident report →

Looking to track DataRobot downtime and outages?

Pingoru polls DataRobot's status page every 5 minutes and alerts you the moment it reports an issue — before your customers do.

  • Real-time alerts when DataRobot reports an incident
  • Email, Slack, Discord, Microsoft Teams, and webhook notifications
  • Track DataRobot alongside 5,000+ providers in one dashboard
  • Component-level filtering
  • Notification groups + maintenance calendar
Start monitoring DataRobot for free

5 free monitors · No credit card required