Datadog US5 Outage History
Datadog US5 is up right now.
Datadog US5 had 29 outages in the last 2 years (since July 3, 2024), totaling 50h 54m of downtime and averaging 1.2 incidents per month. Each is summarised below with incident details, duration, and resolution information.
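As a rough illustration of how the figures above relate to the timelines below, the following sketch measures an incident as the span between its first and last timeline update and recomputes the monthly rate. The first-to-last-update definition and the 24-month window are assumptions, since this page does not state how it calculates downtime; the helper names `parse_utc` and `incident_duration` are hypothetical.

```python
from datetime import datetime

# Timestamp layout used in the timelines, e.g. "May 16, 2026, 02:20 AM UTC".
# Month abbreviations are parsed with %b, which assumes an English locale.
FMT = "%b %d, %Y, %I:%M %p"

def parse_utc(stamp):
    """Parse one timeline timestamp, dropping the trailing ' UTC' label."""
    return datetime.strptime(stamp.replace(" UTC", ""), FMT)

def incident_duration(stamps):
    """Assumed definition: time between the earliest and latest update."""
    times = [parse_utc(s) for s in stamps]
    return max(times) - min(times)

# First and last updates of the "Azure Metrics Reporting" incident.
azure = ["May 16, 2026, 02:20 AM UTC", "May 16, 2026, 03:42 AM UTC"]
print(incident_duration(azure))  # 1:22:00

# 29 incidents over an assumed 24-month window.
incidents_per_month = round(29 / 24, 1)
print(incidents_per_month)  # 1.2
```

Summing such spans across all incidents will not necessarily reproduce the 50h 54m total, because some updates state that impact began before the first posted update.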
Azure Metrics Reporting
Timeline · 5 updates
- investigating May 16, 2026, 02:20 AM UTC
We are investigating an issue submitting Azure metrics.
- identified May 16, 2026, 02:39 AM UTC
The issue has been identified and a fix is being implemented.
- identified May 16, 2026, 03:08 AM UTC
We are continuing to work on a fix for this issue.
- monitoring May 16, 2026, 03:27 AM UTC
A fix has been implemented and we are monitoring the results.
- resolved May 16, 2026, 03:42 AM UTC
This incident has been resolved and Azure metrics are reporting as expected.
Elevated Error Rates
Timeline · 4 updates
- investigating May 06, 2026, 12:25 PM UTC
We are actively investigating elevated error rates. As a result of this issue, some users may see errors when trying to load resources on the web application or API.
- identified May 06, 2026, 12:27 PM UTC
The issue has been identified and a fix is being implemented.
- monitoring May 06, 2026, 12:32 PM UTC
A fix has been implemented and we are monitoring the results.
- resolved May 06, 2026, 12:41 PM UTC
This incident has been resolved.
APM Service - Detailed trace information not available in the UI
Timeline · 4 updates
- investigating Apr 10, 2026, 07:31 AM UTC
We are currently investigating an issue affecting the ability to view detailed information for APM traces. APM trace ingestion and monitoring are not affected.
- identified Apr 10, 2026, 08:00 AM UTC
The issue has been identified and a fix is being implemented.
- monitoring Apr 10, 2026, 08:23 AM UTC
A fix has been implemented and we are monitoring the results.
- resolved Apr 10, 2026, 08:33 AM UTC
This incident has been resolved.
Intermittent "No Data" Status for Monitors
Timeline · 5 updates
- investigating Mar 19, 2026, 12:55 AM UTC
We are investigating an issue causing some metric monitors to intermittently report "No Data" status. This primarily affects monitors based on distribution metrics. Monitor alerting may be unreliable during this time. We are actively investigating and will provide updates as available.
- investigating Mar 19, 2026, 01:27 AM UTC
We are continuing to investigate the issue and will provide updates as available.
- investigating Mar 19, 2026, 02:04 AM UTC
We are continuing to investigate the issue and will provide updates as available.
- monitoring Mar 19, 2026, 02:39 AM UTC
A fix has been implemented and we are monitoring the results.
- resolved Mar 19, 2026, 02:53 AM UTC
This incident has been resolved.
Degraded Performance affecting Login, APM and Logs
Timeline · 5 updates
- investigating Mar 10, 2026, 03:57 AM UTC
We’re investigating an issue in production causing intermittent password login failures and slow UI performance. APM traces are being ingested but may not display, and Logs Explorer may load slowly.
- identified Mar 10, 2026, 04:25 AM UTC
The issue has been identified and a fix is being implemented.
- identified Mar 10, 2026, 05:10 AM UTC
We are continuing to work on a fix for this issue.
- monitoring Mar 10, 2026, 05:29 AM UTC
A fix has been implemented and we are monitoring the results.
- resolved Mar 10, 2026, 05:38 AM UTC
This incident has been resolved.
Delays in Monitor Evaluations
Timeline · 10 updates
Web Application Not Loading
Timeline · 3 updates
- investigating Jan 22, 2026, 07:02 PM UTC
Web Application Not Loading
- monitoring Jan 22, 2026, 07:13 PM UTC
A fix has been implemented and we are monitoring the results.
- resolved Jan 22, 2026, 07:27 PM UTC
This incident has been resolved.
Delayed Metrics
Timeline · 4 updates
- investigating Dec 13, 2025, 09:51 AM UTC
We are investigating increased latency processing Metrics. As a result of this issue, some users may see delays or gaps for metrics on graphs. To prevent spurious alerts, we have temporarily disabled monitors based on this data.
- identified Dec 13, 2025, 09:52 AM UTC
The issue has been identified and a fix is being implemented.
- monitoring Dec 13, 2025, 09:59 AM UTC
A fix has been implemented and we are monitoring the results.
- resolved Dec 13, 2025, 10:12 AM UTC
This incident has been resolved.
Metric Monitors - Delayed Evaluation
Timeline · 4 updates
- investigating Dec 11, 2025, 08:11 PM UTC
We are investigating delays in Metric Monitors Evaluation, which began at 19:38 UTC.
- identified Dec 11, 2025, 08:36 PM UTC
The issue has been identified and a fix is being implemented.
- monitoring Dec 11, 2025, 08:49 PM UTC
A fix has been implemented and we are monitoring the results.
- resolved Dec 11, 2025, 09:08 PM UTC
This incident has been resolved.
Delayed Monitors Notifications
Timeline · 2 updates
- investigating Nov 18, 2025, 01:20 PM UTC
We are investigating delays in RUM-based Monitors Notifications, which began at 11:30am UTC.
- resolved Nov 18, 2025, 01:56 PM UTC
This incident has been resolved. Notification delays were only affecting our internal monitoring and were due to the ongoing Cloudflare incident: https://www.cloudflarestatus.com/incidents/8gmgl950y3h7/.
Delays in APM trace metrics
Timeline · 5 updates
- identified Oct 27, 2025, 04:30 PM UTC
We've identified delays in processing metrics generated from APM traces. We have temporarily disabled monitors that utilize this data. Customers may experience delays of up to 10 minutes before seeing APM metrics in the app.
- monitoring Oct 27, 2025, 04:36 PM UTC
A fix has been implemented and we are monitoring the results.
- identified Oct 27, 2025, 05:36 PM UTC
We’re seeing delays in evaluating SLO monitors. We have temporarily disabled monitors that utilize this data.
- monitoring Oct 27, 2025, 08:05 PM UTC
A fix has been implemented and we are monitoring the results.
- resolved Oct 27, 2025, 08:21 PM UTC
This incident has been resolved.
Login Errors
Timeline · 3 updates
- investigating Oct 23, 2025, 05:53 AM UTC
We are investigating user login issues with the web application. Please note that data processing and alerts are not affected by this incident.
- monitoring Oct 23, 2025, 06:41 AM UTC
A fix has been implemented and we are monitoring the results.
- resolved Oct 23, 2025, 06:50 AM UTC
This incident has been resolved.
[SSO] Login Errors from Google SSO
Timeline · 3 updates
- investigating Sep 18, 2025, 03:15 PM UTC
We are investigating user login issues with the web application via Google SSO. Please note that data processing and alerts are not affected by this incident.
- monitoring Sep 18, 2025, 03:47 PM UTC
We are seeing recovery in Google SSO logins. We are continuing to monitor for issues.
- resolved Sep 18, 2025, 04:24 PM UTC
This incident has been resolved.
PagerDuty Monitor Notifications Delayed
Timeline · 3 updates
- investigating Aug 28, 2025, 04:43 AM UTC
Monitor Notifications are delayed for PagerDuty.
- monitoring Aug 28, 2025, 08:02 AM UTC
We are observing some recovery from notification delays and continue to monitor the situation. Please follow our integration status page for details: https://datadogintegrations.statuspage.io/
- resolved Aug 28, 2025, 09:07 AM UTC
PagerDuty notification deliveries are back to normal.
Delayed Data Ingest
Timeline · 4 updates
- investigating Aug 06, 2025, 03:35 AM UTC
We are experiencing significant lag in data ingestion and monitoring. This impacts multiple Datadog products. We will provide more specific updates shortly.
- identified Aug 06, 2025, 04:20 AM UTC
We have identified the source of the issue and are working on a solution.
- monitoring Aug 06, 2025, 05:25 AM UTC
The issue has been resolved on our partner's side. We are closely monitoring our services to avoid another degradation. We are working on an analysis to fully understand the impact.
- resolved Aug 06, 2025, 08:57 AM UTC
We confirmed the source of the issue with our partner and that the underlying impacted services have fully recovered. Datadog products have fully recovered and have been stable since our last update. This incident has been resolved.
Google SSO login errors
Timeline · 3 updates
- investigating Jul 18, 2025, 03:37 PM UTC
We are investigating user login issues with the web application via Google SSO. Please note that data processing and alerts are not affected by this incident.
- identified Jul 18, 2025, 03:43 PM UTC
Google declared an incident regarding this issue: https://www.google.com/appsstatus/dashboard/incidents/oFcAZTr4EVieF5Fr6Ee9
- resolved Jul 18, 2025, 04:10 PM UTC
This incident has been resolved.
Multiple components impacted by provider outage
Timeline · 7 updates
- investigating Jun 12, 2025, 06:26 PM UTC
We are currently investigating this issue.
- identified Jun 12, 2025, 06:54 PM UTC
The issue has been identified and a fix is being implemented.
- identified Jun 12, 2025, 07:00 PM UTC
An issue with Google Cloud Services https://status.cloud.google.com/ is still affecting multiple components.
- identified Jun 12, 2025, 07:54 PM UTC
We are still experiencing issues across our services and are monitoring GCS Health https://status.cloud.google.com/incidents/ow5i3PPK96RduMcb1SsW
- monitoring Jun 12, 2025, 08:51 PM UTC
We are starting to see some recovery with our core services (Log Management, Metrics, Monitors) and are continuing to monitor the situation.
- monitoring Jun 12, 2025, 09:26 PM UTC
We have recovered core services (Log Management, Metrics, Monitors). There will be some delays as we process through our backlogs.
- resolved Jun 12, 2025, 09:42 PM UTC
This incident has been resolved.
Login Issues
Timeline · 5 updates
- investigating Mar 25, 2025, 07:06 PM UTC
We are investigating user login issues related to reCAPTCHA for customers using password login. If you experience an issue with reCAPTCHA, refreshing the page can often mitigate the issue. Please note that data processing and alerts are not affected by this incident.
- identified Mar 25, 2025, 07:24 PM UTC
The issue has been identified and a fix is being implemented.
- identified Mar 25, 2025, 09:33 PM UTC
We are continuing to work on a fix for this issue.
- monitoring Mar 25, 2025, 10:28 PM UTC
A fix has been implemented and we are monitoring the results.
- resolved Mar 25, 2025, 10:59 PM UTC
This incident has been resolved.
Degraded Web Application Performance
Timeline · 2 updates
- investigating Mar 11, 2025, 03:58 PM UTC
We are investigating degraded performance with the web application.
- resolved Mar 11, 2025, 04:35 PM UTC
This incident has been resolved.
Delayed data for Data Jobs Monitoring
Timeline · 3 updates
- identified Feb 14, 2025, 01:03 PM UTC
The issue has been identified and a fix is being implemented.
- monitoring Feb 14, 2025, 01:11 PM UTC
A fix has been implemented and we are monitoring the results.
- resolved Feb 14, 2025, 01:12 PM UTC
This incident has been resolved.
Delayed data in APM
Timeline · 4 updates
- investigating Feb 14, 2025, 12:17 PM UTC
We are currently investigating issues regarding delayed data in APM Traces.
- identified Feb 14, 2025, 12:30 PM UTC
The issue has been identified and a fix is being implemented.
- monitoring Feb 14, 2025, 01:11 PM UTC
A fix has been implemented and we are monitoring the results.
- resolved Feb 14, 2025, 01:12 PM UTC
This incident has been resolved.
Delayed APM data ingestion
Timeline · 3 updates
- investigating Nov 26, 2024, 07:13 PM UTC
We are investigating increased ingestion latency of APM data.
- monitoring Nov 26, 2024, 07:36 PM UTC
A fix has been implemented and we are monitoring the results.
- resolved Nov 26, 2024, 07:44 PM UTC
This incident has been resolved.
Delayed Notifications
Timeline · 3 updates
- monitoring Oct 21, 2024, 12:17 PM UTC
We are investigating delays in notifications, which began at 12pm UTC.
- monitoring Oct 21, 2024, 12:29 PM UTC
We are continuing to monitor for any further issues.
- resolved Oct 21, 2024, 12:33 PM UTC
This incident has been resolved.
Metrics Monitors are delayed in US5
Timeline · 2 updates
- investigating Oct 15, 2024, 12:35 AM UTC
We are investigating delays in Monitors Notifications, which began at 00:00 UTC.
- resolved Oct 15, 2024, 01:00 AM UTC
This incident has been resolved.