Datadog EU Outage History

Datadog EU is up right now

Datadog EU had 32 outages in the last 2 years totaling 5h 58m of downtime — averaging 1.3 incidents per month.

There were 32 Datadog EU outages since June 4, 2024 totaling 5h 58m of downtime. Each is summarised below — incident details, duration, and resolution information.

Source: https://status.datadoghq.eu

Notice September 17, 2024

Delayed Processing for a Subset of Distribution Metrics

Detected by Pingoru
Sep 17, 2024, 10:03 PM UTC
Resolved
Sep 17, 2024, 11:35 PM UTC
Duration
1h 32m
Affected: Metrics and Infra Monitoring
Timeline · 4 updates
  1. identified Sep 17, 2024, 10:03 PM UTC

    We are investigating increased latency processing for a subset of Distribution Metrics. To prevent spurious alerts, we have temporarily disabled distribution monitors based on this data.

  2. monitoring Sep 17, 2024, 10:33 PM UTC

    We have deployed a fix and we are monitoring the results. We will provide another update once the issue is fully resolved.

  3. monitoring Sep 17, 2024, 11:05 PM UTC

    We are continuing to monitor for any further issues.

  4. resolved Sep 17, 2024, 11:35 PM UTC

    This incident has been resolved and the monitors have been re-enabled at this time.

Read the full incident report →

Minor August 30, 2024

Web UI features maybe hidden

Detected by Pingoru
Aug 30, 2024, 02:59 PM UTC
Resolved
Aug 30, 2024, 03:21 PM UTC
Duration
21m
Affected: Web Application
Timeline · 4 updates
  1. investigating Aug 30, 2024, 02:59 PM UTC

    We are currently investigating an issue, that is causing certain features to be hidden from our UI. There is no data loss or monitoring impact.

  2. identified Aug 30, 2024, 03:09 PM UTC

    The issue has been identified and a fix is being implemented.

  3. monitoring Aug 30, 2024, 03:13 PM UTC

    A fix has been implemented and we are monitoring the results.

  4. resolved Aug 30, 2024, 03:21 PM UTC

    This incident has been resolved. Please refresh your Datadog web page to resolve the issue completely.

Read the full incident report →

Major August 27, 2024

Delayed logs for a subset of customers

Detected by Pingoru
Aug 27, 2024, 04:51 PM UTC
Resolved
Aug 27, 2024, 06:22 PM UTC
Duration
1h 30m
Affected: Log ManagementMonitors
Timeline · 4 updates
  1. investigating Aug 27, 2024, 04:51 PM UTC

    We are investigating increased latency processing logs for a subset of customers. As a result of this issue, some users may see delayed logs processing, and associated logs monitor are currently not evaluated to avoid false positives.

  2. investigating Aug 27, 2024, 05:34 PM UTC

    We are continuing to investigate the issue.

  3. monitoring Aug 27, 2024, 06:00 PM UTC

    A fix has been implemented and we are monitoring the results.

  4. resolved Aug 27, 2024, 06:22 PM UTC

    This incident has been resolved.

Read the full incident report →

Major August 8, 2024

CI Visibility - Page Load issue

Detected by Pingoru
Aug 08, 2024, 04:00 PM UTC
Resolved
Aug 08, 2024, 04:13 PM UTC
Duration
13m
Affected: CI Visibility
Timeline · 3 updates
  1. investigating Aug 08, 2024, 04:00 PM UTC

    We have identified an issue that prevents some Software Delivery pages from loading. Also, Intelligent Test Runner, Quality Gates, GitHub PR comments and Static Analysis uploads are affected. The team is working on a fix.

  2. monitoring Aug 08, 2024, 04:05 PM UTC

    A fix has been implemented and we are monitoring the results.

  3. resolved Aug 08, 2024, 04:13 PM UTC

    This incident has been resolved.

Read the full incident report →

Major July 17, 2024

Delayed Metrics Monitors Notifications

Detected by Pingoru
Jul 17, 2024, 05:06 PM UTC
Resolved
Jul 17, 2024, 05:55 PM UTC
Duration
49m
Affected: Monitors
Timeline · 5 updates
  1. investigating Jul 17, 2024, 05:06 PM UTC

    We are investigating delays in metrics based Monitors Notifications, which began at 5:40pm UTC.

  2. identified Jul 17, 2024, 05:10 PM UTC

    The issue has been identified and a fix is being implemented.

  3. identified Jul 17, 2024, 05:19 PM UTC

    We are continuing to work on a fix for this issue.

  4. monitoring Jul 17, 2024, 05:38 PM UTC

    A fix has been implemented and we are monitoring the results.

  5. resolved Jul 17, 2024, 05:55 PM UTC

    This incident has been resolved.

Read the full incident report →

Major July 3, 2024

We are investigating user login issues with the web application

Detected by Pingoru
Jul 03, 2024, 02:49 PM UTC
Resolved
Jul 03, 2024, 04:01 PM UTC
Duration
1h 12m
Affected: Web Application
Timeline · 4 updates
  1. investigating Jul 03, 2024, 02:49 PM UTC

    We are investigating user login issues with the web application login by email. Please note that data processing and alerts are not affected by this incident.

  2. identified Jul 03, 2024, 03:25 PM UTC

    We have identified the underlying issue and are working on a fix.

  3. monitoring Jul 03, 2024, 03:50 PM UTC

    We have deployed a fix and we are monitoring the results. We will provide another update once the issue is fully resolved.

  4. resolved Jul 03, 2024, 04:01 PM UTC

    This incident has been resolved.

Read the full incident report →

Major June 4, 2024

Delays on multiple products

Detected by Pingoru
Jun 04, 2024, 08:57 AM UTC
Resolved
Jun 04, 2024, 09:15 AM UTC
Duration
18m
Affected: APMLog ManagementRUMSynthetics
Timeline · 4 updates
  1. investigating Jun 04, 2024, 08:57 AM UTC

    We are investigating increased latency processing Events. As a result of this issue, some users may see delays or gaps in the event stream or for event queries on dashboards. To prevent spurious alerts, we have temporarily disabled monitors based on this data.

  2. monitoring Jun 04, 2024, 09:12 AM UTC

    We've implemented a fix, and we are monitoring the result. This incident caused delays in several products, including Logs, APM Traces, Real User Monitoring, Synthetics Test Results, and Audit Trail.

  3. monitoring Jun 04, 2024, 09:14 AM UTC

    We are continuing to monitor for any further issues.

  4. resolved Jun 04, 2024, 09:15 AM UTC

    This incident has been resolved. This incident did not affect Metrics or Infrastructure Monitoring. Between 08:25 UTC and 09:15 UTC, we experienced elevated query errors and delays ingesting data for Logs, as well as more minor impact for several other products including: APM Traces, Real User Monitoring, Synthetics, and Audit Trail. All data has since been processed, and systems are operating in real-time as normal.

Read the full incident report →