Datadog US3 incident

Elevated Error Rates for Monitors

Major Resolved View vendor source →

Datadog US3 experienced a major incident on April 23, 2024 affecting CI Visibility and Cloud Network Monitoring and 1 more component, lasting 54m. The incident has been resolved; the full update timeline is below.

Started
Apr 23, 2024, 07:24 PM UTC
Resolved
Apr 23, 2024, 08:19 PM UTC
Duration
54m
Detected by Pingoru
Apr 23, 2024, 07:24 PM UTC

Affected components

CI VisibilityCloud Network MonitoringContinuous ProfilerMonitorsNDMRUMSynthetics

Update timeline

  1. investigating Apr 23, 2024, 07:24 PM UTC

    We are actively investigating elevated error rates for Monitors. Metrics monitors are not affected.

  2. investigating Apr 23, 2024, 07:52 PM UTC

    We are actively investigating elevated error rates for Monitors. As a result of this issue, some users may experience issues in addition with CI Visibility, NDM, NPM, Profiling, RUM and Synthetics

  3. identified Apr 23, 2024, 08:00 PM UTC

    The issue has been identified and a fix is being implemented.

  4. monitoring Apr 23, 2024, 08:10 PM UTC

    A fix has been implemented and we are monitoring the results.

  5. monitoring Apr 23, 2024, 08:13 PM UTC

    A fix has been implemented. Monitors have recovered and live data is now available. Backfilling is ongoing for historical data.

  6. resolved Apr 23, 2024, 08:19 PM UTC

    This incident has been resolved.