Datadog Govcloud incident

Delayed Metrics

Major Resolved View vendor source →

Datadog Govcloud experienced a major incident on July 24, 2023 affecting Metrics and Infra Monitoring and Monitors, lasting 1h 34m. The incident has been resolved; the full update timeline is below.

Started
Jul 24, 2023, 09:10 PM UTC
Resolved
Jul 24, 2023, 10:44 PM UTC
Duration
1h 34m
Detected by Pingoru
Jul 24, 2023, 09:10 PM UTC

Affected components

Metrics and Infra MonitoringMonitors

Update timeline

  1. investigating Jul 24, 2023, 09:10 PM UTC

    We are investigating increased latency processing Metrics. As a result of this issue, some users may see delays or gaps for metrics on graphs. To prevent spurious alerts, we have temporarily disabled monitors based on this data.

  2. identified Jul 24, 2023, 09:13 PM UTC

    The issue has been identified and a fix is being implemented.

  3. monitoring Jul 24, 2023, 10:14 PM UTC

    A fix has been implemented and we are monitoring the results.

  4. monitoring Jul 24, 2023, 10:21 PM UTC

    We are continuing to monitor for any further issues.

  5. resolved Jul 24, 2023, 10:44 PM UTC

    This incident has been resolved.