Datadog Govcloud incident

Delayed Cloud Provider Metrics

Critical Resolved View vendor source →

Datadog Govcloud experienced a critical incident on July 31, 2022 affecting Cloud Integrations and Monitors, lasting 1h 4m. The incident has been resolved; the full update timeline is below.

Started
Jul 31, 2022, 03:31 PM UTC
Resolved
Jul 31, 2022, 04:36 PM UTC
Duration
1h 4m
Detected by Pingoru
Jul 31, 2022, 03:31 PM UTC

Affected components

Cloud IntegrationsMonitors

Update timeline

  1. identified Jul 31, 2022, 03:31 PM UTC

    We are investigating increased latency processing all Cloud Provider metrics. We have identified the issue and are working on a fix. As a result of this issue, some users may see delays or gaps in graphs that contain these metrics. To prevent spurious alerts, we have temporarily disabled monitors based on this data.

  2. identified Jul 31, 2022, 03:52 PM UTC

    We are continuing to work on a fix for this issue.

  3. monitoring Jul 31, 2022, 04:30 PM UTC

    A fix has been implemented and we are monitoring the results.

  4. resolved Jul 31, 2022, 04:36 PM UTC

    This incident has been resolved.