Datadog US3 incident

Delayed monitor evaluation and query failures across multiple products

Critical, Resolved

Datadog US3 experienced a critical incident on January 24, 2024, affecting APM, CI Visibility, Cloud Network Monitoring, Log Management, RUM, and Synthetics. It lasted 32 minutes. The incident has been resolved; the full update timeline is below.

Started: Jan 24, 2024, 03:13 PM UTC
Resolved: Jan 24, 2024, 03:45 PM UTC
Duration: 32m
Detected by Pingoru: Jan 24, 2024, 03:13 PM UTC

Affected components

APM, CI Visibility, Cloud Network Monitoring, Log Management, RUM, Synthetics

Update timeline

  1. investigating Jan 24, 2024, 03:13 PM UTC

    We are currently investigating this issue.

  2. monitoring Jan 24, 2024, 03:26 PM UTC

    We experienced an increased rate of query failures across multiple products, including Log Management, APM, RUM, Synthetics, CI Visibility, Error Tracking, Audit Logs, Database Monitoring, and NPM. This resulted in delayed monitor evaluation and notification for a subset of monitors. We are monitoring recovery. Metric monitors and metric queries are unaffected by this incident.

  3. monitoring Jan 24, 2024, 03:28 PM UTC

    We are continuing to monitor for any further issues.

  4. resolved Jan 24, 2024, 03:45 PM UTC

    This incident has been resolved.