GS2 incident

Datadog outage

Major Resolved

GS2 experienced a major incident on March 8, 2023 affecting Datadog US1 Log Management and Datadog US1 Metrics, lasting 22h 50m. The incident has been resolved; the full update timeline is below.

Started
Mar 08, 2023, 07:43 AM UTC
Resolved
Mar 09, 2023, 06:33 AM UTC
Duration
22h 50m
Detected by Pingoru
Mar 08, 2023, 07:43 AM UTC

Affected components

Datadog US1 Log Management, Datadog US1 Metrics

Update timeline

  1. investigating Mar 08, 2023, 07:43 AM UTC

    Due to a Datadog outage, our monitoring capabilities are reduced. The latest updates are available on Datadog's status page: https://status.datadoghq.com/

  2. investigating Mar 08, 2023, 09:55 AM UTC

    The problem is ongoing. We hope Datadog will restore service soon. We appreciate everyone's patience.

  3. investigating Mar 08, 2023, 01:31 PM UTC

    We are continuing to investigate this issue.

  4. investigating Mar 08, 2023, 03:47 PM UTC

    We are continuing to investigate this issue.

  5. investigating Mar 09, 2023, 01:07 AM UTC

    The situation is improving, but we still have less monitoring capacity than normal.

  6. investigating Mar 09, 2023, 02:52 AM UTC

    We are continuing to investigate this issue.

  7. investigating Mar 09, 2023, 04:36 AM UTC

    We are slowly regaining our original monitoring capabilities. Thank you all for your patience.

  8. monitoring Mar 09, 2023, 06:18 AM UTC

    The Datadog status dashboard does not yet show a full recovery, but as far as we can observe, our original monitoring capabilities have been fully restored. We will continue to monitor the situation.

  9. resolved Mar 09, 2023, 06:33 AM UTC

    The Datadog status dashboard is back to normal. We are now truly back to full monitoring capability. Thank you all for your help.