Wasabi incident

System Errors in US-EAST-1 & US-EAST-2 Regions

Minor Resolved View vendor source →

Wasabi experienced a minor incident on December 7, 2024 affecting US-East-1 (N. Virginia) and US-East-2 (N. Virginia), lasting 8h 14m. The incident has been resolved; the full update timeline is below.

Started
Dec 07, 2024, 04:14 PM UTC
Resolved
Dec 08, 2024, 12:28 AM UTC
Duration
8h 14m
Detected by Pingoru
Dec 07, 2024, 04:14 PM UTC

Affected components

US-East-1 (N. Virginia)US-East-2 (N. Virginia)

Update timeline

  1. investigating Dec 07, 2024, 04:14 PM UTC

    We are currently investigating an increase in 500 level HTTP responses on customer traffic to the us-east-1 and us-east-2 regions.

  2. investigating Dec 07, 2024, 05:14 PM UTC

    We are continuing to investigate the system errors in the us-east-1 and us-east-2 regions. We will update this page as we have more information.

  3. identified Dec 07, 2024, 06:25 PM UTC

    We have identified a power issue at the data center which is in the process of being restored. We will update this page as we have more information.

  4. identified Dec 07, 2024, 07:24 PM UTC

    We are continuing the process of restoring power and bringing systems back up. We will update this page as we have more information.

  5. identified Dec 07, 2024, 07:59 PM UTC

    Power has been fully restored to the us-east-2 region. We are continuing the process of restoration to the us-east-1 region and will update this page as we have more information.

  6. identified Dec 07, 2024, 09:29 PM UTC

    Power has been fully restored to the us-east-1 region and we are continuing to work on bringing all systems back online. We will update this page as we have more information.

  7. monitoring Dec 07, 2024, 10:17 PM UTC

    All systems are now back online and fully operational. We are continuing to monitor the regions and will update this page as we have more information.

  8. resolved Dec 08, 2024, 12:28 AM UTC

    Services in both regions have been restored. Please reach out to [email protected] if you see any issues related to this incident.

  9. postmortem Dec 16, 2024, 04:47 PM UTC

    On 7 December 2024 from 15:22 UTC to 21:30 UTC, Wasabi experienced a loss of power event in our US-EAST-1 and US-EAST-2 data centers. At 15:22, our Operations Team noticed that infrastructure within the US-EAST-1 and US-EAST-2 regions failed to respond to standard smoke tests and monitoring tools, and reviewing the activity for the regions indicated a full loss of power to all server racks and infrastructure within the building. At 16:00 UTC, Wasabi received confirmation from Iron Mountain that power loss for the entire building has occurred. By 16:15 UTC, Iron Mountain Operations begins to restore power to an incremental number of racks for Wasabi’s infrastructure, allowing our Operations Team to run systematic health checks across all server nodes, and by 21:30 UTC we have confirmation that all systems are running optimally and both US-EAST-1 and US-EAST-2 regions were fully operational.