Exalate incident

Nodes down due to routing incident in core infrastructure

Critical Resolved View vendor source →

Exalate experienced a critical incident on September 22, 2024 affecting Hosting platform, lasting 2d 16h. The incident has been resolved; the full update timeline is below.

Started
Sep 22, 2024, 07:40 PM UTC
Resolved
Sep 25, 2024, 12:09 PM UTC
Duration
2d 16h
Detected by Pingoru
Sep 22, 2024, 07:40 PM UTC

Affected components

Hosting platform

Update timeline

  1. investigating Sep 22, 2024, 07:40 PM UTC

    Due to a routing problem, a subset of nodes (currently about 80%) are unreachable. Root causing is ongoing and status updates will be provided every 2 hours.

  2. identified Sep 22, 2024, 08:11 PM UTC

    A containment has been agreed and is now being implemented.

  3. monitoring Sep 22, 2024, 08:41 PM UTC

    The fix has been implemented, we'll monitoring the environment for an additional day.

  4. investigating Sep 23, 2024, 10:57 AM UTC

    We have identified an issue where some nodes are unreachable due to a routing problem. We have started to investigate the issue and will provide further updates shortly.

  5. identified Sep 23, 2024, 11:35 AM UTC

    A fix is now being implemented.

  6. identified Sep 23, 2024, 02:28 PM UTC

    We are still addressing the issue affecting some of our nodes. As of now, 50% of the affected nodes are back online. Our team is working diligently to restore full functionality as soon as possible.

  7. monitoring Sep 23, 2024, 04:55 PM UTC

    All nodes are now back online. We will continue monitoring and further updates will be provided.

  8. resolved Sep 25, 2024, 12:09 PM UTC

    The issue affecting Exalate Cloud has been resolved. We are currently preparing a post-mortem report, which will be shared early next week. Thank you for your patience and understanding.

  9. postmortem Apr 01, 2026, 04:12 AM UTC

    timeout