Platform9 incident

Connectivity issues reaching the Platform9 control plane

Critical Resolved View vendor source →

Platform9 experienced a critical incident on August 16, 2018 affecting Platform9 Cloud, lasting 5h 15m. The incident has been resolved; the full update timeline is below.

Started
Aug 16, 2018, 11:20 PM UTC
Resolved
Aug 17, 2018, 04:36 AM UTC
Duration
5h 15m
Detected by Pingoru
Aug 16, 2018, 11:20 PM UTC

Affected components

Platform9 Cloud

Update timeline

  1. investigating Aug 16, 2018, 11:27 PM UTC

    We are investigating network issues impacting access to the control plane for several customers.

  2. investigating Aug 17, 2018, 12:06 AM UTC

    We are continuing to investigate the network outage. We appreciate your patience and apologize for the inconvenience.

  3. identified Aug 17, 2018, 12:36 AM UTC

    We have replaced a network device and are working to recover customer environments.

  4. monitoring Aug 17, 2018, 01:31 AM UTC

    Most customer environments should be back online now. We apologize again for the inconvenience

  5. monitoring Aug 17, 2018, 02:44 AM UTC

    We are continuing to monitor for any further issues.

  6. monitoring Aug 17, 2018, 02:44 AM UTC

    We are still seeing some systems being impacted by the outage and have not recovered. We are working on recovering these environments.

  7. monitoring Aug 17, 2018, 02:44 AM UTC

    We are continuing to monitor for any further issues.

  8. resolved Aug 17, 2018, 04:36 AM UTC

    All systems are back online. We will provide a root cause as soon as we have it.