StrongDM incident

SDM Outage (US)

Notice Resolved

StrongDM experienced a notice-level incident on December 4, 2024, affecting the Admin UI and API and lasting 2h 5m. The incident has been resolved; the full update timeline is below.

Started
Dec 04, 2024, 01:03 PM UTC
Resolved
Dec 04, 2024, 03:08 PM UTC
Duration
2h 5m
Detected by Pingoru
Dec 04, 2024, 01:03 PM UTC

Affected components

Admin UI, API

Update timeline

  1. identified Dec 04, 2024, 01:03 PM UTC

    The outage was caused by a change to our production database. Engineering has identified the cause and is working to remediate the issue. Updates to follow.

  2. monitoring Dec 04, 2024, 01:49 PM UTC

    A fix has been implemented and we are monitoring the results. The issue should now be resolved. If you are still seeing issues, please reach out to support: https://help.strongdm.com/hc/en-us/requests/new

  3. monitoring Dec 04, 2024, 01:56 PM UTC

    This issue does not appear to have impacted customers using our EU and UK Control Planes. This outage impacted the US Control Plane only.

  4. resolved Dec 04, 2024, 03:08 PM UTC

    This incident was resolved at 13:49 UTC. An RCA will be posted here within a week.

  5. postmortem Dec 20, 2024, 05:20 PM UTC

    On December 3rd, SDM released a server build that added a new index to the table tracking latency between nodes. The build succeeded on the EU and UK Control Planes but failed on the US Control Plane. Retrying the build on the US Control Plane left the index in an invalid state, which our migration tools did not detect. The following day, December 4th, SDM released a server build that relied on the new index. Without the index, node-to-node latency was no longer stored, and approximately three minutes later the routing system stopped attempting multi-hop routes. The issue was limited to the US Control Plane because the index had been created successfully on the EU and UK Control Planes. StrongDM has already taken steps in its internal processes to prevent this kind of issue from recurring. Thank you for your patience and understanding.
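The failure mode described in the postmortem (an index left invalid after a retried migration, unnoticed until a later build depended on it) is the kind of thing a pre-deploy check can catch. The sketch below is illustrative only and is not StrongDM's tooling. It assumes a PostgreSQL database, which the postmortem does not state, and the names `INDEX_STATE_SQL`, `IndexState`, `find_unusable_indexes`, and `node_latency_idx` are hypothetical.

```python
from typing import Iterable, List, NamedTuple

# Assumption: PostgreSQL. The source does not name the database engine.
# pg_index.indisvalid is false for an index whose build did not complete.
INDEX_STATE_SQL = """
SELECT c.relname AS index_name, i.indisvalid AS is_valid
FROM pg_index i
JOIN pg_class c ON c.oid = i.indexrelid
"""


class IndexState(NamedTuple):
    """One row of the catalog query above (hypothetical helper type)."""
    name: str
    is_valid: bool


def find_unusable_indexes(
    required: Iterable[str], observed: Iterable[IndexState]
) -> List[str]:
    """Return the required index names that are missing or marked invalid."""
    valid = {state.name for state in observed if state.is_valid}
    return sorted(set(required) - valid)


# Example: the index exists but is invalid, so the check reports it.
rows = [IndexState("node_latency_idx", False)]
problems = find_unusable_indexes(["node_latency_idx"], rows)
```

A deploy gate could run such a check per Control Plane and refuse to roll out a build that depends on an index while `problems` is non-empty. Keeping the comparison in a pure function means it can be tested without a live database.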