StarRez incident

Service Distruption - Core Applications - US-Central

Critical Resolved View vendor source →

StarRez experienced a critical incident on January 18, 2024 affecting Central US, lasting 3h 4m. The incident has been resolved; the full update timeline is below.

Started
Jan 18, 2024, 09:52 PM UTC
Resolved
Jan 19, 2024, 12:57 AM UTC
Duration
3h 4m
Detected by Pingoru
Jan 18, 2024, 09:52 PM UTC

Affected components

Central US

Update timeline

  1. investigating Jan 18, 2024, 09:52 PM UTC

    Customers within the US-Central region are experiencing a service disruptions with core StarRez applications . -Engineers are actively reviewing this issue with our upstream provider. -Next update expected within the next 3 hours, or as warranted by a change of events.

  2. monitoring Jan 18, 2024, 10:11 PM UTC

    Engineer's have identified an issue with the underlying hosts. We have migrated services to new hosts and all sites have recovered.

  3. monitoring Jan 18, 2024, 10:11 PM UTC

    We are continuing to monitor for any further issues.

  4. resolved Jan 19, 2024, 12:57 AM UTC

    This incident has been resolved.

  5. postmortem Jan 23, 2024, 04:53 PM UTC

    **US Central Outage - 18th of January 2024** At Jan 18 9:43 pm UTC, Blizzard conditions in the US Central region caused our upstream provider data-center to experience a power disruption. This impacted the regions control plane hosts resulting in required manual intervention to mitigate. **Resolution** StarRez initiated fallback procedures which resulted in customer applications coming back online at 10:08UTC. There was a small subset of customers whose SQL connectivity was impacted until 11:37 UTC. **Additional Information** Our upstream providers are taking steps to improve resiliency against power disruptions in the future.