Jenzabar incident

VMware Host Failure in East Coast Datacenter

Notice Resolved

Jenzabar reported a notice-level incident on October 14, 2024, affecting North America East and lasting 3h 59m. The incident has been resolved; the full update timeline is below.

Started
Oct 14, 2024, 03:31 PM UTC
Resolved
Oct 14, 2024, 07:31 PM UTC
Duration
3h 59m
Detected by Pingoru
Oct 14, 2024, 03:31 PM UTC

Affected components

North America East

Update timeline

  1. identified Oct 14, 2024, 03:31 PM UTC

    At approximately 10:20 AM EST this morning, one of the VMware hosts failed due to a memory module (DIMM) failure. All virtual servers running on that host would have been powered off automatically as soon as the host failed, then powered back on once transferred to a different host within the cluster. Total downtime would have been approximately 1-3 minutes. All systems should now be functioning normally, and the failed host will be repaired and brought back into production later today.

  2. identified Oct 14, 2024, 06:40 PM UTC

    IBM engineers are engaged to repair the failed memory module on the offline host. All systems continue to function normally at this time.

  3. resolved Oct 14, 2024, 07:31 PM UTC

    The failed memory modules have been replaced, and the host is fully booted and again hosting virtual workloads. The issue is now fully resolved.