Zid incident

Partial Outage on stores - Hardware issue with our gateway affecting stores partially

Critical Resolved View vendor source →

Zid experienced a critical incident on April 6, 2024, lasting —. The incident has been resolved; the full update timeline is below.

Started
Apr 06, 2024, 03:30 AM UTC
Resolved
Apr 06, 2024, 03:30 AM UTC
Duration
Detected by Pingoru
Apr 06, 2024, 03:30 AM UTC

Update timeline

  1. resolved Apr 07, 2024, 10:41 AM UTC

    Partial outage for some stores from 5:24:03 on 6th April 2024 till 5:31:06 the same day. Total partial downtime 7 minutes and 3 seconds.

  2. postmortem Apr 07, 2024, 10:42 AM UTC

    **What happened?** We suffered from a similar issue as the day before. This issue has been a hardware issue that affected our auto-scaling \(done when traffic increases\). ‌ **How long it lasted for?** 7 minutes and 3 seconds ‌ **What have we done to ensure it does not happen again?** We have rebuild the hardware and re-build the cluster in which our gateway runs. This will ensure that there is a brand new platform supporting it.