postmortem Mar 25, 2026, 05:09 PM UTC
**Impact Start Time \(UTC\)** - 03/16/2026 12:55 PM UTC **Impact End Time \(UTC\)** - 03/16/2026 01:44 PM UTC ## **Incident Summary** Updated on 03/24/2025 - On 03/16/2026, some NiCE CXone Mpower customers experienced degraded performance and intermittent errors within the CXone Mpower Expert application. The impact was caused by elevated system utilization resulting from increased traffic load, which placed excessive demand on the high-availability load-balancing layer. This led to increased response times and reduced request processing efficiency. The impact was resolved when engineers increased the pod capacity of both the web application tier and the load balancer later, thereby restoring system throughput, stabilizing response times, and returning the platform to normal operating conditions. ## **Root Cause** The impact was caused by elevated system utilization resulting from increased traffic load, which placed excessive demand on the high-availability load-balancing layer. This led to increased response times and reduced request processing efficiency. Further investigation determined that the performance degradation was caused by a combination of constrained application processing capacity and insufficient load-balancing scale under peak traffic conditions. Web worker throughput was limited due to a known performance constraint, which reduced the rate at which incoming requests could be processed. This condition was further amplified by the load balancing layer operating below its expected production capacity, resulting in suboptimal traffic distribution and increased system latency during periods of elevated demand. Engineering teams are implementing targeted preventive actions to improve auto-scaling responsiveness and ensure the timely promotion of critical performance fixes to production. ## **Corrective Actions** **Detection:** Internal teams received alerts indicating increased of high availability load balancer response times. Shortly after, customers reported degraded performance and intermittent errors within the CXone Mpower Expert application. **Remediation:** The impact was resolved when engineers increased the pod capacity of both the web application tier and the load balancer later, thereby restoring system throughput, stabilizing response times, and returning the platform to normal operating conditions. Completed on 03/16/2026. **Prevention:** Engineers will adjust the auto-scaling policies to improve responsiveness and enable more aggressive scale-out behavior during peak traffic periods, ensuring capacity aligns with demand in near real time. An update will be provided by End of Day \(EOD\) MT on 03/26/2026. Engineers will implement auto-scaling for the high-availability load-balancing layer to ensure traffic distribution capacity scales proportionally with application workload. An update will be provided by EOD MT on 03/26/2026. ## **Incident Timeline \(UTC\)** 03/16/2026 12:55 PM \(UTC\) - First customer case opened, and Tech Support \(TS\) engineers began the troubleshooting investigation. 03/16/2026 01:07 PM \(UTC\) - TS engineers notified the Network Operations Center \(NOC\) engineers about the reported customer impact; a major incident was proposed and confirmed. 03/16/2026 01:44 PM \(UTC\) - Impact was resolved after engineers made an increase in available resources and internal tests were successful. Impact and major incident resolved.