Cloudera incident

CDW - Potential Degrade Performance

Notice Resolved View vendor source →

Cloudera experienced a notice incident on March 1, 2024 affecting Cloudera Data Warehouse, lasting 46m. The incident has been resolved; the full update timeline is below.

Started
Mar 01, 2024, 08:52 PM UTC
Resolved
Mar 01, 2024, 09:38 PM UTC
Duration
46m
Detected by Pingoru
Mar 01, 2024, 08:52 PM UTC

Affected components

Cloudera Data Warehouse

Update timeline

  1. identified Mar 01, 2024, 08:52 PM UTC

    We noted degraded application performance and are mitigating the issue right now .

  2. monitoring Mar 01, 2024, 09:18 PM UTC

    A fix has been implemented and we are monitoring the results.

  3. monitoring Mar 01, 2024, 09:32 PM UTC

    We are continuing to monitor for any further issues.

  4. resolved Mar 01, 2024, 09:38 PM UTC

    This incident has been resolved.

  5. postmortem Mar 27, 2024, 06:17 PM UTC

    Our internal monitoring systems identified a decline in CDW control plane performance. Upon further investigation, we determined that a recent improvement to event handling resulted in an unexpected increase in database load. This issue has been rectified by successfully scaling up the database capacity. It is important to note that this behavior was not observed during our internal testing due to the broader variety of cluster configurations encountered in a production environment. Additionally, there was no impact on overall workloads.