ImageKit incident

Error in serving requests in Mumbai region

Critical, Resolved

ImageKit experienced a critical incident on March 15, 2023, affecting the Image Transformation - MUM Region component and lasting 3h 53m. The incident has been resolved; the full update timeline is below.

Started: Mar 15, 2023, 04:15 AM UTC
Resolved: Mar 15, 2023, 08:09 AM UTC
Duration: 3h 53m
Detected by Pingoru: Mar 15, 2023, 04:15 AM UTC

Affected components

Image Transformation - MUM Region

Update timeline

  1. investigating Mar 15, 2023, 06:22 AM UTC

    We are currently investigating this issue.

  2. monitoring Mar 15, 2023, 06:27 AM UTC

    A fix has been implemented and we are monitoring the results.

  3. monitoring Mar 15, 2023, 06:30 AM UTC

    We are continuing to monitor for any further issues.

  4. resolved Mar 15, 2023, 08:09 AM UTC

    This incident has been resolved.

  5. postmortem Mar 15, 2023, 09:24 AM UTC

    Earlier today, we ran into issues scaling our systems in the Mumbai region, primarily because of an edge case in AWS's instance allocation. This caused a high error rate for customers using the Mumbai processing region on two occasions: for about 8 minutes around 09:45 AM IST (04:15 AM UTC) and for about 16 minutes around 11:50 AM IST (06:20 AM UTC). After identifying the issue, our team made changes to work around this limitation and restore functionality in the region. The service is now fully functional, with no degradation in response times. Our team is also working on a long-term fix so that the corner case encountered today does not impact services in the future. We remain committed to keeping uptime as close to 100% as possible.
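
The postmortem reports its error windows in IST, while the rest of this page uses UTC. A minimal Python sketch of the conversion follows, assuming both windows fall on Mar 15, 2023; the helper name `ist_to_utc` is illustrative and not from the source.

```python
from datetime import datetime, timedelta, timezone

# IST is a fixed UTC+05:30 offset with no daylight saving time.
IST = timezone(timedelta(hours=5, minutes=30), name="IST")


def ist_to_utc(hour: int, minute: int) -> datetime:
    """Treat a wall-clock time on Mar 15, 2023 as IST and return it in UTC.

    The date is an assumption taken from the incident header.
    """
    return datetime(2023, 3, 15, hour, minute, tzinfo=IST).astimezone(timezone.utc)


first_window = ist_to_utc(9, 45)    # "around 09:45AM IST" in the postmortem
second_window = ist_to_utc(11, 50)  # "around 11:50AM IST" in the postmortem

print(first_window.strftime("%I:%M %p UTC"))   # 04:15 AM UTC
print(second_window.strftime("%I:%M %p UTC"))  # 06:20 AM UTC
```

Under these assumptions, the first window lines up with the recorded start time (04:15 AM UTC) and the second falls about two minutes before the first status update (06:22 AM UTC).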