Hevo incident

Degraded performance across US cluster

Minor Resolved View vendor source →

Hevo experienced a minor incident on March 7, 2025 affecting Data Pipelines, lasting 2h 15m. The incident has been resolved; the full update timeline is below.

Started
Mar 07, 2025, 02:35 PM UTC
Resolved
Mar 07, 2025, 04:51 PM UTC
Duration
2h 15m
Detected by Pingoru
Mar 07, 2025, 02:35 PM UTC

Affected components

Data Pipelines

Update timeline

  1. investigating Mar 07, 2025, 02:35 PM UTC

    We are currently investigating the issue.

  2. identified Mar 07, 2025, 02:57 PM UTC

    The issue has been identified, and a fix is currently being implemented. During this process, the US cluster will remain in read-only mode for a period of time.

  3. monitoring Mar 07, 2025, 03:24 PM UTC

    The US cluster has been moved out of read-only mode, and we are monitoring the changes.

  4. resolved Mar 07, 2025, 04:51 PM UTC

    Thank you for your patience while we worked on this issue. We encountered an incident where tasks for all entities—Models, Pipeline Ingestion tasks, and Destination Load tasks—were not executing as expected. This incident has come up due to a recent outage on the US cluster. We have now resolved the issue. As a result, you may have noticed delays in ingestion/loading tasks or missed model run triggers. Moving forward, all scheduled tasks will run as expected and will be automatically recovered in the next schedule. Our development team has identified and implemented a fix for this issue. We truly appreciate your patience and understanding while we worked on resolving this. We sincerely apologise for any inconvenience this may have caused. Please note that if there are any pipelines/models where the schedule interval is high (about 3 hours or more) and you need the data on an urgent basis, please trigger these entities manually to make sure they run as expected.