FloQast incident

Lambda Errors and Runtime Errors Experienced Throughout the Application

Major Resolved View vendor source →

FloQast experienced a major incident on May 7, 2026 affecting Platform - EMEA and Platform - APAC and 1 more component, lasting 8h 49m. The incident has been resolved; the full update timeline is below.

Started
May 07, 2026, 02:28 PM UTC
Resolved
May 07, 2026, 11:18 PM UTC
Duration
8h 49m
Detected by Pingoru
May 07, 2026, 02:28 PM UTC

Affected components

Platform - EMEAPlatform - APACPlatform - US

Update timeline

  1. investigating May 07, 2026, 02:28 PM UTC

    We are currently investigating an issue where various functions within the FloQast application are erroring out, showing either a Lambda error or a Runtime error. This is being treated with the highest priority

  2. identified May 07, 2026, 04:01 PM UTC

    The issue has been identified and a fix is being implemented.

  3. identified May 07, 2026, 04:32 PM UTC

    The fix is currently being applied in parallel to all FloQast environments. We expect the full resolution to be in place very soon.

  4. monitoring May 07, 2026, 06:23 PM UTC

    The fix has been initiated, and we are monitoring the results. We are aware of some errors still occurring due to warm lambdas which have not fully finished restarting yet. If a service is used and a lambda that has not been fully restarted yet is invoked, an in-app error may still occur.

  5. resolved May 07, 2026, 11:18 PM UTC

    The fix for this incident has finished deploying. A full RCA will be shared in this posting as soon as it is available.

  6. postmortem May 14, 2026, 10:02 PM UTC

    **Summary:** On May 7, 2026, from approximately 07:15 AM to 05:21 PM PT, customers across all regions experienced failures in automated workflows and email notifications within FloQast, resulting in jobs failing to execute as expected. **Root Cause:** A routine update from our cloud infrastructure provider changed how certain internal network traffic was handled. A prior update within our platform had not been fully applied across all parts of our environment, which created a conflict with the provider's change. Affected workflows failed immediately rather than completing normally. **Immediate Resolution:** The infrastructure update was rolled back to a prior stable configuration, and a process was run to fully refresh the affected workflows across all regions. All three regions were subsequently returned to normal operation. **Follow-up Actions:** We are taking several steps to strengthen our resilience to infrastructure changes of this kind. Going forward, updates from our cloud provider will be validated in a dedicated test environment before reaching production, giving us earlier visibility into any compatibility issues. We are also enhancing our internal monitoring to provide greater coverage across our infrastructure, and we are improving how we receive and act on cloud provider advisories so our engineering team can respond proactively to changes that may affect our environment.