Camunda incident

Zeebe and Elasticsearch outage for us-central1, europe-west1 and europe-west2 GCP regions

Major Resolved View vendor source →

Camunda experienced a major incident on May 20, 2025 affecting Operate and Optimize and 1 more component, lasting 6h 5m. The incident has been resolved; the full update timeline is below.

Started
May 20, 2025, 06:42 AM UTC
Resolved
May 20, 2025, 12:48 PM UTC
Duration
6h 5m
Detected by Pingoru
May 20, 2025, 06:42 AM UTC

Affected components

OperateOptimizeTasklistZeebe

Update timeline

  1. identified May 20, 2025, 05:10 AM UTC

    We are currently investigating Zeebe outage for some clusters. Our team is working to identify the cause and implement a solution. Affected Regions: Iowa, North America (us-central1) & London, Europe (europe-west2)

  2. identified May 20, 2025, 06:02 AM UTC

    We are currently investigating the problem with our cloud provider support engineers.

  3. identified May 20, 2025, 06:42 AM UTC

    We are continuing our investigation of Zeebe and Elasticsearch outage for some clusters. In some cases, disks cannot be mounted. This can lead to situations, where either Zeebe processing, or data imported into WebApps (Operate, Tasklist, Optimize), or both, are impacted. Affected Regions: Iowa, North America (us-central1), Belgium, Europe (europe-west1), London, Europe (europe-west2), South Carolina, North America (us-east1), Toronto, North America (northamerica-northeast2), Toronto, North America (northamerica-northeast2), Sydney, Australia (australia-southeast1)

  4. identified May 20, 2025, 08:24 AM UTC

    We are working closely with our cloud infrastructure provider to identify the cause of the issue. In some cases, disks cannot be mounted. This can lead to situations where either Zeebe processing, data imported into WebApps (Operate, Tasklist, Optimize), or both are impacted. Connectors might also be impacted due to transitivity with other affected components. Affected Regions: Iowa, North America (us-central1), Belgium, Europe (europe-west1), London, Europe (europe-west2), South Carolina, North America (us-east1), Toronto, North America (northamerica-northeast2), Toronto, North America (northamerica-northeast2), Sydney, Australia (australia-southeast1)

  5. identified May 20, 2025, 09:38 AM UTC

    Our teams continue working closely with our cloud infrastructure provider to identify the cause of the issue and implement a solution

  6. identified May 20, 2025, 10:35 AM UTC

    Our teams are actively implementing mitigations while continuing to investigate the root cause in collaboration with our cloud infrastructure provider.

  7. identified May 20, 2025, 11:31 AM UTC

    We are observing improvements across all regions and we continue to closely monitor the situation to ensure full stability.

  8. resolved May 20, 2025, 12:48 PM UTC

    The issue causing unavailability of disks in some GCP regions has been resolved by our cloud infrastructure provider. All Camunda Services have fully recovered. We apologize for any inconvenience caused and thank you for your patience.