Astronomer incident

Tasks unable to run due to worker issues

Major Resolved View vendor source →

Astronomer experienced a major incident on November 17, 2025 affecting Scheduling and Running DAGs and Tasks and Scheduling and Running DAGs and Tasks, lasting 6h 19m. The incident has been resolved; the full update timeline is below.

Started
Nov 17, 2025, 06:13 PM UTC
Resolved
Nov 18, 2025, 12:33 AM UTC
Duration
6h 19m
Detected by Pingoru
Nov 17, 2025, 06:13 PM UTC

Affected components

Scheduling and Running DAGs and TasksScheduling and Running DAGs and Tasks

Update timeline

  1. investigating Nov 17, 2025, 06:13 PM UTC

    We have found workers failing to heartbeat on several Airflow 3 deployments. Due to this tasks are unable run and eventually fail. This issue might be due to an update for the Astro Agent plugin that was rolled out today. We are currently investigating this.

  2. monitoring Nov 17, 2025, 06:42 PM UTC

    We have reverted the update that was rolled out. We are seeing worker pods in affected deployments running tasks now. We will continue monitoring this incident.

  3. resolved Nov 18, 2025, 12:33 AM UTC

    This incident has been resolved.