Astronomer experienced a major incident on November 17, 2025 affecting Scheduling and Running DAGs and Tasks, lasting 6h 19m. The incident has been resolved; the full update timeline is below.
Affected components
Scheduling and Running DAGs and Tasks
Update timeline
- investigating Nov 17, 2025, 06:13 PM UTC
We have found workers failing to heartbeat on several Airflow 3 deployments. As a result, tasks are unable to run and eventually fail. This issue might be due to an update to the Astro Agent plugin that was rolled out today. We are currently investigating.
- monitoring Nov 17, 2025, 06:42 PM UTC
We have reverted the update that was rolled out. We are seeing worker pods in affected deployments running tasks now. We will continue monitoring this incident.
- resolved Nov 18, 2025, 12:33 AM UTC
This incident has been resolved.