Astronomer incident
Some clusters are unable to start new KPO tasks
Astronomer experienced a critical incident on July 3, 2025 affecting Scheduling and Running DAGs and Tasks, lasting 2h 51m. The incident has been resolved; the full update timeline is below.
Affected components
Scheduling and Running DAGs and Tasks
Update timeline
- investigating Jul 03, 2025, 11:19 PM UTC
Some clusters that were updated today will fail to run any KPO tasks.
- identified Jul 03, 2025, 11:52 PM UTC
The issue has been identified and a fix is being implemented.
- monitoring Jul 04, 2025, 01:06 AM UTC
A fix has been implemented. We are currently monitoring the results.
- resolved Jul 04, 2025, 02:10 AM UTC
This incident has been resolved.