Tuist incident

[Resolved] tuist.dev not reachable

Tuist is currently experiencing a major incident, which began 1h ago. The vendor's full update timeline is below.

Started
Jun 14, 2026, 01:59 PM UTC
Resolved
Ongoing
Duration
● 1h 16m
Detected by Pingoru
Jun 14, 2026, 01:59 PM UTC

Update timeline

  1. investigating Jun 14, 2026, 01:59 PM UTC

    Production tuist.dev returned 503 because all main server pods were crash-looping during startup. The immediate failure was a timeout in license validation against Keygen, but Keygen itself was healthy. The real issue was the production stable egress gateway. Server pod traffic is routed through a Cilium egress gateway using the fixed IP 116.202.0.10. After node churn, the gateway node label and Hetzner floating IP were not attached to any active node, so selected server traffic could not reach the public internet. We restored service by assigning the floating IP to a live general worker, labeling that node as the stable egress gateway, forcing Cilium to refresh the policy, and restarting the server deployment. tuist.dev is now healthy again. To prevent this from recurring, we are working on making the stable egress setup declarative/self-healing instead of relying on a manual node handoff, and adding monitoring for the gateway readiness and server egress path.