Strigo experienced a major incident on August 11, 2020 affecting Strigo service and AWS EC2 (eu-west-1), lasting 53m. The incident has been resolved; the full update timeline is below.
Affected components
Strigo serviceAWS EC2 (eu-west-1)
Update timeline
- investigating Aug 11, 2020, 07:00 PM UTC
We are currently investigating this issue.
- investigating Aug 11, 2020, 07:00 PM UTC
We are currently investigating.
- monitoring Aug 11, 2020, 07:23 PM UTC
A fix has been implemented and we are monitoring the results.
- monitoring Aug 11, 2020, 07:29 PM UTC
We've experienced an outage in our Lab Relay Service. It seems like something triggered a massive reconnection cycle from our relay to all of the labs, resulting in high CPU across the board, thus preventing successful reconnects. We've added more resources, and it seems to have settled down. We don't yet know what the root cause is, but we'll keep exploring.
- resolved Aug 11, 2020, 07:53 PM UTC
This incident has been resolved.