Scout APM incident

Investigating Database Issues

Critical Resolved

Scout APM experienced a critical incident on August 26, 2019, affecting Application Monitoring and lasting 3h 21m. The incident has been resolved; the full update timeline is below.

Started: Aug 26, 2019, 01:36 AM UTC
Resolved: Aug 26, 2019, 04:57 AM UTC
Duration: 3h 21m
Detected by Pingoru: Aug 26, 2019, 01:36 AM UTC

Affected components

Application Monitoring

Update timeline

  1. investigating Aug 26, 2019, 01:36 AM UTC

    We appear to have degraded write behavior on our main Postgres database. We are investigating.

  2. identified Aug 26, 2019, 01:51 AM UTC

    We've identified the cause and fixed the underlying issue. We are bringing the database back online.

  3. monitoring Aug 26, 2019, 02:06 AM UTC

    Everything is back up, and buffered checkins are flowing back into the system. Data should be caught up to current within a few minutes.

  4. resolved Aug 26, 2019, 04:57 AM UTC

    All buffered checkin data has been ingested, and all components are back online.