Peakon incident

Elevated error rates in dashboard and survey

Critical Resolved View vendor source →

Peakon experienced a critical incident on November 10, 2020 affecting Dashboard and Survey and 1 more component, lasting 6h 10m. The incident has been resolved; the full update timeline is below.

Started
Nov 10, 2020, 10:43 AM UTC
Resolved
Nov 10, 2020, 04:53 PM UTC
Duration
6h 10m
Detected by Pingoru
Nov 10, 2020, 10:43 AM UTC

Affected components

DashboardSurveyAPI

Update timeline

  1. investigating Nov 10, 2020, 10:43 AM UTC

    We are seeing elevated error rates across our web applications, affecting the Peakon dashboards and survey. We are currently investigating the root cause.

  2. identified Nov 10, 2020, 11:59 AM UTC

    We have identified the issue to be an increased load introduced by a third party and we have since blocked the related account. All systems are back online, and we are monitoring the situation.

  3. monitoring Nov 10, 2020, 12:22 PM UTC

    A fix has been implemented and we are monitoring the results.

  4. investigating Nov 10, 2020, 02:17 PM UTC

    We are again seeing elevated error rates, and are investigating.

  5. identified Nov 10, 2020, 02:36 PM UTC

    The issue has been identified and a fix is being implemented.

  6. monitoring Nov 10, 2020, 03:13 PM UTC

    We have put a fix in place to mitigate the issue, and are monitoring the situation. All services are back online.

  7. resolved Nov 10, 2020, 04:53 PM UTC

    This incident has now been resolved.