KnowledgeOwl incident

Site outage

Severity: Critical. Status: Resolved.

KnowledgeOwl experienced a critical incident on May 20, 2021, affecting three components (Knowledge Bases, Web Application, and API) and lasting 1h 33m. The incident has been resolved; the full update timeline is below.

Started: May 20, 2021, 07:20 PM UTC
Resolved: May 20, 2021, 08:53 PM UTC
Duration: 1h 33m
Detected by Pingoru: May 20, 2021, 07:20 PM UTC
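
The reported duration follows directly from the start and resolution timestamps. As a minimal sketch, the arithmetic can be checked as shown here; the helper names `parse_utc` and `format_duration` are hypothetical and chosen for illustration, and the format string assumes the timestamp layout used on this page and an English locale.

```python
from datetime import datetime, timezone

# Assumed layout of the timestamps on this page, e.g. "May 20, 2021, 07:20 PM UTC".
STAMP_FORMAT = "%b %d, %Y, %I:%M %p"


def parse_utc(stamp: str) -> datetime:
    """Parse a status-page timestamp and mark it as UTC."""
    if stamp.endswith(" UTC"):
        stamp = stamp[: -len(" UTC")]
    return datetime.strptime(stamp, STAMP_FORMAT).replace(tzinfo=timezone.utc)


def format_duration(start: datetime, end: datetime) -> str:
    """Render the gap between two instants as 'Xh Ym'."""
    total_minutes = int((end - start).total_seconds() // 60)
    hours, minutes = divmod(total_minutes, 60)
    return f"{hours}h {minutes}m"


started = parse_utc("May 20, 2021, 07:20 PM UTC")
resolved = parse_utc("May 20, 2021, 08:53 PM UTC")
print(format_duration(started, resolved))  # prints "1h 33m"
```

The same helper applied to the first monitoring update (07:49 PM UTC) gives 0h 29m between the start of the incident and the release of the fix.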

Affected components

Knowledge Bases, Web Application, API

Update timeline

  1. investigating May 20, 2021, 07:20 PM UTC

    We've had reports of 500 application errors and/or generalized slowness in the KnowledgeOwl application, knowledge bases, and API. Right now we're treating this as a major outage and are investigating; we'll provide updates as soon as we have more information.

  2. monitoring May 20, 2021, 07:49 PM UTC

    We've identified the issue and released a fix, which already seems to be working for most customers. We're continuing to monitor results.

  3. monitoring May 20, 2021, 08:18 PM UTC

    Performance seems to be back to normal for all customers who've reported and responded to our follow-ups. We are continuing to monitor the solution.

  4. resolved May 20, 2021, 08:53 PM UTC

    We've resolved the issue. Thanks for bearing with us and we are so sorry for the trouble today. We appreciate everyone who reported the problem and helped to confirm the fix. If you are still having issues, please email us at [email protected].

  5. postmortem May 20, 2021, 10:33 PM UTC

    Summary: We faced an unsuccessful Denial of Service (DoS) attempt that caused degraded performance for many customers.

    Next steps: As part of our infrastructure upgrades, we’re testing and reviewing anti-DoS measures. Once implemented, these measures will prevent future DoS attempts like this.