KnowledgeOwl incident

Outage

Critical, Resolved

KnowledgeOwl experienced a critical incident on February 7, 2023, affecting Knowledge Bases, Web Application, and API, lasting 1h 33m. The incident has been resolved; the full update timeline is below.

Started
Feb 07, 2023, 12:08 PM UTC
Resolved
Feb 07, 2023, 01:42 PM UTC
Duration
1h 33m
Detected by Pingoru
Feb 07, 2023, 12:08 PM UTC

Affected components

Knowledge Bases, Web Application, API

Update timeline

  1. investigating Feb 07, 2023, 01:13 PM UTC

    We are currently investigating and are working to get everything back up and running as quickly as possible. We will be posting updates here!

  2. monitoring Feb 07, 2023, 01:27 PM UTC

    A fix has been implemented and we are monitoring the results.

  3. resolved Feb 07, 2023, 01:42 PM UTC

    All systems are operational and we will continue investigating the root cause. We are going to close this incident as it appears to be resolved. Please contact [email protected] if you need any help!

  4. postmortem Feb 07, 2023, 06:42 PM UTC

    Incident postmortem

    A traffic distribution system failed and was not restarted by automation. We are auditing these systems to ensure this failure does not happen again. Due to gaps in our overnight emergency alert tools, it took us longer than normal to resolve the issue. We are going to review our processes to make sure that more staff receive alerts when there is an outage. We will also look into training more staff to be able to handle incidents like this.