Technolutions incident

Degraded performance for databases on OPUS cluster


Technolutions reported a notice-level incident on January 19, 2022, affecting Slate and lasting 5h 10m. The incident has been resolved; the full update timeline is below.

Started
Jan 19, 2022, 02:36 PM UTC
Resolved
Jan 19, 2022, 07:47 PM UTC
Duration
5h 10m
Detected by Pingoru
Jan 19, 2022, 02:36 PM UTC

Affected components

Slate

Update timeline

  1. investigating Jan 19, 2022, 02:36 PM UTC

    We are currently investigating degraded performance for databases on the OPUS cluster.

  2. investigating Jan 19, 2022, 04:38 PM UTC

    Following the initial post, we throttled back some background jobs (rule execution and scheduled jobs) and have observed no further performance issues for the past two hours. We are slowly increasing the throughput of these background jobs to observe the effect on system performance. We do not have an identified root cause at this time and are exploring the possibility of an issue with underlying hardware or infrastructure.

  3. resolved Jan 19, 2022, 07:47 PM UTC

    No further performance issues have been observed, so we will proceed with resolving this incident. We will continue to monitor these systems, and should any performance issues recur, we will apply the additional mitigations we have prepared for an overnight window.
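
The second update describes throttling background jobs and then slowly raising their throughput while watching system performance. The vendor's actual tooling is not documented here, so the following is only a minimal sketch of that general pattern. The function name `ramp_up`, the callback `apply_and_observe`, and the rate units are all hypothetical and are not part of Slate or any Technolutions API.

```python
from typing import Callable, List


def ramp_up(
    start_rate: int,
    full_rate: int,
    apply_and_observe: Callable[[int], bool],
    step: int = 10,
) -> List[int]:
    """Raise a background-job rate limit in steps, stopping at the first sign of trouble.

    apply_and_observe(rate) is a hypothetical callback that applies the given
    rate limit, waits for an observation window, and returns True if the
    system stayed healthy at that rate.

    Returns the rates that were held successfully, in order. The last entry
    is the rate the system should be left at.
    """
    if step <= 0:
        raise ValueError("step must be positive")

    held = [start_rate]  # the throttled rate already known to be safe
    rate = start_rate
    while rate < full_rate:
        candidate = min(rate + step, full_rate)
        if not apply_and_observe(candidate):
            # Degradation seen at the candidate rate: stay at the last good rate.
            break
        rate = candidate
        held.append(rate)
    return held


if __name__ == "__main__":
    # Pretend the system degrades above 40 jobs per minute (made-up threshold).
    print(ramp_up(20, 100, lambda r: r <= 40))
```

Stopping at the last known-good rate, rather than retrying immediately, mirrors the cautious approach in the timeline, where further mitigations were deferred to an overnight window.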