Brillium incident

Assessment Builder application degraded performance

Brillium experienced a major incident on September 23, 2025 affecting Assessment Authoring and Assessment Delivery, lasting 3h 42m. The incident has been resolved; the full update timeline is below.

Started: Sep 23, 2025, 01:42 PM UTC
Resolved: Sep 23, 2025, 05:25 PM UTC
Duration: 3h 42m
Detected by Pingoru: Sep 23, 2025, 01:42 PM UTC

Affected components

Assessment AuthoringAssessment Delivery

Update timeline

investigating Sep 23, 2025, 01:42 PM UTC

We are currently investigating this issue.
investigating Sep 23, 2025, 01:57 PM UTC

We are investigating degraded performance with Assessment Builder. Other applications appear to be unaffected.
investigating Sep 23, 2025, 03:17 PM UTC

We are currently validating our investigation findings.
identified Sep 23, 2025, 03:48 PM UTC

The systems operations team will be performing a systems refresh. This should take about 15 minutes. During this time, there may be limited access to brillium Assessment Builder.
monitoring Sep 23, 2025, 04:13 PM UTC

All systems are available and operating as expected. We will be monitoring systems closely during the next few hours to ensure system operations are performing as expected.
resolved Sep 23, 2025, 05:25 PM UTC

All systems have returned to normal operational performance.
postmortem Sep 23, 2025, 05:25 PM UTC

Our investigation revealed that at approxiamately 2025-09-23T12:29:10Z \(UTC\) we experienced a temporary service disruption caused by an unexpected and extreme surge in network traffic. At peak, our systems were servicing over 20,000 simultaneous requests per second, which impacted Application Builder performance for a significant segment of our customers. Initial analysis of our Application Firewall \(AF\) logs shows that requests were valid and there is no indication of malicious activity. Our System Operations team reviewed and refreshed affected systems to restore system services performance and availability. We will continue to investigate the root cause data and apply additional operational enhancements aimed at improving the platform’s ability to handle unusual activity of this nature.