- Detected by Pingoru
- Jun 03, 2026, 03:10 PM UTC
- Resolved
- Jun 04, 2026, 06:15 PM UTC
- Duration
- 1d 3h
Affected: Event storageHunting
Timeline · 4 updates
-
investigating Jun 03, 2026, 03:10 PM UTC
An ongoing incident is currently impacting event indexing and searches for some customers. Engineers are investigating on the issue.
-
identified Jun 03, 2026, 03:11 PM UTC
An ongoing incident in the FRA1 region is impacting event indexing and search capabilities for some customers. The event storage is experiencing some failures which causes unstabilities, this results in delayed ingestion and unavailable searches. Engineers are investigating and working on mitigations to restore indexing and search functionality.
-
monitoring Jun 03, 2026, 05:41 PM UTC
The underlying infrastructure have been stabilized and indexing has resumed, we're currently catching event indexing lag.
-
resolved Jun 04, 2026, 08:58 AM UTC
This incident has been resolved. All events are being indexed in real time and available for searches.
Read the full incident report →
- Detected by Pingoru
- Jun 01, 2026, 10:24 AM UTC
- Resolved
- Jun 01, 2026, 05:11 PM UTC
- Duration
- 6h 47m
Affected: Detection
Timeline · 3 updates
-
identified Jun 01, 2026, 10:24 AM UTC
An ongoing incident is impacting detection services due to a failure in cache instances. Engineers are performing restarts and recovery steps; service restoration is in progress but may take some time.
-
monitoring Jun 01, 2026, 10:32 AM UTC
Detection workflows were unavailable due to a cluster of cache instances becoming unresponsive after a deployment. The workflows have been restarted and all services are back online; event processing may be delayed but recovery and catch-up are in progress.
-
resolved Jun 01, 2026, 05:11 PM UTC
Backlog has been processed since 18:40 CEST and systems are stable. Thank you for your patience
Read the full incident report →
- Detected by Pingoru
- May 29, 2026, 05:57 PM UTC
- Resolved
- May 29, 2026, 11:09 PM UTC
- Duration
- 5h 12m
Affected: Event storage
Timeline · 4 updates
-
investigating May 29, 2026, 05:57 PM UTC
We are currently facing an issue on our event storage impacting only some customers, impacting indexation and researches. Our engineering team is actively working on diagnosing the root cause and finding a solution. Thank you for you patience.
-
identified May 29, 2026, 07:12 PM UTC
The root cause has been identified, and our team is currently applying patches. Event indexing was delayed for the impacted communities, which means some events may appear late on event pages. Indexing is now running and processing the backlog. Search is available again. We will keep you updated.
-
monitoring May 29, 2026, 07:50 PM UTC
The patches have been applied by our engineering team, and the situation is stable again. Accumulated logs are being processed. We will keep you updated once indexing is back to real time. Thank you for your patience.
-
resolved May 29, 2026, 11:09 PM UTC
Indexing is in real-time for all communities since 23:15 CEST. Everything is stable. Thank you for your patience throughout this incident.
Read the full incident report →
- Detected by Pingoru
- May 28, 2026, 07:36 AM UTC
- Resolved
- May 28, 2026, 01:03 PM UTC
- Duration
- 5h 27m
Affected: Automation
Timeline · 4 updates
-
investigating May 28, 2026, 07:36 AM UTC
An ongoing incident is affecting automation features: playbook actions are not being triggered. Customers may experience failures of automated workflows. Engineering teams are investigating and working on a fix.
-
identified May 28, 2026, 09:24 AM UTC
Engineers identified and isolated the offending playbook and restored real-time processing. A backlog playbook tasks remains and customers may see delayed automated workflows while the backlog is processed. Teams continue to work on completing the backlog.
-
monitoring May 28, 2026, 09:37 AM UTC
Teams continue to work on completing the backlog
-
resolved May 28, 2026, 01:03 PM UTC
All the backlog has been processed Thanks for you patience
Read the full incident report →
- Detected by Pingoru
- May 20, 2026, 12:53 PM UTC
- Resolved
- May 20, 2026, 09:45 PM UTC
- Duration
- 8h 51m
Affected: Automation
Timeline · 4 updates
-
investigating May 20, 2026, 12:53 PM UTC
We are currently experiencing a major incident affecting the automation features in the FRA1 cloud region. An excessive number of automation playbook runs are being triggered overwhelming the system and causing no new playbook runs to be processed. We are investigating the issue.
-
identified May 20, 2026, 01:23 PM UTC
The major incident affecting automation features in the FRA1 cloud region is currently under recovery. The automation feature for playbooks runs was stopped to mitigate the issue and is now being restarted gradually. Customers may still experience delays in playbook runs, but recovery is in progress.
-
monitoring May 20, 2026, 04:34 PM UTC
The automation feature has been fully restored, with real-time processing resumed. We are currently handling the backlog of tasks which couldn't be handled in time.
-
resolved May 20, 2026, 09:45 PM UTC
The backlog of tasks has been processed and the automation feature is fully restored.
Read the full incident report →
- Detected by Pingoru
- May 18, 2026, 07:40 AM UTC
- Resolved
- May 18, 2026, 10:31 AM UTC
- Duration
- 2h 50m
Affected: Event storage
Timeline · 4 updates
-
investigating May 18, 2026, 07:40 AM UTC
We are currently investigating an indexing issue. This is causing processing lag to data ingestion only on few communities. Our engineering team is actively working to identify the root cause and mitigate the impact. We will provide an update shortly.
-
identified May 18, 2026, 08:05 AM UTC
We have identified the root cause of the issue and a fix is being applied.
-
monitoring May 18, 2026, 08:29 AM UTC
We have applied a fix. We are currently monitoring the systems as they return to normal. We will let you know as soon as the issue is fully resolved.
-
resolved May 18, 2026, 10:31 AM UTC
Indexing is back to real-time processing for every community. This incident is now over. Thank you for your patience.
Read the full incident report →
- Detected by Pingoru
- May 17, 2026, 07:31 AM UTC
- Resolved
- May 17, 2026, 10:28 AM UTC
- Duration
- 2h 56m
Affected: Event ingestionEvent storageDetection
Timeline · 3 updates
-
investigating May 17, 2026, 07:31 AM UTC
We are currently investigating an indexing issue specifically affecting Exalog. This is causing processing lag to data ingestion. Our engineering team is actively working to identify the root cause and mitigate the impact. We will provide an update shortly.
-
monitoring May 17, 2026, 09:13 AM UTC
We have identified the root cause of the issue and applied a fix. We are currently monitoring the systems as they return to normal. We will let you know as soon as the issue is fully resolved.
-
resolved May 17, 2026, 10:28 AM UTC
The issue with Exalog indexing and ingestion has been fully resolved. All systems have been operating normally since 11:30 CEST. Thank you for your patience.
Read the full incident report →
- Detected by Pingoru
- May 16, 2026, 07:52 AM UTC
- Resolved
- May 16, 2026, 11:49 AM UTC
- Duration
- 3h 57m
Affected: Event storageDetection
Timeline · 6 updates
-
investigating May 16, 2026, 07:52 AM UTC
We have been experiencing an issue affecting event search, alerting and processing. Users may encounter errors when trying to search or access events. Please be assured that our data reception systems are fully operational and no events are being lost. All incoming data is safely stored and will be processed as soon as the issue is resolved. Our engineering team is actively working on a fix.
-
identified May 16, 2026, 08:38 AM UTC
We have identified the root cause of the service disruption and applied a fix. Event ingestion, search, and alerting functionalities are now resuming. However, you may experience degraded performance and slower response times as our systems process the backlog of safely stored events.
-
identified May 16, 2026, 08:43 AM UTC
We are continuing to work on a fix for this issue.
-
monitoring May 16, 2026, 09:39 AM UTC
The backlog of safely queued events has been fully processed, and all systems (ingestion, search, and alerting) have returned to normal performance. We appreciate your patience and apologize for any inconvenience this disruption may have caused.
-
monitoring May 16, 2026, 10:00 AM UTC
We are currently re-running the playbooks that failed during the incident. Once these executions are complete, the incident will be marked as fully resolved.
-
resolved May 16, 2026, 11:49 AM UTC
All failed playbooks have been replayed. Thank you for your patience throughout this incident.
Read the full incident report →
- Detected by Pingoru
- May 12, 2026, 09:32 AM UTC
- Resolved
- May 12, 2026, 05:06 PM UTC
- Duration
- 7h 33m
Affected: Event ingestionDetection
Timeline · 5 updates
-
investigating May 12, 2026, 09:32 AM UTC
Since 10:43, ingestion services in FRA1 region are experiencing degradation due to the message bus. This issue causes delays and slowdowns in event processing. The engineering team is investigating to restore normal operation as soon as possible.
-
identified May 12, 2026, 09:57 AM UTC
Ingestion is slowly recovering. Additional cluster nodes are being provisioned to improve resilience. Monitoring continues to ensure full restoration.
-
identified May 12, 2026, 10:22 AM UTC
We are continuing to work on a fix for this issue.
-
monitoring May 12, 2026, 02:33 PM UTC
We applied a fix and are monitoring the region which is catching the lag on events ingestion.
-
resolved May 12, 2026, 05:06 PM UTC
We are back to real-time indexing. Thank you for your patience
Read the full incident report →
- Detected by Pingoru
- May 06, 2026, 10:19 AM UTC
- Resolved
- May 06, 2026, 08:27 PM UTC
- Duration
- 10h 8m
Affected: Event ingestion
Timeline · 4 updates
-
investigating May 06, 2026, 10:19 AM UTC
We are experiencing configuration issues affecting multiple communities which are causing delays in event data indexing. Our teams are actively working to resolve the problem and restore normal indexing speed. Updates will be provided as the situation evolves.
-
identified May 06, 2026, 02:11 PM UTC
The issue has been identified and a fix is being implemented.
-
monitoring May 06, 2026, 02:30 PM UTC
Approximately 32 servers became unresponsive due to high memory usage caused by a malfunction in the ingestion workflow components. A bulk hard reboot of affected servers has been initiated. Customers may experience delays and the ingestion of some events twice during this incident. There is no impact on automation features or playbooks. The situation is now stable and the backlog of events is being processed.
-
resolved May 06, 2026, 08:27 PM UTC
This incident has been resolved.
Read the full incident report →
- Detected by Pingoru
- May 05, 2026, 03:21 PM UTC
- Resolved
- May 05, 2026, 05:23 PM UTC
- Duration
- 2h 1m
Affected: Event ingestionEvent storage
Timeline · 4 updates
-
investigating May 05, 2026, 03:21 PM UTC
We identified a network issue on event storage cluster which cause searches and indexation instability. We already recovered cold data, restoring search query functionality to include all events. However, we're still experiencing indexation issues which are under investigation. We're working on fix the issue, and further updates will be provided as necessary.
-
identified May 05, 2026, 04:02 PM UTC
We applied a fix and indexation has now resumed. We're now starting to catch the lag. The incident is actively monitored for stability.
-
monitoring May 05, 2026, 04:14 PM UTC
The applied fix is working as exepected and we're now cathing the lag. The incident is actively monitored. We will continue to inform you on the progress.
-
resolved May 05, 2026, 05:23 PM UTC
The indexation is back to real-time. Thank you for you patience.
Read the full incident report →
- Detected by Pingoru
- Apr 27, 2026, 09:32 AM UTC
- Resolved
- Apr 27, 2026, 11:55 AM UTC
- Duration
- 2h 23m
Affected: Web application
Timeline · 2 updates
-
monitoring Apr 27, 2026, 09:32 AM UTC
We are currently experiencing slowness on some pages due to one of our components being overloaded. A fix has been applied by our team and performances are coming back up. We are still investigating on the root cause and working on improving performances and latency. We will keep you updated
-
resolved Apr 27, 2026, 11:55 AM UTC
Response time were back to normal at 11:25 CEST and are stable since. This incident is over. Thank you for your patience.
Read the full incident report →
- Detected by Pingoru
- Apr 24, 2026, 03:59 PM UTC
- Resolved
- Apr 25, 2026, 08:50 AM UTC
- Duration
- 16h 51m
Affected: Event storage
Timeline · 4 updates
-
identified Apr 24, 2026, 03:59 PM UTC
Today at 12:38 CEST, we have started to progressively deploy our detection engine. Unfortunately, due to a misconfiguration, some events were not indexed into our storage engine, but all events were processed and related alerts were raised. The proportion of non-indexed events increased gradually. This issue was identified at 16:40 CEST and a fix was deployed around 17:00 CEST. We are now working to fix the situation and ensure that all missed events will be correctly pushed to the storage engine. During the issue, the detection engines continued to work as expected and alerts (including their related events) were correctly raised. All events related to alerts were also correctly indexed into our storage engine.
-
identified Apr 24, 2026, 04:58 PM UTC
Our team is currently deploying a fix in order to index the missed events into the storage engine. Be aware that the process is expected to cause duplicates of some events that were already pushed. Thanks for your patience.
-
monitoring Apr 24, 2026, 06:16 PM UTC
The fix has been deployed and missed events are currently being indexed into our storage engine. We will continue to inform you on the progress.
-
resolved Apr 25, 2026, 08:50 AM UTC
All missed events were correctly pushed to our storage engine during the night. This incident is now over. Thank you for your patience throughout this incident.
Read the full incident report →
- Detected by Pingoru
- Apr 16, 2026, 03:16 PM UTC
- Resolved
- Apr 16, 2026, 06:41 PM UTC
- Duration
- 3h 25m
Affected: Event ingestionEvent storageDetectionHuntingCase management
Timeline · 5 updates
-
investigating Apr 16, 2026, 03:16 PM UTC
We are currently investigating an incident affecting several components including the ingestion process and the event searches. Our engineering teams are actively working to identify the root cause and implement a resolution. Further updates will be provided as the situation evolves.
-
identified Apr 16, 2026, 03:26 PM UTC
The issue has been identified. Our teams are working on a fix.
-
monitoring Apr 16, 2026, 03:28 PM UTC
Our team partially fixed the situation. The systems are progressively recovering.
-
monitoring Apr 16, 2026, 04:02 PM UTC
The delay on event ingestion is now catching up. The event ingestion is stable and systems are recovering. Our teams are still monitoring the situation.
-
resolved Apr 16, 2026, 06:41 PM UTC
This incident has been resolved.
Read the full incident report →
- Detected by Pingoru
- Apr 15, 2026, 07:57 AM UTC
- Resolved
- Apr 15, 2026, 04:46 PM UTC
- Duration
- 8h 48m
Affected: Event ingestionAutomation
Timeline · 4 updates
-
investigating Apr 15, 2026, 07:57 AM UTC
We are currently experiencing an incident impacting the playbook automation feature. The team is actively investigating the root cause and working on remediation.
-
identified Apr 15, 2026, 08:34 AM UTC
The issue has been identified and a fix is being implemented.
-
monitoring Apr 15, 2026, 12:08 PM UTC
A fix has been implemented and we are monitoring the results.
-
resolved Apr 15, 2026, 04:46 PM UTC
This incident has been resolved.
Read the full incident report →
- Detected by Pingoru
- Apr 08, 2026, 09:15 AM UTC
- Resolved
- Apr 08, 2026, 12:07 PM UTC
- Duration
- 2h 51m
Affected: Detection
Timeline · 4 updates
-
investigating Apr 08, 2026, 08:58 AM UTC
We are currently investigating an issue affecting Assets availability on the platform. Our Engineering team is fully mobilized to resolve the issue.
-
monitoring Apr 08, 2026, 09:15 AM UTC
The issue has been identified and the assets functionalities are back up, and available through the web application and APIs. Stabilization actions are ongoing which should not further impact users
-
monitoring Apr 08, 2026, 09:15 AM UTC
We are continuing to monitor for any further issues.
-
resolved Apr 08, 2026, 12:07 PM UTC
All assets functionality have been resolved and recovery actions completed. Thank you for your patience during resolution
Read the full incident report →
- Detected by Pingoru
- Apr 04, 2026, 09:00 PM UTC
- Resolved
- Apr 04, 2026, 10:50 PM UTC
- Duration
- 1h 50m
Affected: Event ingestion
Timeline · 3 updates
-
investigating Apr 04, 2026, 09:00 PM UTC
We are currently investigating degraded ingestion performance. Our teams are actively working to identify the cause and restore normal performance as quickly as possible. We will share another update as soon as more information is available.
-
monitoring Apr 04, 2026, 09:26 PM UTC
We have identified and fixed the issue affecting ingestion. Ingestion is now operating normally again. Some delay may still be visible while the remaining lag is being processed, and we expect the situation to be fully back to normal soon. We are continuing to monitor the recovery closely.
-
resolved Apr 04, 2026, 10:50 PM UTC
The issue affecting ingestion has been resolved. Ingestion performance is back to normal, and the temporary lag has been fully absorbed. We will continue to monitor the platform, but the incident is now closed.
Read the full incident report →
- Detected by Pingoru
- Mar 20, 2026, 08:25 AM UTC
- Resolved
- Mar 23, 2026, 08:13 AM UTC
- Duration
- 2d 23h
Affected: Automation
Timeline · 4 updates
-
identified Mar 20, 2026, 08:25 AM UTC
We are currently experiencing an incident affecting task execution due to a limitation in the cluster responsible for coordination and scheduling. This issue started yesterday around 18:26 and has prevented new processing tasks from starting. The engineering team has identified the root cause to mitigate the issue. Investigation and remediation efforts are ongoing.
-
monitoring Mar 20, 2026, 09:05 AM UTC
Pod scheduling and task execution have resumed following administrative actions on the cluster. The situation is currently being monitored.
-
monitoring Mar 20, 2026, 03:56 PM UTC
We are currently replaying the pending tasks and monitoring system stability.
-
resolved Mar 23, 2026, 08:13 AM UTC
This incident has been resolved.
Read the full incident report →
- Detected by Pingoru
- Mar 06, 2026, 10:54 AM UTC
- Resolved
- Mar 06, 2026, 03:52 PM UTC
- Duration
- 4h 57m
Affected: Web applicationDetectionCTI Search
Timeline · 3 updates
-
identified Mar 06, 2026, 10:54 AM UTC
Following a maintenance operation on a shared event storage cluster node, observables are currently unavailable, impacting alert generation and sightings across all regions. This results in alerts not being raised, affecting the product's visibility on security events.
-
monitoring Mar 06, 2026, 10:59 AM UTC
A fix has been implemented and we are monitoring the results.
-
resolved Mar 06, 2026, 03:52 PM UTC
This incident has been resolved.
Read the full incident report →
- Detected by Pingoru
- Feb 26, 2026, 03:09 PM UTC
- Resolved
- Feb 26, 2026, 06:45 PM UTC
- Duration
- 3h 36m
Affected: Event ingestion
Timeline · 3 updates
-
identified Feb 26, 2026, 03:09 PM UTC
The indexation service in the FRA1 region is currently degraded due to multiple nodes going down in the event storage cluster caused by a hardware outage on our cloud provider. The incident started around 15:00 and prevents writing to several indices. Operations teams are performing recovery operations on the affected indices to restore service.
-
monitoring Feb 26, 2026, 03:22 PM UTC
The indexation service in the FRA1 region has been restored following a hardware outage at the cloud provider affecting multiple nodes in the event storage cluster. Recovery operations on affected indices have been completed. Monitoring continues to ensure stability and recovery.
-
resolved Feb 26, 2026, 06:45 PM UTC
Recovery has been completed, and backlogged events have been fully processed. We thank you for your patience during the resolution.
Read the full incident report →
- Detected by Pingoru
- Feb 26, 2026, 10:40 AM UTC
- Resolved
- Feb 26, 2026, 10:59 AM UTC
- Duration
- 18m
Affected: Web applicationEvent ingestionEvent storageDetectionHuntingCase managementAutomation
Timeline · 4 updates
-
investigating Feb 26, 2026, 10:40 AM UTC
Some API endpoints are returning 500 errors intermittently. Our engineering team is currently investigating the issue. We thank you for your patience during the resolution.
-
identified Feb 26, 2026, 10:57 AM UTC
The issue has been identified and a fix is being implemented.
-
monitoring Feb 26, 2026, 10:57 AM UTC
A fix has been implemented and we are monitoring the results.
-
resolved Feb 26, 2026, 10:59 AM UTC
This incident has been resolved.
Read the full incident report →
- Detected by Pingoru
- Feb 24, 2026, 04:20 PM UTC
- Resolved
- Feb 24, 2026, 07:05 PM UTC
- Duration
- 2h 45m
Affected: Event ingestion
Timeline · 3 updates
-
investigating Feb 24, 2026, 04:20 PM UTC
We are currently experiencing delays in event processing in the FRA1 region. This issue is related to an incident from yesterday, which has been resolved and was caused by the cloud provider. Our team is working to fully restore normal operations.
-
identified Feb 24, 2026, 06:17 PM UTC
Event processing delays in the FRA1 region persist following yesterday's cloud provider-related incident. We've identified the issue and we're working towards improving system's stability and reducing processing lags. The situation is actively monitored and being addressed.
-
resolved Feb 24, 2026, 07:05 PM UTC
Event processing delays in the FRA1 region have been resolved. Normal operations have resumed and the system is stable.
Read the full incident report →
- Detected by Pingoru
- Feb 23, 2026, 02:15 PM UTC
- Resolved
- Feb 24, 2026, 08:53 AM UTC
- Duration
- 18h 37m
Affected: Event storage
Timeline · 6 updates
-
investigating Feb 23, 2026, 02:15 PM UTC
We have identified more than a hundred servers down at once on FRA1. As this is affecting nearly all clusters, we are currently looking into a cloud provider issue. At this time we are not sure about the overall impact, as our team is looking into it.
-
identified Feb 23, 2026, 02:18 PM UTC
More than a hundred servers are down due to an issue impacting network switches in one of the data centers, causing disruptions to the message bus and other infrastructure components. Our teams are actively investigating, and we are coordinating with the data center provider to determine the estimated time for recovery.
-
identified Feb 23, 2026, 03:06 PM UTC
This incident continues to impact approximately 80 servers. The event storage cluster is recovering with nodes restarting and data becoming progressively available. Indexation has resumed with some lag still present. Frontend, APIs, automation features, and detection services remain operational. The main residual issues relate to forwarding difficulties caused by four message bus nodes being down, affecting forwarding to search indexing components. Recovery of infrastructure nodes is ongoing. We are still in contact with our cloud provider to ensure all servers come back online as soon as possible.
-
identified Feb 23, 2026, 03:58 PM UTC
This incident is ongoing with around 80 servers still down. Indexation is operating with less than 10 minutes of lag on the event storage cluster. The data center provider is manually rebooting servers that failed to start correctly, prioritizing critical nodes including message bus nodes. Recovery efforts are ongoing.
-
identified Feb 23, 2026, 05:25 PM UTC
All event storage cluster servers hosting long-term data are now online with indexes gradually recovering; workers responsible for data retention will be restarted shortly. Remaining event storage cluster servers are being restarted via a custom process and should rejoin shortly.
-
resolved Feb 24, 2026, 08:53 AM UTC
A few servers are still offline but FRA1 is fully functional since 23/02 at 23:00 CET. At this time we are still expecting a post-mortem from our cloud provider. This incident is now resolved.
Read the full incident report →
- Detected by Pingoru
- Feb 16, 2026, 05:17 PM UTC
- Resolved
- Feb 17, 2026, 08:41 AM UTC
- Duration
- 15h 23m
Affected: Web application
Timeline · 2 updates
-
investigating Feb 16, 2026, 05:17 PM UTC
At 17:50 one of our web-facing provider had a major incident on their load balancer product, resulting in our web UI and APIs being unavailable until 18:05. We have switched over to another provider, the web UI and APIs are now fully available. This does not impact event ingestion in any way. We are still monitoring the situation and opened a case with the faulty cloud provider.
-
resolved Feb 17, 2026, 08:41 AM UTC
Incident has been resolved at 18:05 as stated in previous update. All remediation actions have been identified to avoid reoccurence. Thank you for your patience during the incident.
Read the full incident report →
- Detected by Pingoru
- Feb 12, 2026, 02:16 PM UTC
- Resolved
- Feb 12, 2026, 05:01 PM UTC
- Duration
- 2h 45m
Affected: Case managementAutomation
Timeline · 3 updates
-
identified Feb 12, 2026, 02:16 PM UTC
We have identified an issue impacting retrieval of Alerts and Cases through API. The issue is also impacting the Web UI in Alerts listing and Display. Our Engineering team is applying remediation actions to restore the service. We apologize for the inconvenience caused and we thank you for your patience during the resolution.
-
identified Feb 12, 2026, 03:24 PM UTC
We are continuing to work on a fix for this issue.
-
resolved Feb 12, 2026, 05:01 PM UTC
The situation has been recovered and the Alerts and Cases functionalities are now fully availaible, through API and the Web UI. We thank you for your patience during the resolution of the incident.
Read the full incident report →