DRACOON experienced a critical incident on January 19, 2024 affecting Upload and Download, lasting 2h 10m. The incident has been resolved; the full update timeline is below.
Affected components
Update timeline
- investigating Jan 19, 2024, 06:35 PM UTC
We are currently investigating an issue with the DRACOON Storage Service, including upload and download functionality. Our team is working to gather more information and resolve the issue as quickly as possible. We apologize for any inconvenience this may cause and will provide updates as soon as we have them.
- monitoring Jan 19, 2024, 06:41 PM UTC
The issue with the DRACOON Storage Service has been resolved, and we are monitoring the situation to ensure it remains stable. We apologize for any inconvenience this may have caused and appreciate your patience.
- resolved Jan 19, 2024, 08:20 PM UTC
The issue with the DRACOON Storage Service has been fully resolved. All systems are now operating normally. We apologize for any inconvenience this may have caused and appreciate your patience. If you continue to experience any issues, please don't hesitate to reach out to our support team for assistance.
- postmortem Sep 18, 2024, 04:57 PM UTC
We experienced an issue with **our storage** on **2024-01-19**. Our team has worked diligently to identify the root cause and implement a resolution. In this post-mortem, we want to share the details of what happened, why it happened, what we did to resolve it, and what we will do to prevent similar incidents in the future.
What happened? **Our storage systems were unavailable, which caused upload and download problems.**
Why did this happen? **An internal network outage occurred, leading to heartbeat timeouts and service restarts.**
What did we do? **We analyzed the logs and confirmed that there is no major problem with the network.**
What can we do to improve? **We will continue to monitor our platform for errors and ensure its operability. We increased the priority of certain alerts so that we can respond faster if the problem recurs.**
We apologize for any inconvenience this incident may have caused. We are committed to ensuring the stability and reliability of our services and will continue to take proactive measures to prevent similar incidents from happening in the future. If you have any questions or concerns, please don't hesitate to reach out to our support team for assistance.