Proemion incident

Web service performance degradation

Minor Resolved View vendor source →

Proemion experienced a minor incident on June 21, 2017, lasting 3h 15m. The incident has been resolved; the full update timeline is below.

Started
Jun 21, 2017, 03:30 AM UTC
Resolved
Jun 21, 2017, 06:46 AM UTC
Duration
3h 15m
Detected by Pingoru
Jun 21, 2017, 03:30 AM UTC

Update timeline

  1. identified Jun 21, 2017, 03:30 AM UTC

    One of our server nodes is experiencing an outage. The PROEMION Data Platform is fully available, but you may experience degraded performance.

  2. resolved Jun 21, 2017, 06:46 AM UTC

    The server has been restored and everything is working normally again.

  3. postmortem Aug 02, 2018, 04:40 PM UTC

    # Facts This morning, from 3 to 5 AM UTC, we experienced server outages that affected the Data Platform. The platform took the outages in a graceful fashion. The only direct impact for our customers was a latency spike in the web service. All requests succeeded, but four percent of requests were delayed. Device connections were lost, but devices were immediately able to reconnect and then worked as usual. The user interfaces, data processing and alerting proceeded to work fine. # Ongoing measures We have improved monitoring for the web services, and we will continue to look into options for how to avoid such latency spikes for similar failures in the future.