DroneMobile incident

DroneMobile - AWS Outage

Critical Resolved

DroneMobile experienced a critical incident on November 25, 2020, lasting 16h. The incident has been resolved; the full update timeline is below.

Started: Nov 25, 2020, 02:00 PM UTC
Resolved: Nov 26, 2020, 06:00 AM UTC
Duration: 16h
Detected by Pingoru: Nov 25, 2020, 02:00 PM UTC

Update timeline

  1. investigating Dec 02, 2020, 04:24 PM UTC

    DroneMobile X1-based devices are experiencing an error when attempting to communicate with our servers. We're investigating this issue and will provide updates shortly.

  2. identified Dec 02, 2020, 04:24 PM UTC

    A ticket has been opened with Amazon Web Services to investigate this issue further.

  3. monitoring Dec 02, 2020, 04:24 PM UTC

    Amazon Web Services is experiencing API failures for a number of core services that DroneMobile uses to deliver commands to devices. AWS has not provided an estimated time of repair, but they are actively working on the issue.

  4. resolved Dec 02, 2020, 04:24 PM UTC

    DroneMobile is back online. Per AWS: "We have restored all traffic to Kinesis Data Streams from Internet-facing endpoints, and we are continuing to incrementally restore all requests to Kinesis Data Streams using VPC Endpoints. We are also beginning to observe the incremental recovery of CloudWatch metrics functionality for new incoming metrics, and working towards full recovery. The backlog of metrics will take additional time to populate."

  5. postmortem Dec 02, 2020, 05:17 PM UTC

    DroneMobile was impacted by an AWS outage. More information about this outage is available here: [https://aws.amazon.com/message/11201/](https://aws.amazon.com/message/11201/). The DroneMobile DevOps team is investigating ways to mitigate outages like this in the future.