Badgr incident

Infrastructure issues.

Critical Resolved View vendor source →

Badgr experienced a critical incident on August 31, 2021 affecting Badgr Pro (badgr.com) and Badgr API (api.badgr.io), lasting 4h 26m. The incident has been resolved; the full update timeline is below.

Started
Aug 31, 2021, 06:45 PM UTC
Resolved
Aug 31, 2021, 11:11 PM UTC
Duration
4h 26m
Detected by Pingoru
Aug 31, 2021, 06:45 PM UTC

Affected components

Badgr Pro (badgr.com)Badgr API (api.badgr.io)

Update timeline

  1. investigating Aug 31, 2021, 06:45 PM UTC

    The infrastructure provider is experiencing network issues that are affecting the site.

  2. investigating Aug 31, 2021, 07:42 PM UTC

    This is being caused by an issue with AWS, you can follow their status page at https://status.aws.amazon.com/. We are looking for a workaround.

  3. investigating Aug 31, 2021, 08:58 PM UTC

    AWS has localized the issue to packet loss within an availability zone in the us-west-2 region. We are pursuing a workaround, and we will have an update as soon as more information is available.

  4. investigating Aug 31, 2021, 10:51 PM UTC

    We have isolated the issue and are working around it.

  5. investigating Aug 31, 2021, 10:56 PM UTC

    We have successfully worked around the AWS infrastructure issue and are back online.

  6. monitoring Aug 31, 2021, 10:57 PM UTC

    A fix has been implemented and we are monitoring the results.

  7. resolved Aug 31, 2021, 11:11 PM UTC

    This incident has been resolved.

  8. postmortem Aug 31, 2021, 11:12 PM UTC

    This outage was due to an AWS Availability Zone \(usw2-az2\) experiencing packet loss. This caused intermittent issues communicating with critical services required for Badgr to function. A work around was put in place after AWS identified the source of the issue.