W3C incident

Cloud hosting infrastructure outage

Minor Resolved

W3C experienced a minor incident on June 1, 2021, affecting Website, User & Group Management, and other components, lasting 26d 17h. The incident has been resolved; the full update timeline is below.

Started
Jun 01, 2021, 04:55 PM UTC
Resolved
Jun 28, 2021, 10:49 AM UTC
Duration
26d 17h
Detected by Pingoru
Jun 01, 2021, 04:55 PM UTC

Affected components

Website, User & Group Management, TR Publication, Email & Mailing Lists, IRC, API, Validation Services, Chapters

Update timeline

  1. identified Jun 22, 2021, 11:58 PM UTC

    A number of services continue to time out or respond slowly due to an ongoing outage in the storage infrastructure of our cloud provider. Blogs, Community/Business Groups, mail, mailing list archives, CVS, Calendar, and IRC are impacted. Our provider is working on the issue and expects the situation to continue at least into next week. Our apologies for the inconvenience.

  2. identified Jun 22, 2021, 11:59 PM UTC

    We put some mitigation in place to make the Blogs and Community/Business Groups pages more responsive.

  3. identified Jun 23, 2021, 12:01 AM UTC

    Unfortunately, this cloud issue is still ongoing and worsened over the weekend, making the W3C website unavailable on several occasions. Please also note that this is the cause of the delay in serving IRC logs and minutes. More details are in CSAIL's Infrastructure Group message: "Continued storage issues and degraded OpenStack availability" - https://lists.csail.mit.edu/pipermail/openstack-users/2021-June/000967.html

  4. identified Jun 23, 2021, 12:09 AM UTC

    Additional update from CSAIL's Infrastructure Group: "Update on storage issues" - https://lists.csail.mit.edu/pipermail/openstack-users/2021-June/000968.html

  5. monitoring Jun 28, 2021, 10:47 AM UTC

    Our cloud provider reported that the issue affecting their storage backend was resolved and that disk performance should be back to normal.

  6. resolved Jun 28, 2021, 10:49 AM UTC

    Our tests show that the performance of W3C services is back to normal.