DataCite incident

Heavy load across services.

Minor Resolved

DataCite experienced a minor incident on July 24, 2019 affecting REST API and Fabrica, lasting 4h 39m. The incident has been resolved; the full update timeline is below.

Started
Jul 24, 2019, 02:56 PM UTC
Resolved
Jul 24, 2019, 07:35 PM UTC
Duration
4h 39m
Detected by Pingoru
Jul 24, 2019, 02:56 PM UTC

Affected components

REST API, Fabrica

Update timeline

  1. investigating Jul 24, 2019, 02:56 PM UTC

    We are currently experiencing increased load on our core services, which is delaying background queued jobs such as indexing DOIs for search results and the API. No data is being lost; only the time it takes for changes to appear across all our services is affected.

  2. monitoring Jul 24, 2019, 05:44 PM UTC

    We have manually removed a large number of queued jobs from our backlog and are monitoring the situation. There may still be some delay for DOIs registered today.

  3. resolved Jul 24, 2019, 07:35 PM UTC

    The incident has been resolved and all systems are operating normally again. Some DOIs registered or updated since about 18:00 GMT on July 23 might not have been updated properly. We are working on this and hope to have it resolved by 16:00 GMT tomorrow. The incident was triggered by very unusual activity by a member, who tried to update more than a million DOIs in a single morning.