Prismatic incident

Elevated error rate for integration runner invocations in US region

Major Resolved View vendor source →

Prismatic experienced a major incident on June 26, 2023 affecting GraphQL API and Web App and 1 more component, lasting 5h 5m. The incident has been resolved; the full update timeline is below.

Started
Jun 26, 2023, 09:49 PM UTC
Resolved
Jun 27, 2023, 02:54 AM UTC
Duration
5h 5m
Detected by Pingoru
Jun 26, 2023, 09:49 PM UTC

Affected components

GraphQL APIWeb AppIntegration Runner

Update timeline

  1. investigating Jun 26, 2023, 09:49 PM UTC

    We are currently investigating this issue.

  2. investigating Jun 26, 2023, 10:03 PM UTC

    We are continuing to investigate this issue.

  3. investigating Jun 26, 2023, 10:26 PM UTC

    The engineering team is continuing to investigate issues related to the integration runner.

  4. investigating Jun 26, 2023, 10:46 PM UTC

    An issue has been identified and our engineering team is implementing a fix.

  5. investigating Jun 26, 2023, 11:15 PM UTC

    Our engineering team is continuing to implement a fix

  6. investigating Jun 26, 2023, 11:49 PM UTC

    A fix has been put in place, and the integration runner service is returning to normal operation. Our engineering team is monitoring the service.

  7. monitoring Jun 27, 2023, 12:20 AM UTC

    Our team is continuing to monitor integration runner performance.

  8. monitoring Jun 27, 2023, 12:44 AM UTC

    Mitigation efforts continue on a failing piece of critical piece of infrastructure. API and application performance are negatively impacted and Integration Runner is operating at a lower capacity than normal. A fix has been implemented and rollout continues.

  9. monitoring Jun 27, 2023, 01:30 AM UTC

    Rollout of the expected fix continues and the team continues to monitor the performance of the platform.

  10. monitoring Jun 27, 2023, 02:33 AM UTC

    We are continuing to monitor for any further issues.

  11. monitoring Jun 27, 2023, 02:35 AM UTC

    The team has completed rollout of the fix and things appear to be operating normally. We will continue to monitor the situation to ensure that no regressions appear.

  12. resolved Jun 27, 2023, 02:54 AM UTC

    All indication are that performance across the platform has been restored. The incident has been resolved.