SparkPost incident

Elevated percentage of API calls failing

Minor · Resolved

SparkPost experienced a minor incident on May 13, 2025, affecting Metrics API - USA, Transmissions API - USA, and 8 other components, lasting 6h 7m. The incident has been resolved; the full update timeline is below.

Started
May 13, 2025, 05:22 AM UTC
Resolved
May 13, 2025, 11:29 AM UTC
Duration
6h 7m
Detected by Pingoru
May 13, 2025, 05:22 AM UTC

Affected components

Metrics API - USA
Transmissions API - USA
Events API - USA
SMTP API - USA
Sending Domains API - USA
Suppression List API - USA
Blocklist API - USA
Alerts API - USA
Metrics API - EUROPE
Events API - EUROPE

Update timeline

  1. investigating May 13, 2025, 05:22 AM UTC

    We are investigating an unexpected but small increase in the percentage of API calls failing since 01:30 UTC. Approximately 0.015% of API calls are failing with an HTTP 504 (Gateway Timeout) error. We will update this incident as we have more information.

  2. investigating May 13, 2025, 05:52 AM UTC

    We are still investigating. During this period, the error rate has remained low, as initially reported, at approximately 0.015%.

  3. identified May 13, 2025, 06:26 AM UTC

    We have identified the cause as a set of stale IPs for an upstream service. We are now validating the approach for remediation.

  4. resolved May 13, 2025, 11:29 AM UTC

    After a period of extended monitoring, we have confirmed that the remediation has completely resolved the issue. Requests and error rates have been normal since 06:27 UTC.
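
The timestamps in the timeline can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the vendor's report: the helper names `parse_utc` and `fmt_delta` are hypothetical, and it assumes the 01:30 UTC and 06:27 UTC times quoted in the updates fall on May 13, 2025, since the updates do not state a date for them. It also assumes an English locale for month-name parsing and Python 3.9 or later.

```python
from datetime import datetime, timezone

# Timestamp layout used on this page, e.g. "May 13, 2025, 05:22 AM UTC".
FMT = "%b %d, %Y, %I:%M %p"


def parse_utc(stamp: str) -> datetime:
    """Parse a status-page timestamp into a timezone-aware UTC datetime."""
    text = stamp.removesuffix(" UTC")
    return datetime.strptime(text, FMT).replace(tzinfo=timezone.utc)


def fmt_delta(start: datetime, end: datetime) -> str:
    """Format the gap between two datetimes as 'Xh Ym'."""
    minutes = int((end - start).total_seconds() // 60)
    return f"{minutes // 60}h {minutes % 60}m"


opened = parse_utc("May 13, 2025, 05:22 AM UTC")
resolved = parse_utc("May 13, 2025, 11:29 AM UTC")

# Times quoted inside the update text; the date is an assumption (same day).
errors_began = opened.replace(hour=1, minute=30)
errors_ended = opened.replace(hour=6, minute=27)

incident_duration = fmt_delta(opened, resolved)        # incident open to resolved
error_window = fmt_delta(errors_began, errors_ended)   # elevated-error period
detection_lag = fmt_delta(errors_began, opened)        # onset to first update

# A 0.015% failure rate expressed as failures per 100,000 calls.
failures_per_100k = round(0.015 / 100 * 100_000)

print(incident_duration, error_window, detection_lag, failures_per_100k)
```

Under those assumptions, the reported duration covers the time the incident was open on the status page, while the period of elevated errors described in the updates started before the first update and ended well before the incident was marked resolved.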