SignalFx US1 incident

Splunk APM Monitoring MetricSets Delayed

Minor Resolved View vendor source →

SignalFx US1 experienced a minor incident on March 26, 2026 affecting Splunk APM Interface and Splunk APM Monitoring MetricSets, lasting 2h 27m. The incident has been resolved; the full update timeline is below.

Started
Mar 26, 2026, 02:18 AM UTC
Resolved
Mar 26, 2026, 04:46 AM UTC
Duration
2h 27m
Detected by Pingoru
Mar 26, 2026, 02:18 AM UTC

Affected components

Splunk APM InterfaceSplunk APM Monitoring MetricSets

Update timeline

  1. investigating Mar 26, 2026, 02:18 AM UTC

    A degradation in the performance of the Splunk APM metrics processing pipeline is causing Monitoring MetricSets to be delayed by more than five minutes. Trace data ingest is not impacted, but service, endpoint and workflow dashboards, and other charts and detectors built from Monitoring MetricSets are impacted.

  2. investigating Mar 26, 2026, 02:20 AM UTC

    We are continuing to investigate this issue.

  3. investigating Mar 26, 2026, 02:21 AM UTC

    We are continuing to investigate this issue.

  4. investigating Mar 26, 2026, 02:45 AM UTC

    We are continuing to investigate the performance degradation affecting Splunk APM Monitoring MetricSets. While trace data ingestion remains fully operational, users may experience delays of more than five minutes in dashboard, chart, and detector data. Our team is actively working to restore normal processing speeds, and we will provide further updates as more information becomes available

  5. investigating Mar 26, 2026, 03:18 AM UTC

    We are actively working to resolve the delay affecting trace monitoring metrics. While trace data ingestion remains unaffected, please note that dashboards, charts, and detectors may experience delays of over five minutes. We are working to restore normal latency and will provide further updates as they become available.

  6. monitoring Mar 26, 2026, 03:56 AM UTC

    Recovery efforts for trace monitoring metrics in the US1 region are progressing, with system lag steadily decreasing. We continue to monitor the service closely as it returns to normal performance levels. No data loss has occurred.

  7. monitoring Mar 26, 2026, 04:26 AM UTC

    We are continuing our recovery efforts for trace monitoring metrics in the US1 region. As system lag steadily decreases, we remain engaged in our standard monitoring procedures to ensure a full return to service. No data loss has occurred.

  8. resolved Mar 26, 2026, 04:46 AM UTC

    The issue impacting trace monitoring metrics in the US1 region is now resolved. All the APM monitoring sets are now fully operational and performing as expected. We confirm that no data was lost. Thank you for your patience.