Pulseway incident

Intermittent Service Disruption - Pulseway

Minor Resolved View vendor source →
Started
Mar 12, 2026, 10:21 AM UTC
Resolved
Mar 13, 2026, 03:51 PM UTC
Duration
1d 5h
Detected by Pingoru
Mar 12, 2026, 10:21 AM UTC

Affected components

SaaS - United StatesSaaS - Europe

Update timeline

  1. investigating Mar 12, 2026, 10:21 AM UTC

    We are experiencing Intermittent service disruption in the Pulseway Web Application, which may impact administrators' ability to manage endpoints. We are aware of the issue, and our team is actively working to resolve the issue. We apologize for any inconvenience caused. - Cloud Operations Team

  2. investigating Mar 12, 2026, 03:18 PM UTC

    We are continuing to investigate this issue.

  3. identified Mar 12, 2026, 04:42 PM UTC

    The issue has been identified and a fix is being implemented in a form of a hotfix. We expect this change to be deployed in the next few hours.

  4. identified Mar 13, 2026, 08:30 AM UTC

    We are continuing to work on a fix for this issue.

  5. monitoring Mar 13, 2026, 09:55 AM UTC

    The fix has been implemented for the EMEA region and is currently being deployed for the US region. We are monitoring the results.

  6. monitoring Mar 13, 2026, 10:01 AM UTC

    The hotfix was successfully deployed in the US region, and we continue to monitor the results.

  7. resolved Mar 13, 2026, 03:51 PM UTC

    This incident has been resolved.

  8. postmortem Mar 19, 2026, 04:19 PM UTC

    # Summary Between 2026‑03‑12 at 06:21 AM EDT and 2026‑03‑13 at 11:51 AM EDT, customers experienced a performance degradation in the Web Application, which could manifest as slowness or errors while navigating the product. During this period, managed devices intermittently appeared to switch between online and offline states, triggering status‑change notifications when configured, despite the devices’ actual connection status remaining unchanged. # Root Cause A code change introduced in the 9.25 release caused the service responsible for collecting and storing Patch History information to spawn multiple unexpected instances due to a dependency-logic flaw. This led the service to repeatedly trigger full patch‑history resynchronizations across all agents. The resulting surge in request volume gradually saturated the web services, causing progressive performance degradation and ultimately rendering some web service instances unavailable. The R&D team has deployed an Agent hotfix to resolve the issue and has temporarily disabled the patch error history feature in the Web Application. This feature will be re‑enabled, and all related data will be restored during the next release cycle. # Preventative Measures To reduce the likelihood and impact of similar incidents in the future, we are taking the following steps * **Hardening of Internal Systems and Dependencies:** The R&D Team will conduct a comprehensive audit of all dependency logic across the product and verify that injected dependencies consistently perform as expected under load and across various operational scenarios. * **Enhanced Release Management and Control:** The R&D Team will implement additional mandatory quality‑assurance measures to ensure that all shared components and injected dependencies are properly validated and scoped to their intended use‑case scenarios. These measures will help prevent unintended states by confirming that dependencies behave consistently, remain isolated to their defined contexts, and do not introduce side effects across the system. * **Enhanced Monitoring, Alerting, and Response:** The R&D Team will implement enhanced monitoring to identify unexpected database growth in real time across both QA and production environments. These metrics will be integrated with existing anomaly‑detection systems to enable faster detection and response to potentially undesirable states that could impact system stability or degrade user experience. * **Enhanced Incident Management and Response:** The teams will continue to maintain and enhance scenario‑specific playbooks to ensure that validated mitigation and resolution procedures are executed quickly and accurately. These improvements are intended to further reduce the time required to detect, diagnose, and resolve issues when they occur.

Looking to track Pulseway downtime and outages?

Pingoru polls Pulseway's status page every 5 minutes and alerts you the moment it reports an issue — before your customers do.

  • Real-time alerts when Pulseway reports an incident
  • Email, Slack, Discord, Microsoft Teams, and webhook notifications
  • Track Pulseway alongside 5,000+ providers in one dashboard
  • Component-level filtering
  • Notification groups + maintenance calendar
Start monitoring Pulseway for free

5 free monitors · No credit card required