Netdata Outage History

Netdata is up right now

Netdata had 11 outages in the last 2 years, totaling 333h 44m of downtime and averaging 0.5 incidents per month.

All 11 outages occurred since June 4, 2025. Each is summarised below with incident details, duration, and resolution information.
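
The per-month average can be checked with a couple of lines of Python. This is only a sanity check, and it assumes that "the last 2 years" means a full 24-month window; the variable names are illustrative.

```python
OUTAGE_COUNT = 11        # outage count from the summary
WINDOW_MONTHS = 2 * 12   # assumption: "last 2 years" is a full 24-month window

# Average number of incidents per month over the assumed window.
incidents_per_month = OUTAGE_COUNT / WINDOW_MONTHS

print(f"{incidents_per_month:.1f} incidents per month")  # prints "0.5 incidents per month"
```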

Source: https://status.netdata.cloud

Minor May 8, 2026

False-Positive Node Reachability Alerts

Detected by Pingoru
May 08, 2026, 01:13 AM UTC
Resolved
May 08, 2026, 06:00 AM UTC
Duration
4h 47m
Affected: Cloud Alerts, Agent - Cloud Connection (ACLK)
Timeline · 2 updates
  1. monitoring May 08, 2026, 01:13 AM UTC

    We are currently tracking an issue with our upstream cloud provider that is affecting a subset of our Kubernetes nodes. As a result, some customers may receive false-positive alerts regarding node reachability. Please be assured that your underlying services and workloads remain operational, and these alerts can be safely ignored at this time. We are closely monitoring the situation with our provider.

  2. resolved May 08, 2026, 06:00 AM UTC

    This incident has been resolved.

Notice April 15, 2026

Kubernetes node failure.

Detected by Pingoru
Apr 15, 2026, 06:38 AM UTC
Resolved
Apr 15, 2026, 06:38 AM UTC
Duration
Timeline · 1 update
  1. resolved Apr 15, 2026, 06:38 AM UTC

    At ~02:20 UTC, a Kubernetes node running part of our ingress controller failed and was auto-replaced. Some clients experienced brief agent disconnections and false-positive node-unavailability alerts during the failover. Service fully recovered after a few minutes. We apologize for the inconvenience.

Notice March 9, 2026

Unexpected Loadbalancer restart

Detected by Pingoru
Mar 09, 2026, 01:49 PM UTC
Resolved
Mar 08, 2026, 03:30 PM UTC
Duration
Timeline · 1 update
  1. resolved Mar 09, 2026, 01:49 PM UTC

    Yesterday at exactly 16:34 UTC, certain processes on our load balancer failed, causing a brief infrastructure incident. More than half of our globally connected nodes reconnected as a result, which in turn triggered false-positive alerts notifying clients of node downtime. We apologize for the confusion and are taking steps to ensure this does not happen again.

Critical November 18, 2025

Services Unavailable

Detected by Pingoru
Nov 18, 2025, 12:11 PM UTC
Resolved
Nov 18, 2025, 07:22 PM UTC
Duration
7h 11m
Affected: Cloud Web UI, Cloud Charts, Agent - Cloud Connection (ACLK), Agent Repositories
Timeline · 3 updates
  1. investigating Nov 18, 2025, 12:11 PM UTC

    Because Cloudflare is experiencing network issues, Netdata Cloud and our package repositories are currently unavailable.

  2. monitoring Nov 18, 2025, 12:23 PM UTC

    Connectivity is recovering. We'll keep this incident open until we see traffic at previous levels. See https://www.cloudflarestatus.com/incidents/8gmgl950y3h7 for details on the Cloudflare outage itself.

  3. resolved Nov 18, 2025, 07:22 PM UTC

    This incident has been resolved.

Major September 29, 2025

Agent v2.7.0 on Windows doesn't start go.d plugin

Detected by Pingoru
Sep 29, 2025, 11:36 AM UTC
Resolved
Sep 30, 2025, 06:29 PM UTC
Duration
1d 6h
Affected: Agent (all platforms)
Timeline · 3 updates
  1. identified Sep 29, 2025, 11:36 AM UTC

    We've identified an issue with the latest Agent release on Windows that prevents the go.d plugin from starting. As a result, Windows nodes collect no metrics from the go.d plugin, and the corresponding alerts don't fire. We will publish a patch release to address this as soon as possible today. As a workaround, the plugin binary in Program Files can be renamed to add the missing .exe extension.

  2. monitoring Sep 30, 2025, 05:04 AM UTC

    We have released Netdata Agent v2.7.1 and recommend that all Windows users upgrade. Release notes: https://github.com/netdata/netdata/releases/tag/v2.7.1

  3. resolved Sep 30, 2025, 06:29 PM UTC

    This incident has been resolved.
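
The rename workaround mentioned in the first update could be scripted roughly as follows. This is a minimal sketch, not an official Netdata tool: the incident update does not give the exact path or file name of the plugin binary, so the caller has to supply it, and `add_exe_extension` and `example.plugin` are hypothetical names used only for illustration. Upgrading to v2.7.1 makes the workaround unnecessary.

```python
import tempfile
from pathlib import Path


def add_exe_extension(binary: Path) -> Path:
    """Rename `binary` so its name ends in .exe and return the resulting path."""
    if binary.suffix.lower() == ".exe":
        return binary  # already has the extension, nothing to do
    target = binary.with_name(binary.name + ".exe")
    return binary.rename(target)  # Path.rename returns the new path (Python 3.8+)


# Demonstration in a throwaway directory with a made-up file name,
# so nothing under Program Files is touched.
with tempfile.TemporaryDirectory() as tmp:
    demo = Path(tmp) / "example.plugin"
    demo.touch()
    renamed = add_exe_extension(demo)
    renamed_name = renamed.name
    renamed_exists = renamed.exists()
    print(renamed_name)
```

On a real node, the function would be called with the actual location of the plugin binary, most likely from an elevated prompt, since files under Program Files usually require administrator rights to rename.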

Major August 21, 2025

Package availability issues with native DEB/RPM packages.

Detected by Pingoru
Aug 21, 2025, 07:35 PM UTC
Resolved
Aug 21, 2025, 07:58 PM UTC
Duration
22m
Affected: Agent Repositories
Timeline · 4 updates
  1. investigating Aug 21, 2025, 07:35 PM UTC

    We are currently investigating issues with availability of our nightly DEB and RPM packages which are preventing updates and installs from working correctly.

  2. identified Aug 21, 2025, 07:39 PM UTC

    The issue has been identified and a fix is being implemented.

  3. monitoring Aug 21, 2025, 07:42 PM UTC

    A fix has been implemented and we expect the repositories to be functioning normally within the next 30 minutes.

  4. resolved Aug 21, 2025, 07:58 PM UTC

    All repositories are functioning normally again.

Notice August 14, 2025

Issues with package availability in official DEB/RPM package repositories

Detected by Pingoru
Aug 14, 2025, 03:03 PM UTC
Resolved
Aug 14, 2025, 03:12 PM UTC
Duration
9m
Affected: Agent Repositories
Timeline · 2 updates
  1. monitoring Aug 14, 2025, 03:03 PM UTC

    Due to issues in our package upload process, the v2.6.2 stable release and the nightly builds for 2025-08-13 and 2025-08-14 were not fully uploaded to our official DEB/RPM repositories. We have already identified the root cause and implemented a fix, and are currently monitoring the processing of the package upload backlog to ensure that all of the packages are properly available.

  2. resolved Aug 14, 2025, 03:12 PM UTC

    We have confirmed that the issue that caused the incomplete package uploads has been fully fixed, and all packages that should have been uploaded since the issue first occurred have been fully uploaded and are available in the repositories.

Minor August 11, 2025

Increased data request latencies

Detected by Pingoru
Aug 11, 2025, 12:27 PM UTC
Resolved
Aug 20, 2025, 01:09 PM UTC
Duration
9d
Affected: Cloud Charts, Agent - Cloud Connection (ACLK)
Timeline · 5 updates
  1. investigating Aug 11, 2025, 12:27 PM UTC

    We are investigating an issue with increased data request latencies that can result in slow or completely empty charts for some customers. The affected Netdata Agent versions are v2.6.0 and later, for all stable and nightly builds.

  2. investigating Aug 11, 2025, 02:24 PM UTC

    We will be adding additional debugging code in both the Agent (next nightly) and Cloud Backend to track what is causing these spurious latencies.

  3. identified Aug 12, 2025, 09:28 AM UTC

    We have identified the cause of this issue and have been creating a fix which will appear in the upcoming nightly build. When we are confident that there are no negative effects on that build, we will do a new patch release on the stable channel the next day. What happened is that we were not correctly clearing MQTT messages that had been acknowledged by the broker, causing newer messages to not be correctly recognized as acknowledged in certain conditions.

  4. monitoring Aug 13, 2025, 03:30 PM UTC

    We've implemented a change for this issue, which appeared in our latest nightly and resolved the problem. We are now issuing a new stable release, v2.6.2, to include that fix, and will resolve this issue when all packages are available.

  5. resolved Aug 20, 2025, 01:09 PM UTC

    This incident has been resolved. We recommend upgrading Agents to v2.6.2 on the stable channel, or to the latest nightly build, at your earliest convenience.
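
The root cause described in the third update, acknowledged MQTT messages not being cleared, can be illustrated with a small in-flight tracker. This is a hypothetical sketch and not the Netdata Agent's actual code; `InflightTracker` and its methods are invented for illustration. It relies on one general MQTT property: packet identifiers become available for reuse once the broker has acknowledged them, so a stale entry can collide with a newer message that carries the same identifier.

```python
class InflightTracker:
    """Tracks published messages that still await a broker acknowledgement."""

    def __init__(self):
        self._pending = {}  # packet id -> payload awaiting PUBACK

    def on_publish(self, packet_id, payload):
        # A stale, never-cleared entry would collide with a reused packet id.
        if packet_id in self._pending:
            raise ValueError(f"packet id {packet_id} is still marked in flight")
        self._pending[packet_id] = payload

    def on_puback(self, packet_id):
        # Clearing the entry is the key step: it frees the id for reuse and
        # lets later acknowledgements be matched to the right message.
        return self._pending.pop(packet_id, None) is not None

    def pending_count(self):
        return len(self._pending)


tracker = InflightTracker()
tracker.on_publish(1, b"first data request")
first_ack_matched = tracker.on_puback(1)      # entry cleared here
tracker.on_publish(1, b"second data request")  # id 1 can be reused safely
```

If `on_puback` left the entry in place, the second `on_publish` with the same identifier would find the old message still marked as in flight, which is one way newer messages could fail to be recognized as acknowledged.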

Notice July 25, 2025

Windows Agent Monitoring Issues

Detected by Pingoru
Jul 25, 2025, 12:27 PM UTC
Resolved
Jul 28, 2025, 10:44 AM UTC
Duration
2d 22h
Affected: Agent (all platforms)
Timeline · 2 updates
  1. monitoring Jul 25, 2025, 12:27 PM UTC

    We identified monitoring issues affecting customers using Netdata Agent version 2.6.0 on Windows servers. These issues were caused by upstream changes in Windows libraries that impacted agent functionality.

    Resolution
    Fixed Version Available: Netdata Agent v2.6.1
    Release Date: July 25, 2025 (16:00 UTC)
    Preview Available: Nightly release v2.6.0-2

    Required Action
    Immediate upgrade recommended: All users running Netdata Agent v2.6.0 on Windows servers should upgrade to v2.6.1 as soon as possible to restore full monitoring capabilities.

    We appreciate your patience as we continue working on additional improvements related to the upstream Windows library changes.

  2. resolved Jul 28, 2025, 10:44 AM UTC

    This incident has been resolved.

Minor June 4, 2025

MQTT broker failure

Detected by Pingoru
Jun 04, 2025, 12:59 PM UTC
Resolved
Jun 04, 2025, 01:49 PM UTC
Duration
49m
Affected: Agent - Cloud Connection (ACLK)
Timeline · 3 updates
  1. identified Jun 04, 2025, 12:59 PM UTC

    We had a major problem with our MQTT broker. It is currently up and running and agents are reconnecting to the cloud.

  2. monitoring Jun 04, 2025, 01:42 PM UTC

    A fix has been implemented and we are monitoring the results.

  3. resolved Jun 04, 2025, 01:49 PM UTC

    This incident has been resolved.
