Netdata Outage History

Netdata is up right now

Netdata had 17 outages in the last 2 years (since July 18, 2024), totaling 376h 18m of downtime and averaging 0.7 incidents per month. Each is summarised below with incident details, duration, and resolution information.
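The averages quoted above can be reproduced with a few lines of arithmetic. This is a minimal sketch: the function names are hypothetical, and the 24-month window is an assumption read from "the last 2 years".

```python
def incidents_per_month(incident_count: int, months: int) -> float:
    """Average number of incidents per month over the reporting window."""
    return incident_count / months


def mean_downtime_minutes(total_hours: int, total_minutes: int, incident_count: int) -> float:
    """Average downtime per incident, in minutes."""
    return (total_hours * 60 + total_minutes) / incident_count


# Figures quoted in the summary; the 24-month window is an assumption.
rate = incidents_per_month(17, 24)
mean_minutes = mean_downtime_minutes(376, 18, 17)

print(f"{rate:.1f} incidents per month")
print(f"{int(mean_minutes // 60)}h {int(mean_minutes % 60)}m average downtime per incident")
```

Several incidents below have no recorded duration, so the per-incident average is only a rough figure.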

Source: https://status.netdata.cloud

Notice May 30, 2026

Agent connectivity issue on legacy systems

Detected by Pingoru
May 30, 2026, 11:21 AM UTC
Resolved
May 29, 2026, 06:00 AM UTC
Duration
Timeline · 1 update
  1. resolved May 30, 2026, 11:21 AM UTC

    Following a scheduled TLS certificate renewal, a subset of Netdata agents failed to reconnect to Netdata Cloud. The issue stems from outdated CA (Certificate Authority) trust stores on certain legacy Linux distributions, like:

    • CentOS 7 / RHEL 7
    • Amazon Linux 2
    • Debian 9 (Jessie)
    • Ubuntu 16.04 LTS
    • Oracle Linux 7

    A quick way to test for this:

    curl -v https://app.netdata.cloud

    If the output shows an error indicating the certificate has expired, the system needs an updated CA bundle.
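The same check can be scripted for hosts where running curl by hand is impractical. The following is a minimal sketch, not an official Netdata tool: the function name `check_tls_trust` is hypothetical, and it assumes Python's default SSL context reads the same system CA bundle that the agent relies on, which may not hold on every distribution.

```python
import socket
import ssl


def check_tls_trust(host: str = "app.netdata.cloud", port: int = 443,
                    timeout: float = 10.0) -> str:
    """Return "ok" if the TLS handshake verifies, else a short failure description."""
    # The default context verifies the server certificate against the
    # CA certificates that OpenSSL finds in its default locations.
    context = ssl.create_default_context()
    try:
        with socket.create_connection((host, port), timeout=timeout) as sock:
            with context.wrap_socket(sock, server_hostname=host):
                return "ok"
    except ssl.SSLCertVerificationError as exc:
        # Raised when the chain cannot be validated, for example because the
        # local trust store is outdated or a certificate appears expired.
        return f"certificate verification failed: {exc.verify_message}"
    except OSError as exc:
        # Covers DNS failures, refused connections, timeouts, other TLS errors.
        return f"connection failed: {exc}"


if __name__ == "__main__":
    print(check_tls_trust())
```

A result starting with "certificate verification failed" points to the trust store rather than to the network, which is the case described in this incident.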


Minor May 8, 2026

False-Positive Node Reachability Alerts

Detected by Pingoru
May 08, 2026, 01:13 AM UTC
Resolved
May 08, 2026, 06:00 AM UTC
Duration
4h 47m
Affected: Cloud Alerts, Agent - Cloud Connection (ACLK)
Timeline · 2 updates
  1. monitoring May 08, 2026, 01:13 AM UTC

    We are currently tracking an issue with our upstream cloud provider that is affecting a subset of our Kubernetes nodes. As a result, some customers may receive false-positive alerts regarding node reachability. Please be assured that your underlying services and workloads remain operational, and these alerts can be safely ignored at this time. We are closely monitoring the situation with our provider.

  2. resolved May 08, 2026, 06:00 AM UTC

    This incident has been resolved.


Notice April 15, 2026

Kubernetes node failure.

Detected by Pingoru
Apr 15, 2026, 06:38 AM UTC
Resolved
Apr 15, 2026, 06:38 AM UTC
Duration
Timeline · 1 update
  1. resolved Apr 15, 2026, 06:38 AM UTC

    At ~02:20 UTC, a Kubernetes node running part of our ingress controller failed and was auto-replaced. Some clients experienced brief agent disconnections and false-positive node-unavailability alerts during the failover. Service fully recovered after a few minutes. We apologize for the inconvenience.


Notice March 9, 2026

Unexpected Loadbalancer restart

Detected by Pingoru
Mar 09, 2026, 01:49 PM UTC
Resolved
Mar 08, 2026, 03:30 PM UTC
Duration
Timeline · 1 update
  1. resolved Mar 09, 2026, 01:49 PM UTC

    Yesterday, a brief infrastructure incident occurred when certain processes on our load balancer failed (at exactly 16:34 UTC). This caused more than half of our globally connected nodes to reconnect, which in turn triggered false-positive alerts notifying clients of node downtime. We apologize for the confusion and are taking steps to ensure this does not happen again.


Critical November 18, 2025

Services Unavailable

Detected by Pingoru
Nov 18, 2025, 12:11 PM UTC
Resolved
Nov 18, 2025, 07:22 PM UTC
Duration
7h 11m
Affected: Cloud Web UI, Cloud Charts, Agent - Cloud Connection (ACLK), Agent Repositories
Timeline · 3 updates
  1. investigating Nov 18, 2025, 12:11 PM UTC

    Because Cloudflare is experiencing network issues, Netdata Cloud and our package repositories are currently unavailable.

  2. monitoring Nov 18, 2025, 12:23 PM UTC

    Connectivity is recovering; we'll keep this incident open until we see traffic at previous levels. See https://www.cloudflarestatus.com/incidents/8gmgl950y3h7 for details on the Cloudflare outage itself.

  3. resolved Nov 18, 2025, 07:22 PM UTC

    This incident has been resolved.


Major September 29, 2025

Agent v2.7.0 on Windows doesn't start go.d plugin

Detected by Pingoru
Sep 29, 2025, 11:36 AM UTC
Resolved
Sep 30, 2025, 06:29 PM UTC
Duration
1d 6h
Affected: Agent (all platforms)
Timeline · 3 updates
  1. identified Sep 29, 2025, 11:36 AM UTC

    We've identified an issue with the latest Agent release on Windows that prevents the go.d plugin from starting. As a result, on Windows nodes none of the metrics from the go.d plugin are collected and the corresponding alerts don't fire. We will publish a patch release to address this as soon as possible today. As a workaround, the plugin binary in Program Files can be renamed to add the missing .exe extension.

  2. monitoring Sep 30, 2025, 05:04 AM UTC

    We have released Netdata Agent v2.7.1 and recommend that all Windows users upgrade. Release notes: https://github.com/netdata/netdata/releases/tag/v2.7.1

  3. resolved Sep 30, 2025, 06:29 PM UTC

    This incident has been resolved.


Major August 21, 2025

Package availability issues with native DEB/RPM packages.

Detected by Pingoru
Aug 21, 2025, 07:35 PM UTC
Resolved
Aug 21, 2025, 07:58 PM UTC
Duration
22m
Affected: Agent Repositories
Timeline · 4 updates
  1. investigating Aug 21, 2025, 07:35 PM UTC

    We are currently investigating issues with availability of our nightly DEB and RPM packages which are preventing updates and installs from working correctly.

  2. identified Aug 21, 2025, 07:39 PM UTC

    The issue has been identified and a fix is being implemented.

  3. monitoring Aug 21, 2025, 07:42 PM UTC

    A fix has been implemented and we expect the repositories to be functioning normally within the next 30 minutes.

  4. resolved Aug 21, 2025, 07:58 PM UTC

    All repositories are functioning normally again.


Notice August 14, 2025

Issues with package availability in official DEB/RPM package repositories

Detected by Pingoru
Aug 14, 2025, 03:03 PM UTC
Resolved
Aug 14, 2025, 03:12 PM UTC
Duration
9m
Affected: Agent Repositories
Timeline · 2 updates
  1. monitoring Aug 14, 2025, 03:03 PM UTC

    Due to issues in our package upload process, the v2.6.2 stable release and the nightly builds for 2025-08-13 and 2025-08-14 were not fully uploaded to our official DEB/RPM repositories. We have already identified the root cause and implemented a fix, and are currently monitoring the processing of the package upload backlog to ensure that all of the packages are properly available.

  2. resolved Aug 14, 2025, 03:12 PM UTC

    We have confirmed that the issue that caused the incomplete package uploads has been fully fixed, and all packages that should have been uploaded since the issue first occurred have been fully uploaded and are available in the repositories.


Minor August 11, 2025

Increased data request latencies

Detected by Pingoru
Aug 11, 2025, 12:27 PM UTC
Resolved
Aug 20, 2025, 01:09 PM UTC
Duration
9d
Affected: Cloud Charts, Agent - Cloud Connection (ACLK)
Timeline · 5 updates
  1. investigating Aug 11, 2025, 12:27 PM UTC

    We are investigating an issue with increased data request latencies that can result in slow or completely empty charts for some customers. The affected Netdata Agent versions are v2.6.0 and later, for all stable and nightly builds.

  2. investigating Aug 11, 2025, 02:24 PM UTC

    We will be adding additional debugging code in both the Agent (next nightly) and Cloud Backend to track what is causing these spurious latencies.

  3. identified Aug 12, 2025, 09:28 AM UTC

    We have identified the cause of this issue and are preparing a fix, which will appear in the upcoming nightly build. Once we are confident that build has no negative effects, we will publish a new patch release on the stable channel the next day. The cause: we were not correctly clearing MQTT messages that had been acknowledged by the broker, so under certain conditions newer messages were not correctly recognized as acknowledged.

  4. monitoring Aug 13, 2025, 03:30 PM UTC

    We've implemented a change for this issue, which appeared in our latest nightly and resolved the problem. We are now issuing a new stable release, v2.6.2, to include that fix, and will resolve this incident when all packages are available.

  5. resolved Aug 20, 2025, 01:09 PM UTC

    This incident has been resolved. We recommend upgrading Agents to v2.6.2 on the stable channel, or to the latest nightly build, at your earliest convenience.
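The root cause described in update 3 (acknowledged MQTT messages not being cleared) can be illustrated with a small in-flight tracker. This is a hypothetical sketch of the general technique, not the Netdata Agent's actual code: the class `InflightTracker` and its methods are invented for illustration. It relies only on the fact that MQTT QoS 1 publishes carry a non-zero 16-bit packet identifier that the broker echoes back in its PUBACK.

```python
class InflightTracker:
    """Tracks published messages awaiting PUBACK, keyed by packet ID (hypothetical)."""

    def __init__(self, max_packet_id: int = 65535):
        self._max_id = max_packet_id
        self._next_id = 1
        self._inflight = {}  # packet ID -> payload still awaiting acknowledgement

    def publish(self, payload: bytes) -> int:
        """Reserve a free packet ID for the payload and return it."""
        for _ in range(self._max_id):
            packet_id = self._next_id
            self._next_id = packet_id % self._max_id + 1  # wrap around, skipping 0
            if packet_id not in self._inflight:
                self._inflight[packet_id] = payload
                return packet_id
        raise RuntimeError("no free packet IDs: too many unacknowledged messages")

    def on_puback(self, packet_id: int) -> bool:
        """Clear the acknowledged message; return False for an unknown ID."""
        # Removing the entry is the essential step. If stale entries linger,
        # a later message reusing the same ID can no longer be matched
        # reliably with its own acknowledgement.
        return self._inflight.pop(packet_id, None) is not None

    def pending(self) -> int:
        """Number of messages still waiting for an acknowledgement."""
        return len(self._inflight)
```

With a deliberately tiny ID space the effect is easy to see: once every ID is held by an uncleared entry, new publishes stall until an acknowledgement frees one.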


Notice July 25, 2025

Windows Agent Monitoring Issues

Detected by Pingoru
Jul 25, 2025, 12:27 PM UTC
Resolved
Jul 28, 2025, 10:44 AM UTC
Duration
2d 22h
Affected: Agent (all platforms)
Timeline · 2 updates
  1. monitoring Jul 25, 2025, 12:27 PM UTC

    We identified monitoring issues affecting customers using Netdata Agent version 2.6.0 on Windows servers. These issues were caused by upstream changes in Windows libraries that impacted agent functionality.

    Resolution
    • Fixed Version Available: Netdata Agent v2.6.1
    • Release Date: July 25, 2025 (16:00 UTC)
    • Preview Available: Nightly release v2.6.0-2

    Required Action
    Immediate upgrade recommended: all users running Netdata Agent v2.6.0 on Windows servers should upgrade to v2.6.1 as soon as possible to restore full monitoring capabilities.

    We appreciate your patience as we continue working on additional improvements related to the upstream Windows library changes.

  2. resolved Jul 28, 2025, 10:44 AM UTC

    This incident has been resolved.


Minor June 4, 2025

MQTT broker failure

Detected by Pingoru
Jun 04, 2025, 12:59 PM UTC
Resolved
Jun 04, 2025, 01:49 PM UTC
Duration
49m
Affected: Agent - Cloud Connection (ACLK)
Timeline · 3 updates
  1. identified Jun 04, 2025, 12:59 PM UTC

    We had a major problem with our MQTT broker. It is currently up and running and agents are reconnecting to the cloud.

  2. monitoring Jun 04, 2025, 01:42 PM UTC

    A fix has been implemented and we are monitoring the results.

  3. resolved Jun 04, 2025, 01:49 PM UTC

    This incident has been resolved.


Minor February 25, 2025

Latest Agent nightly build (v2.2.0-245) broken at first start

Detected by Pingoru
Feb 25, 2025, 09:41 AM UTC
Resolved
Feb 25, 2025, 03:25 PM UTC
Duration
5h 44m
Affected: Agent (all platforms)
Timeline · 5 updates
  1. investigating Feb 25, 2025, 09:41 AM UTC

    We are investigating an issue with today's nightly (v2.2.0-245) that causes alerting ("health") to not work and external plugins, including go.d, to not connect properly. This may be resolved by restarting the Agent. Stable versions of the Agent are not affected.

  2. identified Feb 25, 2025, 11:04 AM UTC

    We have identified the issue, committed a fix, and initiated new nightly builds for all platforms. This will take several hours. In the meantime, please restart Netdata to work around the issue.

  3. monitoring Feb 25, 2025, 01:14 PM UTC

    The builds are completed, so we are watching out for any remaining related issues.

  4. monitoring Feb 25, 2025, 01:29 PM UTC

    We are continuing to monitor for any further issues.

  5. resolved Feb 25, 2025, 03:25 PM UTC

    This incident has been resolved.


Notice November 15, 2024

Alarm Processing Delays

Detected by Pingoru
Nov 15, 2024, 08:23 AM UTC
Resolved
Nov 15, 2024, 07:00 AM UTC
Duration
Timeline · 1 update
  1. resolved Nov 15, 2024, 08:23 AM UTC

    Our alarm processing infrastructure was running behind, which caused inaccurate alarms for some nodes. No data has been lost, and the systems should already be up to date.


Minor November 8, 2024

Alerting is working slower

Detected by Pingoru
Nov 08, 2024, 08:07 AM UTC
Resolved
Nov 08, 2024, 01:36 PM UTC
Duration
5h 29m
Affected: Cloud Alerts
Timeline · 4 updates
  1. investigating Nov 08, 2024, 08:07 AM UTC

    Due to the release of Netdata Agent 2.0, we have a large backlog of alarms. We are investigating this issue.

  2. identified Nov 08, 2024, 09:49 AM UTC

    The issue has been identified and a fix is being implemented.

  3. monitoring Nov 08, 2024, 11:01 AM UTC

    A fix has been implemented and we are monitoring the results.

  4. resolved Nov 08, 2024, 01:36 PM UTC

    This incident has been resolved.


Minor July 18, 2024

Delays in alarms on the Netdata Cloud

Detected by Pingoru
Jul 18, 2024, 04:45 AM UTC
Resolved
Jul 18, 2024, 11:41 AM UTC
Duration
6h 56m
Affected: Cloud Web UI
Timeline · 4 updates
  1. investigating Jul 18, 2024, 04:45 AM UTC

    We were alerted to a delay in alarms for some users and are investigating the matter.

  2. identified Jul 18, 2024, 05:37 AM UTC

    The issue has been identified and a fix is being implemented.

  3. identified Jul 18, 2024, 08:17 AM UTC

    Currently, we are waiting for the fix to take effect, and some users might experience delays in all cloud operations.

  4. resolved Jul 18, 2024, 11:41 AM UTC

    This incident has been resolved.
