Netdata incident

Agent connectivity problem.

Major Resolved View vendor source →

Netdata experienced a major incident on June 6, 2023 affecting Agent - Cloud Connection (ACLK) and Agent (all platforms), lasting 1d 1h. The incident has been resolved; the full update timeline is below.

Started
Jun 06, 2023, 06:39 AM UTC
Resolved
Jun 07, 2023, 07:43 AM UTC
Duration
1d 1h
Detected by Pingoru
Jun 06, 2023, 06:39 AM UTC

Affected components

Agent - Cloud Connection (ACLK)Agent (all platforms)

Update timeline

  1. investigating Jun 06, 2023, 06:39 AM UTC

    We are currently investigating the issue.

  2. investigating Jun 06, 2023, 07:08 AM UTC

    We are continuing to investigate this issue.

  3. identified Jun 06, 2023, 07:13 AM UTC

    We found that the issue is caused by latest nightly version of the agent. We are releasing the fix.

  4. monitoring Jun 06, 2023, 09:04 AM UTC

    We had to ban 1.39.0-97 agent version from connecting to the cloud. The exact affected agent versions are: 1.39.0-97-nightly and 1.39.0-97-{hash}. This incident is going to be closed when new Netdata release will be available for the download. Please update your endpoints then or wait for an automatic update to take place tomorrow.

  5. resolved Jun 07, 2023, 07:43 AM UTC

    Connected clients metrics are going back to normal values, new Netdata Agent works as expected.