CloudAMQP incident

CloudWatch and Datadog metrics missing for some clusters

Notice Resolved View vendor source →

CloudAMQP experienced a notice incident on September 12, 2024 affecting Metrics, lasting —. The incident has been resolved; the full update timeline is below.

Started
Sep 12, 2024, 07:51 AM UTC
Resolved
Sep 12, 2024, 07:51 AM UTC
Duration
Detected by Pingoru
Sep 12, 2024, 07:51 AM UTC

Affected components

Metrics

Update timeline

  1. resolved Sep 13, 2024, 02:59 PM UTC

    Between Sep 12 07:51:57 and Sep 12 16:01:05 some RabbitMQ clusters were not reporting metrics data to AWS CloudWatch and Datadog. This issue was because the "cluster_utilization"-value was introduced and not calculated the same way in all RabbitMQ versions, causing a small number of clusters to have their metrics ingestion discarded because of faulty values.