Datadog Integration Outage History


Datadog Integration had 19 outages in the last 2 years totaling 1077h 18m of downtime — averaging 0.8 incidents per month.

Each of the 19 Datadog Integration outages since June 10, 2025 is summarised below with incident details, duration, and resolution information.
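The figures on this page can be rechecked from the listed timestamps. The following is a minimal Python sketch, not the method used by the status page itself: the helper names (`parse_stamp`, `format_duration`, `incidents_per_month`) are hypothetical, and the 24-month window is an assumption taken from the "last 2 years" wording above. Some listed durations are one minute shorter than the difference between the displayed timestamps, presumably because the displayed times omit seconds.

```python
from datetime import datetime, timedelta

# Timestamp layout used in the incident entries, e.g. "May 28, 2026, 02:41 AM UTC".
STAMP_FORMAT = "%b %d, %Y, %I:%M %p UTC"


def parse_stamp(text: str) -> datetime:
    """Parse one 'Detected' or 'Resolved' timestamp as shown on this page."""
    return datetime.strptime(text, STAMP_FORMAT)


def format_duration(delta: timedelta) -> str:
    """Render a duration as 'Xd Yh' for a day or more, otherwise 'Xh Ym' or 'Xm'."""
    total_minutes = int(delta.total_seconds() // 60)
    days, rem = divmod(total_minutes, 24 * 60)
    hours, minutes = divmod(rem, 60)
    if days:
        return f"{days}d {hours}h"
    if hours:
        return f"{hours}h {minutes}m"
    return f"{minutes}m"


def incidents_per_month(count: int, months: int) -> float:
    """Average incident rate, rounded to one decimal place."""
    return round(count / months, 1)


# First incident on this page: OCI metrics delay on May 28, 2026.
detected = parse_stamp("May 28, 2026, 02:41 AM UTC")
resolved = parse_stamp("May 28, 2026, 10:19 AM UTC")
print(format_duration(resolved - detected))

# 19 outages over an assumed 24-month window.
print(incidents_per_month(19, 24))
```

For the first incident this yields the listed 7h 38m, and 19 outages over 24 months rounds to the quoted 0.8 incidents per month.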

Source: https://datadogintegrations.statuspage.io

Minor May 28, 2026

OCI Metrics May Be Delayed or Missing

Detected by Pingoru
May 28, 2026, 02:41 AM UTC
Resolved
May 28, 2026, 10:19 AM UTC
Duration
7h 38m
Affected: Oracle Cloud Infrastructure
Timeline · 2 updates
  1. investigating May 28, 2026, 02:41 AM UTC

    OCI Integration metrics may be delayed due to a partial OCI outage, which may result in delayed or missing data in graphs displaying these metrics. We are investigating the problem and will update as we learn more.

  2. resolved May 28, 2026, 10:19 AM UTC

    This incident has been resolved.


Minor May 14, 2026

Slack outage

Detected by Pingoru
May 14, 2026, 02:45 PM UTC
Resolved
May 14, 2026, 03:38 PM UTC
Duration
53m
Timeline · 2 updates
  1. identified May 14, 2026, 02:45 PM UTC

    We're aware that Slack is currently experiencing an incident affecting file uploads, message edits, channel renames, and channel creations. This is a Slack platform issue and their team is actively investigating. For the latest status updates, please visit Slack's status page (https://slack-status.com/2026-05/fe557ca05fdb64fa). We appreciate your patience and will share any relevant updates as they become available.

  2. resolved May 14, 2026, 03:38 PM UTC

    This incident has been resolved.


Notice May 6, 2026

Integrations installation pipeline is unavailable

Detected by Pingoru
May 06, 2026, 12:25 PM UTC
Resolved
May 06, 2026, 01:36 PM UTC
Duration
1h 10m
Timeline · 4 updates
  1. investigating May 06, 2026, 12:25 PM UTC

    We are currently investigating an issue in the integrations installation pipeline. As a result, the integration installation command in the Agent may fail. Installing or updating extra or marketplace integrations, as well as updating core integrations, may fail. Already installed integrations are not affected and there is no data loss.

  2. identified May 06, 2026, 12:39 PM UTC

    The issue has been identified and a fix is being implemented.

  3. monitoring May 06, 2026, 12:55 PM UTC

    A fix has been implemented and we are monitoring the results.

  4. resolved May 06, 2026, 01:36 PM UTC

    This incident has been resolved.


Major March 13, 2026

Cloudflare API metrics delayed

Detected by Pingoru
Mar 13, 2026, 07:58 AM UTC
Resolved
Mar 13, 2026, 10:24 AM UTC
Duration
2h 26m
Affected: Cloudflare Cloudflare API
Timeline · 5 updates
  1. investigating Mar 13, 2026, 07:58 AM UTC

    Cloudflare API metrics are delayed. You may see gaps and delayed metrics from Cloudflare in your dashboards.

  2. identified Mar 13, 2026, 08:04 AM UTC

    Cloudflare has detected the issue (https://www.cloudflarestatus.com/incidents/cntyn4jrg1jd). You might see gaps in Cloudflare API metrics on your dashboards.

  3. identified Mar 13, 2026, 08:21 AM UTC

    We have identified the problem and are currently backfilling Cloudflare metrics.

  4. monitoring Mar 13, 2026, 08:53 AM UTC

    Cloudflare metrics are up to date. We are investigating whether metrics need to be backfilled.

  5. resolved Mar 13, 2026, 10:24 AM UTC

    The Cloudflare metrics have been backfilled. This incident is resolved.


Minor March 3, 2026

Anthropic Integration Metrics Delayed or Missing

Detected by Pingoru
Mar 03, 2026, 09:00 PM UTC
Resolved
Mar 04, 2026, 11:33 AM UTC
Duration
14h 32m
Timeline · 3 updates
  1. monitoring Mar 03, 2026, 09:00 PM UTC

    Since March 3rd, 2026 at 6:30 PM UTC, we have been experiencing an issue with Anthropic's API which impacts collecting Anthropic metrics. Users may see delayed or missing usage metrics, including token usage, request volume, and latency. We are actively coordinating with Anthropic, who is working to resolve the issue. For additional details, please refer to Anthropic’s status page: https://status.claude.com/incidents/p7nq2jdg4zwj

  2. monitoring Mar 03, 2026, 09:00 PM UTC

    We are continuing to monitor for any further issues.

  3. resolved Mar 04, 2026, 11:33 AM UTC

    This incident has been resolved.


Minor March 3, 2026

OCI Metrics May Be Delayed or Missing in us-ashburn-1

Detected by Pingoru
Mar 03, 2026, 01:11 AM UTC
Resolved
Mar 03, 2026, 10:35 AM UTC
Duration
9h 23m
Affected: Oracle Cloud Infrastructure
Timeline · 3 updates
  1. investigating Mar 03, 2026, 01:11 AM UTC

    OCI Integration metrics in the us-ashburn-1 region may be delayed due to a partial OCI outage, which may result in delayed or missing data in graphs displaying these metrics. We are investigating the problem and will update as we learn more.

  2. monitoring Mar 03, 2026, 08:17 AM UTC

    The outage appears to have been resolved by OCI and points are returning to Datadog. We will continue to monitor until regional metrics return to normal.

  3. resolved Mar 03, 2026, 10:35 AM UTC

    Summary of impact: an OCI outage in the us-ashburn-1 region from 7 PM EST on March 2 until 2 AM EST on March 3 resulted in delayed or missing metrics in that region.


Notice December 3, 2025

PagerDuty Monitor Notifications Degraded

Detected by Pingoru
Dec 03, 2025, 11:46 PM UTC
Resolved
Dec 04, 2025, 12:16 AM UTC
Duration
29m
Timeline · 2 updates
  1. monitoring Dec 03, 2025, 11:46 PM UTC

    PagerDuty is reporting impact to their notification delivery. We can see that our handoff to PagerDuty is working and we will continue to monitor it. Follow here for updates: https://status.pagerduty.com/posts/dashboard

  2. resolved Dec 04, 2025, 12:16 AM UTC

    This incident has been resolved.


Minor October 21, 2025

Jira & Confluence Audit Records, Zoom Activity Logs integrations are failing

Detected by Pingoru
Oct 21, 2025, 07:18 PM UTC
Resolved
Oct 23, 2025, 12:25 AM UTC
Duration
1d 5h
Timeline · 3 updates
  1. investigating Oct 21, 2025, 07:18 PM UTC

    This could affect Logs and Cloud SIEM customers who rely on these integrations. We are reaching out to Atlassian to resolve this issue.

  2. identified Oct 22, 2025, 03:18 PM UTC

    We have identified an issue with the Jira & Confluence Audit Records and Zoom Activity Logs integrations. As a result, affected customers might see log data gaps, which could also affect Cloud SIEM rules relying on this data. We are working with the vendors to resolve this issue and restore data collection.

  3. resolved Oct 23, 2025, 12:25 AM UTC

    We have traced the issue to an authorization token expiration and have communicated resolution steps to the small subset of customers who were impacted.


Minor October 20, 2025

Several Web Integrations affected due to Vendors' outage in US1-east

Detected by Pingoru
Oct 20, 2025, 09:31 AM UTC
Resolved
Oct 21, 2025, 07:23 PM UTC
Duration
1d 9h
Affected: Slack, PagerDuty Notification Delivery, Opsgenie Notification Delivery, Microsoft Teams, BigPanda
Timeline · 6 updates
  1. identified Oct 20, 2025, 09:31 AM UTC

    The AWS US1-east data center has internal issues resulting in several impacted integrations with vendors. Affected vendors and their integrations with us:

    Metrics integrations:
    - SendGrid
    - Salesforce
    - GoDaddy
    - dbt Cloud

    Logs integrations:
    - Twilio
    - JumpCloud
    - Sophos
    - Atlassian
    - Klaviyo
    - Trend Micro
    - Genesys

    Notification integrations:
    - Opsgenie
    - PagerDuty
    - VictorOps
    - Webhooks
    - Jira
    - BigPanda
    - ServiceNow

  2. monitoring Oct 20, 2025, 10:28 AM UTC

    The AWS US1-east data center has been degraded, and some integrations have slowly been able to recover. The following vendors, and consequently our integrations with them, are still impacted:
    - Atlassian (including Jira and Confluence products)
    - Klaviyo

  3. monitoring Oct 20, 2025, 04:58 PM UTC

    We are continuing to monitor for any further issues.

  4. monitoring Oct 20, 2025, 05:55 PM UTC

    We are continuing to monitor for any further issues.

  5. monitoring Oct 20, 2025, 07:19 PM UTC

    We are continuing to monitor for any further issues.

  6. resolved Oct 21, 2025, 07:23 PM UTC

    This incident has been resolved.


Notice September 13, 2025

Upstream OpenAI API instability may result in delayed metrics

Detected by Pingoru
Sep 13, 2025, 02:01 AM UTC
Resolved
Oct 20, 2025, 11:16 AM UTC
Duration
37d 9h
Timeline · 3 updates
  1. investigating Sep 13, 2025, 02:01 AM UTC

    We are investigating an issue with the OpenAI API not returning any data. OpenAI metrics may be delayed as a result.

  2. identified Sep 15, 2025, 04:05 PM UTC

    Upstream instability is persisting, and will affect any customer using a project-scoped API key for the OpenAI integration. Project-scoped API keys have been deprecated for Datadog's OpenAI integration, and we recommend transitioning to an admin-scoped API key instead.

  3. resolved Oct 20, 2025, 11:16 AM UTC

    This incident has been resolved.


Major August 28, 2025

PagerDuty Monitor Notifications Delayed

Detected by Pingoru
Aug 28, 2025, 04:47 AM UTC
Resolved
Aug 28, 2025, 09:05 AM UTC
Duration
4h 17m
Affected: PagerDuty Notification Delivery
Timeline · 5 updates
  1. investigating Aug 28, 2025, 04:47 AM UTC

    Monitor notifications are delayed for PagerDuty.

  2. monitoring Aug 28, 2025, 05:29 AM UTC

    We are continuing to observe delays in notification delivery.

  3. monitoring Aug 28, 2025, 06:31 AM UTC

    We are continuing to observe delays in notification delivery. PagerDuty has updated their status page: https://status.pagerduty.com/posts/details/P0LKNIW

  4. monitoring Aug 28, 2025, 07:42 AM UTC

    We are observing some recovery in notification delays and continue to monitor the situation.

  5. resolved Aug 28, 2025, 09:05 AM UTC

    PagerDuty notification deliveries are back to normal.


Notice August 13, 2025

Google Chat commands are not responsive

Detected by Pingoru
Aug 13, 2025, 02:05 PM UTC
Resolved
Aug 14, 2025, 05:45 PM UTC
Duration
1d 3h
Affected: Google Apps Hangouts
Timeline · 3 updates
  1. investigating Aug 13, 2025, 02:05 PM UTC

    We are currently investigating an issue with Google Chat commands. Notifications are being sent but customers are unable to set up new handles at this time.

  2. identified Aug 13, 2025, 02:27 PM UTC

    This is now impacting notifications in our US5 and AP1 datacenters. We are rolling back the change that impacted this integration.

  3. resolved Aug 14, 2025, 05:45 PM UTC

    Google Chat commands should now be working. Notifications stopped in US5 and AP1 for about 10 minutes yesterday and were resent shortly after.


Notice June 25, 2025

Delays in Microsoft 365 Audit Logs

Detected by Pingoru
Jun 25, 2025, 05:52 PM UTC
Resolved
Jun 26, 2025, 07:00 PM UTC
Duration
1d 1h
Timeline · 3 updates
  1. identified Jun 25, 2025, 05:52 PM UTC

    An issue with the Microsoft 365 audit logs integration has been identified, and a fix is underway. Logs currently produced by this integration will be significantly delayed.

  2. monitoring Jun 25, 2025, 06:41 PM UTC

    A fix has been implemented and we are monitoring the results.

  3. resolved Jun 26, 2025, 07:00 PM UTC

    This incident has been resolved.


Notice June 24, 2025

Bits AI SRE Investigations are degraded due to an ongoing OpenAI incident

Detected by Pingoru
Jun 24, 2025, 08:22 PM UTC
Resolved
Jun 25, 2025, 12:34 AM UTC
Duration
4h 11m
Timeline · 3 updates
  1. monitoring Jun 24, 2025, 08:22 PM UTC

    We are aware that Bits AI SRE Investigations are currently degraded due to an ongoing OpenAI incident (status: https://status.openai.com/).

  2. monitoring Jun 24, 2025, 08:30 PM UTC

    We are seeing recovery in Bits AI SRE performance. We are continuing to monitor the status of the OpenAI incident.

  3. resolved Jun 25, 2025, 12:34 AM UTC

    This incident has been resolved.


Notice June 12, 2025

GCP Integration Metrics delayed

Detected by Pingoru
Jun 12, 2025, 06:51 PM UTC
Resolved
Jun 12, 2025, 09:29 PM UTC
Duration
2h 37m
Affected: Google Cloud Platform Cloud Monitoring API
Timeline · 3 updates
  1. identified Jun 12, 2025, 06:51 PM UTC

    We are investigating increased latency in processing Google Cloud metrics due to a cloud provider outage. As a result, users may see delays or gaps in graphs that contain these metrics.

  2. monitoring Jun 12, 2025, 09:20 PM UTC

    We are seeing recovery and are continuing to monitor.

  3. resolved Jun 12, 2025, 09:29 PM UTC

    This incident has been resolved.


Major June 12, 2025

Elevated API Errors

Detected by Pingoru
Jun 12, 2025, 06:45 PM UTC
Resolved
Jun 12, 2025, 10:29 PM UTC
Duration
3h 43m
Affected: Slack, Microsoft Teams
Timeline · 3 updates
  1. investigating Jun 12, 2025, 06:45 PM UTC

    We're experiencing an elevated level of API errors and are currently looking into the issue.

  2. investigating Jun 12, 2025, 10:28 PM UTC

    We are continuing to investigate this issue.

  3. resolved Jun 12, 2025, 10:29 PM UTC

    This incident has been resolved.
