Iterable Outage History

Iterable is up right now

Iterable had 49 outages in the last 2 years totaling 337h 5m of downtime — averaging 2 incidents per month.

There were 49 Iterable outages since July 5, 2024 totaling 337h 5m of downtime. Each is summarised below — incident details, duration, and resolution information.

Source: https://status.iterable.com

Notice May 19, 2025

Mailgun has Degraded Performance

Detected by Pingoru
May 19, 2025, 09:42 PM UTC
Resolved
May 19, 2025, 10:28 PM UTC
Duration
46m
Timeline · 4 updates
  1. investigating May 19, 2025, 09:42 PM UTC

    Mailgun is reporting degraded performance across their application https://status.mailgun.com/. We are monitoring mailgun and their performance.

  2. monitoring May 19, 2025, 09:43 PM UTC

    We are monitoring the affect of the mailgun degradation

  3. monitoring May 19, 2025, 09:54 PM UTC

    Mailgun has implemented a fix and is in Monitoring status

  4. resolved May 19, 2025, 10:28 PM UTC

    This incident has been resolved.

Read the full incident report →

Critical March 14, 2025

Ingestion Delay on Clusters 100 and Above

Detected by Pingoru
Mar 14, 2025, 04:56 PM UTC
Resolved
Mar 14, 2025, 09:13 PM UTC
Duration
4h 16m
Affected: Global API Ingestion
Timeline · 5 updates
  1. investigating Mar 14, 2025, 04:56 PM UTC

    Our engineering team has identified an issue causing ingestion delay for all Clusters 100 and above. We are deploying the fix right now. Customers on these impacted clusters could be experiencing delays with user updates, list uploads, user deletion, and event processing. Email sends and journey processing may see impact as well. Data is only being delayed. It is not being dropped. The engineering team is working to remediate these delays. Next update at 10:30 AM PDT or sooner.

  2. investigating Mar 14, 2025, 05:31 PM UTC

    The engineering team is still working on the fix, and customers on Clusters 100 and above will still be experiencing some delays with user updates, list uploads, user deletion, and event processing. Email sends and journey processing may see impact as well. Data is only being delayed. It is not being dropped. The engineering team is working to remediate these delays. Next update at 11:00 AM PDT or sooner.

  3. identified Mar 14, 2025, 06:07 PM UTC

    Engineering has put in the fix. Some clusters have fully recovered, and we are actively working on the ones that have not. For those customers that are still impacted, you will still be experiencing some delays with user updates, list uploads, user deletion, and event processing. Email sends and journey processing may see impact as well. Data is only being delayed. It is not being dropped. The engineering team is working to remediate these delays. Next update at 12:30 PM PDT or sooner.

  4. monitoring Mar 14, 2025, 07:30 PM UTC

    Engineering has finished pushing the fix. Customers on Clusters 116, 120, and 132 may still see an ingestion lag greater than 1 hour. All other Clusters have recovered. Data was never dropped. The next update is at 3 PM PDT or sooner.

  5. resolved Mar 14, 2025, 09:13 PM UTC

    Our engineering team has confirmed at 2:06 PDT that all ingestion should be back to normal and is stable. This incident is now resolved.

Read the full incident report →

Major March 7, 2025

Journeys are having issues fetching, creating and saving journeys.

Detected by Pingoru
Mar 07, 2025, 09:57 AM UTC
Resolved
Mar 07, 2025, 06:35 PM UTC
Duration
8h 38m
Affected: Global Web Application
Timeline · 3 updates
  1. investigating Mar 07, 2025, 09:57 AM UTC

    We are currently investigating an issue affecting Journeys where fetching, creation and saving of journeys is not working correctly. Customers may experience delays or unexpected behavior when using Journeys. Our Engineering team is actively working to identify the root cause. We will provide the next update in 60 minutes or sooner if more information becomes available.

  2. monitoring Mar 07, 2025, 10:32 AM UTC

    A fix has been deployed and journey function is operating normally. Moving to monitoring state.

  3. resolved Mar 07, 2025, 06:35 PM UTC

    We do not see any more errors in journey edits.

Read the full incident report →

Minor January 13, 2025

ingestion delays on c22

Detected by Pingoru
Jan 13, 2025, 10:41 PM UTC
Resolved
Jan 14, 2025, 01:03 AM UTC
Duration
2h 22m
Affected: User UpdatesList Uploads
Timeline · 4 updates
  1. identified Jan 13, 2025, 10:41 PM UTC

    Our engineers have identified and are deploying a fix for an issue with ingestion delays. The current delay is roughly 4 hours. This issue only affects customers on cluster 22. You can find your cluster id displayed on the Project Settings page. Next update by 3:30PST.

  2. identified Jan 13, 2025, 11:32 PM UTC

    Ingestion backlogs on c22 are shrinking due to the deployed fix. Next update by 4:30PST.

  3. monitoring Jan 14, 2025, 12:30 AM UTC

    Ingestion on c22 has fully caught up. Our engineering team is continuing to monitor the situation.

  4. resolved Jan 14, 2025, 01:03 AM UTC

    Our engineering team hasn't detected any additional anomalies, and ingestion has been stable. This incident is now resolved.

Read the full incident report →

Major January 7, 2025

Intermittent API errors

Detected by Pingoru
Jan 07, 2025, 07:56 PM UTC
Resolved
Jan 07, 2025, 10:10 PM UTC
Duration
2h 13m
Affected: Global API Success
Timeline · 3 updates
  1. investigating Jan 07, 2025, 07:56 PM UTC

    Our on-call engineers are investigating intermittent partial API outages lasting several minutes over the past couple hours. Our monitoring and alerting first detected the issue near 8:53 am PST. Next update by 1PM PST.

  2. monitoring Jan 07, 2025, 09:01 PM UTC

    Our engineering team implemented a fix for the issue and have seen no further API outages. We are continuing to monitor the situation.

  3. resolved Jan 07, 2025, 10:10 PM UTC

    We have not seen further API outages since the last update. This incident is now resolved.

Read the full incident report →

Major December 31, 2024

Event-Triggered Journeys Delays Ingesting New Users

Detected by Pingoru
Dec 31, 2024, 02:29 PM UTC
Resolved
Dec 31, 2024, 06:25 PM UTC
Duration
3h 55m
Affected: Global Web Application
Timeline · 3 updates
  1. identified Dec 31, 2024, 02:29 PM UTC

    Summary: Event-Triggered Journeys Delays Ingesting New Users. The issue originated from errors in the workflow-entrance-trigger pods, causing a significant backlog in processing. There is no impact to Scheduled Journeys and API Triggered Journeys . Actions Taken The workflow-entrance-trigger service was updated to the latest version, and additional pods were scaled up to process the backlog faster. The deployment resolved the issue, and error rates dropped significantly. Current Status The errors we were experiencing have been fixed since 5AM PST, now we're just monitoring the backlog as it drains. For 99% of clients, the backlog has drained completely, there are a few stragglers with small backlogs Next Steps Engineers will continue monitoring error rates and ensure the backlog clears entirely. Follow-up tasks include setting up error rate monitoring and addressing journey-specific issues to prevent recurrence.

  2. identified Dec 31, 2024, 04:11 PM UTC

    We are continuing to work on a fix for this issue.

  3. resolved Dec 31, 2024, 06:25 PM UTC

    This issue is now resolved and the backlog is completely drained. Event trigger journey's as of 7:25 AM PST are back to normal and processing as expected. If you still have any further questions please reach out to [email protected].

Read the full incident report →

Minor November 25, 2024

Segmentation page intermittent slowness

Detected by Pingoru
Nov 25, 2024, 05:07 PM UTC
Resolved
Nov 25, 2024, 08:18 PM UTC
Duration
3h 10m
Timeline · 5 updates
  1. investigating Nov 25, 2024, 05:07 PM UTC

    Intermittent slowness observed on segmentation page. The engineering team is currently investigating and we will update at 9:45am PT.

  2. identified Nov 25, 2024, 05:25 PM UTC

    The issue has been identified and fix has been implemented. The engineering team will continue monitoring systems and we will update at 10:15am PT or sooner.

  3. monitoring Nov 25, 2024, 05:47 PM UTC

    The issue has been mitigated. The engineering team will continue monitoring systems and we will update at 11:30am PT or sooner.

  4. monitoring Nov 25, 2024, 07:06 PM UTC

    The issue has been mitigated. The engineering team continues to monitor systems. If you are still seeing any impact please reach out to [email protected]

  5. resolved Nov 25, 2024, 08:18 PM UTC

    The issue has been resolved.

Read the full incident report →

Major November 24, 2024

Web Application Errors

Detected by Pingoru
Nov 24, 2024, 06:16 PM UTC
Resolved
Nov 25, 2024, 05:09 PM UTC
Duration
22h 52m
Affected: Global Web ApplicationGlobal API SuccessGlobal System WebhooksGlobal Partner WebhooksGlobal Campaign Sends
Timeline · 19 updates

Read the full incident report →

Minor November 22, 2024

Spike in web-app errors causing platform degredation

Detected by Pingoru
Nov 22, 2024, 03:15 PM UTC
Resolved
Nov 22, 2024, 08:20 PM UTC
Duration
5h 5m
Affected: Global Web Application
Timeline · 4 updates
  1. investigating Nov 22, 2024, 03:15 PM UTC

    Iterable engineering became alerted to high error rates on web-app this morning around 6:00 a.m. PT. After investigating the this issue it appears that these spikes could be happening at the top of each hour as higher volumes of campaigns are scheduled to be sent out. The first time frame of impact was from 6:02 - 6:11am PT and then again at 7am PT. Customers could be seeing a slowness in the web UI. Sends processing, journey processing, and data ingestion are not currently impacted. Next update at 8 am PT or sooner.

  2. monitoring Nov 22, 2024, 04:08 PM UTC

    Iterable engineering has mitigated the errors and slowdown affecting the web UI and API endpoints at the top of the hour. The last one occurred between 7:02 - 7:12 am PT. We are continuing to monitor. Next update by 9 am PT

  3. monitoring Nov 22, 2024, 05:11 PM UTC

    There have been no further application degradations, and the engineering team will continue monitoring through the morning. Next update by 12 pm PT or sooner

  4. resolved Nov 22, 2024, 08:20 PM UTC

    There have been no further application degradations and the platform looks stable. This will be our last update as we are marking this incident as resolved. Engineering team will continue to monitor closely.

Read the full incident report →

Minor November 20, 2024

Errors appearing and delays accessing the platform

Detected by Pingoru
Nov 20, 2024, 09:56 PM UTC
Resolved
Nov 21, 2024, 05:37 PM UTC
Duration
19h 41m
Affected: Global Web Application
Timeline · 6 updates
  1. investigating Nov 20, 2024, 09:56 PM UTC

    We are currently experiencing errors and slowness in accessing the Iterable platform. This is only impacting the Web Interface and is not impacting sends, journey processing. The engineering team is currently investigating. Next update by 3pm PT.

  2. monitoring Nov 20, 2024, 11:13 PM UTC

    Engineering has confirmed that the errors and slowness were from 1:30-1:50 PM PT. We have recovered since then, and we're continuing to monitor and work on further preventative measures. Next update by 4 pm PT.

  3. identified Nov 21, 2024, 12:10 AM UTC

    The engineering team has identified a fix for the intermittent degradations and is working on a remediation. We're continuing to monitor and add further preventative measures. Next update by 5 pm PT.

  4. monitoring Nov 21, 2024, 01:12 AM UTC

    The engineering team has deployed a fix for the intermittent degradations. We are continuing to monitor. If you are still seeing issues with platform slowness, please contact Iterable support. Iterable campaign sends and journey processing were not impacted during this incident. Next update at 7pm PT.

  5. monitoring Nov 21, 2024, 03:08 AM UTC

    There have been no further application degradations, and the engineering team will continue monitoring through the morning. Next update by 9am PT

  6. resolved Nov 21, 2024, 05:37 PM UTC

    This incident has been resolved.

Read the full incident report →

Minor November 20, 2024

Website Degraded

Detected by Pingoru
Nov 20, 2024, 03:34 PM UTC
Resolved
Nov 20, 2024, 08:33 PM UTC
Duration
4h 59m
Affected: Global Web Application
Timeline · 3 updates
  1. investigating Nov 20, 2024, 03:34 PM UTC

    From 4:45 AM to 5:45 AM PT, we saw errors and slowness reaching app.iterable.com. Accessing all areas of the platform may have been met with slow loads, but sends and logins are not affected. The engineering team is currently investigating and we will update at 8:30am PT.

  2. monitoring Nov 20, 2024, 04:51 PM UTC

    Iterable Engineering has confirmed that we've recovered from the errors that were previously noted as reaching app.iterable.com. Users should be seeing platform speeds return to normal. Engineering is continuing to monitor and put preventative measures in place, but if you are still seeing any impact with Iterable pages loading slowly please reach out to [email protected]

  3. resolved Nov 20, 2024, 08:33 PM UTC

    This incident has been resolved.

Read the full incident report →

Notice November 14, 2024

Issue With Dynamic Content For InApp Messages

Detected by Pingoru
Nov 14, 2024, 05:45 PM UTC
Resolved
Nov 14, 2024, 05:45 PM UTC
Duration
Affected: Global Campaign Sends
Timeline · 1 update
  1. resolved Nov 14, 2024, 05:45 PM UTC

    Today, November 14th around 5:00 am PST, Iterable was alerted to an issue where inApp campaigns were not rendering handlebars correctly. Iterable engineers began investigating the issue and identified the cause of the issue had started around 10:00 am PST on November 13th. Once they deployed their fix today at 7:30 am PST handlebars began rendering correctly again for all in App campaigns. During this impact window customers may have also seen issues with in App messages not sending / being fetched correctly in regards to dynamic content not being available. If you have any additional questions or think you may have been impacted please reach out to [email protected]

Read the full incident report →

Minor November 12, 2024

Ingestion Delay On Subset Of Clusters

Detected by Pingoru
Nov 12, 2024, 05:18 PM UTC
Resolved
Nov 12, 2024, 07:24 PM UTC
Duration
2h 5m
Affected: Email SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesList UploadsList UploadsList UploadsList UpdatesList UpdatesList UpdatesUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser Deletions
Timeline · 3 updates
  1. investigating Nov 12, 2024, 05:18 PM UTC

    Starting at around 6:30 am PT, Iterable Engineers were notified that lag on our ingestion topics was growing on a subset of clusters. Customers on these impacted clusters could be experiencing delays with user updates, list uploads, user deletion, and event processing. Email sends and journey processing may see impact as well. Data is only being delayed. It is not being dropped. The engineering team is working to remediate these delays. Next update at 10:00 an PT or Sooner.

  2. monitoring Nov 12, 2024, 05:52 PM UTC

    The engineering team has identified the cause and remediated the delays on these subset of clusters. Most impacted ingestion topics have recovered and customers should start to see data being ingested normally again. We are still working through some backlog, so the team will continue to monitor to ensure that these delays don't recur. Next Update at 11:00 am PT or sooner?

  3. resolved Nov 12, 2024, 07:24 PM UTC

    Ingestion has returned to normal for all impacted clusters and this issue is now resolved.

Read the full incident report →

Minor October 25, 2024

Delays in processing System Webhooks

Detected by Pingoru
Oct 25, 2024, 09:00 PM UTC
Resolved
Oct 26, 2024, 07:35 PM UTC
Duration
22h 35m
Affected: Global System Webhooks
Timeline · 3 updates
  1. identified Oct 25, 2024, 06:48 PM UTC

    The code fix for the slowness of System webhooks is deployed. Currently the queue is depleting. This could take several hours to deplete the backlog completely. During this time the performance will be degraded. Next update at 6pm PST.

  2. monitoring Oct 26, 2024, 12:16 AM UTC

    System webhook backlog is depleted and the incident is resolved.

  3. resolved Oct 26, 2024, 07:35 PM UTC

    This incident has been resolved.

Read the full incident report →

Notice October 22, 2024

Embedded Messaging API down

Detected by Pingoru
Oct 22, 2024, 05:33 PM UTC
Resolved
Oct 23, 2024, 01:18 PM UTC
Duration
19h 45m
Timeline · 3 updates
  1. investigating Oct 22, 2024, 05:33 PM UTC

    Errors with our Embedded Messaging API are not occurring any more. The engineering team remediated the issue by clearing the offending database requests that were queued up. This is a temporary fix. A permanent remediation with code fix is still in progress.

  2. monitoring Oct 23, 2024, 12:01 AM UTC

    Errors with our Embedded Messaging API are not occurring any more. A code fix has been deployed.

  3. resolved Oct 23, 2024, 01:18 PM UTC

    This issue has been fully resolved as of October 22nd at 5pm PST.

Read the full incident report →

Major October 15, 2024

Issues impacting Email Sending

Detected by Pingoru
Oct 15, 2024, 04:06 PM UTC
Resolved
Oct 15, 2024, 09:32 PM UTC
Duration
5h 25m
Affected: Global Campaign SendsGlobal Proof Sends
Timeline · 7 updates
  1. monitoring Oct 15, 2024, 04:06 PM UTC

    Starting around 7:27 AM PST, Iterable engineers began receiving alerts about elevated error rates on a 3rd-party ESP provider's API endpoints. Iterable has been working directly with our 3rd party vendor and they have released a fix and as of 8:07 AM PST we are seeing successful email sends. Customers using this ESP may experience issues impacting email sends, including elevated email campaign send skips during this impact window Iterable Engineer will continue monitoring the issue. If you have any questions please reach out to [email protected]

  2. monitoring Oct 15, 2024, 04:49 PM UTC

    The fix from our third party vendor has been stable and we're seeing email campaigns are sending successfully again. We are currently working to resend the emails that were impacted during the impact window. Next update will be at 11:00 am pst or sooner.

  3. monitoring Oct 15, 2024, 06:05 PM UTC

    Engineering is identifying alternative methods to identify affected campaigns and will contact customers to confirm whether redelivery is needed. Additionally, 3rd party vendor recovered as of 8:07 am PT and there are no issues with send. Next update at 12 pm PT.

  4. monitoring Oct 15, 2024, 06:11 PM UTC

    We are continuing to monitor for any further issues.

  5. monitoring Oct 15, 2024, 06:12 PM UTC

    Engineering is identifying alternative methods to identify affected campaigns and will contact customers to confirm whether redelivery is needed. Additionally, the 3rd party vendor recovered as of 8:07 am PT, and there are no issues with send. Next update at 12 pm PT.

  6. monitoring Oct 15, 2024, 07:22 PM UTC

    Engineering is continuing to work on identifying alternative methods to determine affected campaigns. We will be reaching out to impacted customers to confirm whether redelivery is necessary. The 3rd party vendor recovered at 8:07 AM PT, and there are no ongoing issues with email sends.

  7. resolved Oct 15, 2024, 09:32 PM UTC

    As of 4:16 PM PST, all affected messages have been successfully processed, and the incident is now resolved. Our engineering team has confirmed that all campaigns have been reviewed, and there are no outstanding issues. The 3rd party vendor's recovery at 8:07 AM PT resolved the initial sending issue, and no further delays or interruptions are expected. We appreciate your patience during this time and apologize for any inconvenience caused. If you have any questions or need further assistance, please reach out to our support team at [email protected]

Read the full incident report →

Minor September 24, 2024

Data Ingestion Delay on c14

Detected by Pingoru
Sep 24, 2024, 01:34 PM UTC
Resolved
Sep 24, 2024, 03:48 PM UTC
Duration
2h 14m
Affected: User UpdatesList Uploads
Timeline · 3 updates
  1. investigating Sep 24, 2024, 01:34 PM UTC

    We are experiencing data ingestion delays on c14, affecting bulk updates, list uploads, and custom events. However, no data is being dropped.

  2. monitoring Sep 24, 2024, 03:08 PM UTC

    Engineering has put in the recovery steps, and are seeing ingestion begin to catch-up. Customers might still see some slight delays while we fully recover. We will update next when ingestion has completely caught up.

  3. resolved Sep 24, 2024, 03:48 PM UTC

    This incident has been resolved.

Read the full incident report →

Notice September 10, 2024

Difficulty accessing the Iterable platform for APAC customers

Detected by Pingoru
Sep 10, 2024, 03:41 AM UTC
Resolved
Sep 10, 2024, 07:42 AM UTC
Duration
4h
Affected: Global Web Application
Timeline · 2 updates
  1. monitoring Sep 10, 2024, 03:41 AM UTC

    We're receiving reports from APAC customers of difficulty accessing the Iterable application. The issue appears to be caused by network connectivity between internet providers in APAC. Impacted customers can use a VPN to connect to the Iterable application. The Iterable Engineering team has confirmed there are no issues impacting the operations of the Iterable platform.

  2. resolved Sep 10, 2024, 07:42 AM UTC

    Impacted APAC customers have confirmed that they can now access the Iterable application without any issues.

Read the full incident report →

Major August 28, 2024

Journey processing halted

Detected by Pingoru
Aug 28, 2024, 10:49 PM UTC
Resolved
Aug 28, 2024, 11:42 PM UTC
Duration
52m
Timeline · 3 updates
  1. identified Aug 28, 2024, 10:49 PM UTC

    Around 2:20 PM PDT, we detected that journeys processing has halted. During this time, journeys will not start consuming users, any running journeys will be stuck, and no campaigns in journeys will send until we resolve the issue. No data will be lost, and they will be backlogged. Our team is actively working to resolve this as soon as possible. Next update will be be at or before 4:10 PM PDT.

  2. monitoring Aug 28, 2024, 11:07 PM UTC

    We have identified and fixed the problem that halted journey processing. Journeys are now processing through the backlog and are recovering with no data loss. If you triggered a journey during this time, it is not necessary to retrigger it. No action is needed to ensure your journey is processing again.

  3. resolved Aug 28, 2024, 11:42 PM UTC

    The backlog has been processed and journeys are processing normally. There is no action needed to ensure any journeys you triggered during this time has processed. If you still see any issues, please reach out to our Support team.

Read the full incident report →

Major August 15, 2024

Users unable to login and web

Detected by Pingoru
Aug 15, 2024, 04:00 PM UTC
Resolved
Aug 15, 2024, 08:47 PM UTC
Duration
4h 46m
Affected: Global Web Application
Timeline · 4 updates
  1. investigating Aug 15, 2024, 04:00 PM UTC

    We are currently investigating multiple reports of users unable to login and slow response in Web UI. Next update before 9:30AM

  2. investigating Aug 15, 2024, 04:03 PM UTC

    We are currently investigating multiple reports of users unable to login and slow response in Web UI. Next update before 9:30AM

  3. monitoring Aug 15, 2024, 04:37 PM UTC

    We believe the issue has been mitigated and that users should see access returning to normal. We are continuing to monitor. If you are still seeing issues, please contact Iterable support.

  4. resolved Aug 15, 2024, 08:47 PM UTC

    The platform has been stable, and users should no longer see any delays on page load or issues logging in. This incident is closed. If you are experiencing any issues, please reach out to Iterable support.

Read the full incident report →

Notice July 19, 2024

API failures across multiple clusters and degraded webApp performance

Detected by Pingoru
Jul 19, 2024, 03:37 PM UTC
Resolved
Jul 19, 2024, 07:10 PM UTC
Duration
3h 33m
Affected: Email SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsEmail SendsJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingJourney ProcessingGlobal LinksPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsPush SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsSMS SendsUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesUser UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesList UpdatesUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser DeletionsUser Deletions
Timeline · 7 updates
  1. investigating Jul 19, 2024, 03:37 PM UTC

    Beginning around 6:20 AM PST we were alerted to a spike in API errors across multiple endpoints impacting a number of specific customer clusters. All clusters numbered 100+ may be experiencing a spike in 5xx API errors across all endpoints. This may impact areas of the app such as scheduled and triggered Journeys, scheduled and triggered campaign sends, custom events, user updates, and more. Customers may also be experiencing webApp performance degradation as well including segmentation, list uploads, and viewing campaign details. Our engineering team is continuing work on identifying the underlying cause and exploring remediation options. Next update will be at 9 AM PST or sooner. If you have questions please reach out to [email protected]

  2. investigating Jul 19, 2024, 03:43 PM UTC

    We are continuing to investigate this issue.

  3. identified Jul 19, 2024, 04:16 PM UTC

    Our Engineers have identified the root cause of the issue and are actively working on deploying a fix. Currently our API endpoints have recovered, but in the meantime, customers may still be experiencing slowness in scheduled and triggered Journeys, scheduled and triggered campaign sends, custom events, user updates, and more. Customers may also be experiencing webApp performance degradation as well including segmentation, list uploads, and viewing campaign details. While the fix is being deployed, to clarify, this issue is specifically impacting All clusters numbered 100+. Our next update will be at 10:00 AM PT or sooner.

  4. monitoring Jul 19, 2024, 05:10 PM UTC

    Web app and API endpoints have completely recovered at this point. However, there are still a subset of customers that may be experiencing an ingestion lag that is currently draining. These customers may still be seeing delays in user updates, event calls, and event triggered journeys. We are continuing to monitor this and will provide our next update at 11 AM PT or sooner.

  5. monitoring Jul 19, 2024, 05:38 PM UTC

    We are continuing to monitor for any further issues.

  6. monitoring Jul 19, 2024, 06:00 PM UTC

    As of now we have completely caught up on ingestion lag with all services returning to normal. We will continue to monitor performance with that next update at 12 PM PT or sooner.

  7. resolved Jul 19, 2024, 07:10 PM UTC

    We have fully recovered from this incident and are marking it resolved. If you have any further questions please reach out to [email protected]

Read the full incident report →