- Detected by Pingoru
- May 19, 2025, 09:42 PM UTC
- Resolved
- May 19, 2025, 10:28 PM UTC
- Duration
- 46m
Timeline · 4 updates
-
investigating May 19, 2025, 09:42 PM UTC
Mailgun is reporting degraded performance across their application (https://status.mailgun.com/). We are monitoring Mailgun and its performance.
-
monitoring May 19, 2025, 09:43 PM UTC
We are monitoring the effect of the Mailgun degradation.
-
monitoring May 19, 2025, 09:54 PM UTC
Mailgun has implemented a fix and is in monitoring status.
-
resolved May 19, 2025, 10:28 PM UTC
This incident has been resolved.
Read the full incident report →
- Detected by Pingoru
- Apr 23, 2025, 06:50 AM UTC
- Resolved
- Apr 24, 2025, 11:17 PM UTC
- Duration
- 1d 16h
Affected: Email Sends, Journey Processing, Push Sends, SMS Sends, User Updates, List Updates, User Deletions
Timeline · 8 updates
Read the full incident report →
- Detected by Pingoru
- Mar 14, 2025, 04:56 PM UTC
- Resolved
- Mar 14, 2025, 09:13 PM UTC
- Duration
- 4h 16m
Affected: Global API Ingestion
Timeline · 5 updates
-
investigating Mar 14, 2025, 04:56 PM UTC
Our engineering team has identified an issue causing ingestion delay for all Clusters 100 and above. We are deploying the fix right now. Customers on these impacted clusters could be experiencing delays with user updates, list uploads, user deletion, and event processing. Email sends and journey processing may see impact as well. Data is only being delayed. It is not being dropped. The engineering team is working to remediate these delays. Next update at 10:30 AM PDT or sooner.
-
investigating Mar 14, 2025, 05:31 PM UTC
The engineering team is still working on the fix, and customers on Clusters 100 and above will still be experiencing some delays with user updates, list uploads, user deletion, and event processing. Email sends and journey processing may see impact as well. Data is only being delayed. It is not being dropped. The engineering team is working to remediate these delays. Next update at 11:00 AM PDT or sooner.
-
identified Mar 14, 2025, 06:07 PM UTC
Engineering has put in the fix. Some clusters have fully recovered, and we are actively working on the ones that have not. For those customers that are still impacted, you will still be experiencing some delays with user updates, list uploads, user deletion, and event processing. Email sends and journey processing may see impact as well. Data is only being delayed. It is not being dropped. The engineering team is working to remediate these delays. Next update at 12:30 PM PDT or sooner.
-
monitoring Mar 14, 2025, 07:30 PM UTC
Engineering has finished pushing the fix. Customers on Clusters 116, 120, and 132 may still see an ingestion lag greater than 1 hour. All other Clusters have recovered. Data was never dropped. The next update is at 3 PM PDT or sooner.
-
resolved Mar 14, 2025, 09:13 PM UTC
Our engineering team has confirmed at 2:06 PDT that all ingestion should be back to normal and is stable. This incident is now resolved.
Read the full incident report →
- Detected by Pingoru
- Mar 12, 2025, 03:45 PM UTC
- Resolved
- Mar 18, 2025, 08:11 PM UTC
- Duration
- 6d 4h
Affected: Global Campaign Sends
Timeline · 15 updates
Read the full incident report →
- Detected by Pingoru
- Mar 07, 2025, 09:57 AM UTC
- Resolved
- Mar 07, 2025, 06:35 PM UTC
- Duration
- 8h 38m
Affected: Global Web Application
Timeline · 3 updates
-
investigating Mar 07, 2025, 09:57 AM UTC
We are currently investigating an issue affecting Journeys where fetching, creating, and saving journeys are not working correctly. Customers may experience delays or unexpected behavior when using Journeys. Our Engineering team is actively working to identify the root cause. We will provide the next update in 60 minutes or sooner if more information becomes available.
-
monitoring Mar 07, 2025, 10:32 AM UTC
A fix has been deployed and journey functionality is operating normally. Moving to monitoring state.
-
resolved Mar 07, 2025, 06:35 PM UTC
We do not see any more errors in journey edits.
Read the full incident report →
- Detected by Pingoru
- Jan 13, 2025, 10:41 PM UTC
- Resolved
- Jan 14, 2025, 01:03 AM UTC
- Duration
- 2h 22m
Affected: User Updates, List Uploads
Timeline · 4 updates
-
identified Jan 13, 2025, 10:41 PM UTC
Our engineers have identified and are deploying a fix for an issue with ingestion delays. The current delay is roughly 4 hours. This issue only affects customers on cluster 22. You can find your cluster id displayed on the Project Settings page. Next update by 3:30PST.
-
identified Jan 13, 2025, 11:32 PM UTC
Ingestion backlogs on c22 are shrinking due to the deployed fix. Next update by 4:30PST.
-
monitoring Jan 14, 2025, 12:30 AM UTC
Ingestion on c22 has fully caught up. Our engineering team is continuing to monitor the situation.
-
resolved Jan 14, 2025, 01:03 AM UTC
Our engineering team hasn't detected any additional anomalies, and ingestion has been stable. This incident is now resolved.
Read the full incident report →
- Detected by Pingoru
- Jan 07, 2025, 07:56 PM UTC
- Resolved
- Jan 07, 2025, 10:10 PM UTC
- Duration
- 2h 13m
Affected: Global API Success
Timeline · 3 updates
-
investigating Jan 07, 2025, 07:56 PM UTC
Our on-call engineers are investigating intermittent partial API outages lasting several minutes over the past couple of hours. Our monitoring and alerting first detected the issue near 8:53 am PST. Next update by 1 PM PST.
-
monitoring Jan 07, 2025, 09:01 PM UTC
Our engineering team implemented a fix for the issue and has seen no further API outages. We are continuing to monitor the situation.
-
resolved Jan 07, 2025, 10:10 PM UTC
We have not seen further API outages since the last update. This incident is now resolved.
Read the full incident report →
- Detected by Pingoru
- Dec 31, 2024, 02:29 PM UTC
- Resolved
- Dec 31, 2024, 06:25 PM UTC
- Duration
- 3h 55m
Affected: Global Web Application
Timeline · 3 updates
-
identified Dec 31, 2024, 02:29 PM UTC
Summary: Event-Triggered Journeys Delays Ingesting New Users. The issue originated from errors in the workflow-entrance-trigger pods, causing a significant backlog in processing. There is no impact to Scheduled Journeys and API Triggered Journeys.
Actions Taken: The workflow-entrance-trigger service was updated to the latest version, and additional pods were scaled up to process the backlog faster. The deployment resolved the issue, and error rates dropped significantly.
Current Status: The errors we were experiencing have been fixed since 5 AM PST; now we're just monitoring the backlog as it drains. For 99% of clients, the backlog has drained completely; there are a few stragglers with small backlogs.
Next Steps: Engineers will continue monitoring error rates and ensure the backlog clears entirely. Follow-up tasks include setting up error rate monitoring and addressing journey-specific issues to prevent recurrence.
-
identified Dec 31, 2024, 04:11 PM UTC
We are continuing to work on a fix for this issue.
-
resolved Dec 31, 2024, 06:25 PM UTC
This issue is now resolved and the backlog is completely drained. As of 7:25 AM PST, event-triggered journeys are back to normal and processing as expected. If you still have any further questions, please reach out to [email protected].
Read the full incident report →
- Detected by Pingoru
- Nov 25, 2024, 05:07 PM UTC
- Resolved
- Nov 25, 2024, 08:18 PM UTC
- Duration
- 3h 10m
Timeline · 5 updates
-
investigating Nov 25, 2024, 05:07 PM UTC
Intermittent slowness has been observed on the segmentation page. The engineering team is currently investigating and we will update at 9:45am PT.
-
identified Nov 25, 2024, 05:25 PM UTC
The issue has been identified and a fix has been implemented. The engineering team will continue monitoring systems and we will update at 10:15am PT or sooner.
-
monitoring Nov 25, 2024, 05:47 PM UTC
The issue has been mitigated. The engineering team will continue monitoring systems and we will update at 11:30am PT or sooner.
-
monitoring Nov 25, 2024, 07:06 PM UTC
The issue has been mitigated. The engineering team continues to monitor systems. If you are still seeing any impact please reach out to [email protected]
-
resolved Nov 25, 2024, 08:18 PM UTC
The issue has been resolved.
Read the full incident report →
- Detected by Pingoru
- Nov 24, 2024, 06:16 PM UTC
- Resolved
- Nov 25, 2024, 05:09 PM UTC
- Duration
- 22h 52m
Affected: Global Web Application, Global API Success, Global System Webhooks, Global Partner Webhooks, Global Campaign Sends
Timeline · 19 updates
Read the full incident report →
- Detected by Pingoru
- Nov 22, 2024, 03:15 PM UTC
- Resolved
- Nov 22, 2024, 08:20 PM UTC
- Duration
- 5h 5m
Affected: Global Web Application
Timeline · 4 updates
-
investigating Nov 22, 2024, 03:15 PM UTC
Iterable engineering was alerted to high error rates on the web app this morning around 6:00 a.m. PT. After investigating this issue, it appears that these spikes could be happening at the top of each hour as higher volumes of campaigns are scheduled to be sent out. The first time frame of impact was from 6:02 - 6:11 am PT and then again at 7 am PT. Customers could be seeing slowness in the web UI. Sends processing, journey processing, and data ingestion are not currently impacted. Next update at 8 am PT or sooner.
-
monitoring Nov 22, 2024, 04:08 PM UTC
Iterable engineering has mitigated the errors and slowdown affecting the web UI and API endpoints at the top of the hour. The last one occurred between 7:02 - 7:12 am PT. We are continuing to monitor. Next update by 9 am PT
-
monitoring Nov 22, 2024, 05:11 PM UTC
There have been no further application degradations, and the engineering team will continue monitoring through the morning. Next update by 12 pm PT or sooner
-
resolved Nov 22, 2024, 08:20 PM UTC
There have been no further application degradations and the platform looks stable. This will be our last update as we are marking this incident as resolved. Engineering team will continue to monitor closely.
Read the full incident report →
- Detected by Pingoru
- Nov 20, 2024, 09:56 PM UTC
- Resolved
- Nov 21, 2024, 05:37 PM UTC
- Duration
- 19h 41m
Affected: Global Web Application
Timeline · 6 updates
-
investigating Nov 20, 2024, 09:56 PM UTC
We are currently experiencing errors and slowness in accessing the Iterable platform. This is only impacting the Web Interface and is not impacting sends or journey processing. The engineering team is currently investigating. Next update by 3pm PT.
-
monitoring Nov 20, 2024, 11:13 PM UTC
Engineering has confirmed that the errors and slowness were from 1:30-1:50 PM PT. We have recovered since then, and we're continuing to monitor and work on further preventative measures. Next update by 4 pm PT.
-
identified Nov 21, 2024, 12:10 AM UTC
The engineering team has identified a fix for the intermittent degradations and is working on a remediation. We're continuing to monitor and add further preventative measures. Next update by 5 pm PT.
-
monitoring Nov 21, 2024, 01:12 AM UTC
The engineering team has deployed a fix for the intermittent degradations. We are continuing to monitor. If you are still seeing issues with platform slowness, please contact Iterable support. Iterable campaign sends and journey processing were not impacted during this incident. Next update at 7pm PT.
-
monitoring Nov 21, 2024, 03:08 AM UTC
There have been no further application degradations, and the engineering team will continue monitoring through the morning. Next update by 9am PT
-
resolved Nov 21, 2024, 05:37 PM UTC
This incident has been resolved.
Read the full incident report →
- Detected by Pingoru
- Nov 20, 2024, 03:34 PM UTC
- Resolved
- Nov 20, 2024, 08:33 PM UTC
- Duration
- 4h 59m
Affected: Global Web Application
Timeline · 3 updates
-
investigating Nov 20, 2024, 03:34 PM UTC
From 4:45 AM to 5:45 AM PT, we saw errors and slowness reaching app.iterable.com. Accessing all areas of the platform may have been met with slow loads, but sends and logins are not affected. The engineering team is currently investigating and we will update at 8:30am PT.
-
monitoring Nov 20, 2024, 04:51 PM UTC
Iterable Engineering has confirmed that we've recovered from the errors that were previously noted as reaching app.iterable.com. Users should be seeing platform speeds return to normal. Engineering is continuing to monitor and put preventative measures in place, but if you are still seeing any impact with Iterable pages loading slowly please reach out to [email protected]
-
resolved Nov 20, 2024, 08:33 PM UTC
This incident has been resolved.
Read the full incident report →
- Detected by Pingoru
- Nov 14, 2024, 05:45 PM UTC
- Resolved
- Nov 14, 2024, 05:45 PM UTC
- Duration
- —
Affected: Global Campaign Sends
Timeline · 1 update
-
resolved Nov 14, 2024, 05:45 PM UTC
Today, November 14th, around 5:00 am PST, Iterable was alerted to an issue where in-app campaigns were not rendering handlebars correctly. Iterable engineers began investigating and identified that the issue had started around 10:00 am PST on November 13th. Once they deployed their fix today at 7:30 am PST, handlebars began rendering correctly again for all in-app campaigns. During this impact window, customers may have also seen issues with in-app messages not sending or not being fetched correctly due to dynamic content not being available. If you have any additional questions or think you may have been impacted, please reach out to [email protected]
Read the full incident report →
- Detected by Pingoru
- Nov 12, 2024, 05:18 PM UTC
- Resolved
- Nov 12, 2024, 07:24 PM UTC
- Duration
- 2h 5m
Affected: Email Sends, Journey Processing, User Updates, List Uploads, List Updates, User Deletions
Timeline · 3 updates
-
investigating Nov 12, 2024, 05:18 PM UTC
Starting at around 6:30 am PT, Iterable Engineers were notified that lag on our ingestion topics was growing on a subset of clusters. Customers on these impacted clusters could be experiencing delays with user updates, list uploads, user deletion, and event processing. Email sends and journey processing may see impact as well. Data is only being delayed. It is not being dropped. The engineering team is working to remediate these delays. Next update at 10:00 am PT or sooner.
-
monitoring Nov 12, 2024, 05:52 PM UTC
The engineering team has identified the cause and remediated the delays on this subset of clusters. Most impacted ingestion topics have recovered and customers should start to see data being ingested normally again. We are still working through some backlog, so the team will continue to monitor to ensure that these delays don't recur. Next update at 11:00 am PT or sooner.
-
resolved Nov 12, 2024, 07:24 PM UTC
Ingestion has returned to normal for all impacted clusters and this issue is now resolved.
Read the full incident report →
- Detected by Pingoru
- Oct 25, 2024, 09:00 PM UTC
- Resolved
- Oct 26, 2024, 07:35 PM UTC
- Duration
- 22h 35m
Affected: Global System Webhooks
Timeline · 3 updates
-
identified Oct 25, 2024, 06:48 PM UTC
The code fix for the slowness of System webhooks is deployed. The queue is currently draining, and it could take several hours to clear the backlog completely. During this time, performance will be degraded. Next update at 6pm PST.
-
monitoring Oct 26, 2024, 12:16 AM UTC
System webhook backlog is depleted and the incident is resolved.
-
resolved Oct 26, 2024, 07:35 PM UTC
This incident has been resolved.
Read the full incident report →
- Detected by Pingoru
- Oct 22, 2024, 05:33 PM UTC
- Resolved
- Oct 23, 2024, 01:18 PM UTC
- Duration
- 19h 45m
Timeline · 3 updates
-
investigating Oct 22, 2024, 05:33 PM UTC
Errors with our Embedded Messaging API are no longer occurring. The engineering team remediated the issue by clearing the offending database requests that were queued up. This is a temporary fix. A permanent remediation with a code fix is still in progress.
-
monitoring Oct 23, 2024, 12:01 AM UTC
Errors with our Embedded Messaging API are not occurring any more. A code fix has been deployed.
-
resolved Oct 23, 2024, 01:18 PM UTC
This issue has been fully resolved as of October 22nd at 5pm PST.
Read the full incident report →
- Detected by Pingoru
- Oct 15, 2024, 04:06 PM UTC
- Resolved
- Oct 15, 2024, 09:32 PM UTC
- Duration
- 5h 25m
Affected: Global Campaign Sends, Global Proof Sends
Timeline · 7 updates
-
monitoring Oct 15, 2024, 04:06 PM UTC
Starting around 7:27 AM PST, Iterable engineers began receiving alerts about elevated error rates on a 3rd-party ESP provider's API endpoints. Iterable has been working directly with our 3rd party vendor; they have released a fix, and as of 8:07 AM PST we are seeing successful email sends. Customers using this ESP may experience issues impacting email sends, including elevated email campaign send skips, during this impact window. Iterable engineers will continue monitoring the issue. If you have any questions, please reach out to [email protected]
-
monitoring Oct 15, 2024, 04:49 PM UTC
The fix from our third party vendor has been stable and we're seeing email campaigns sending successfully again. We are currently working to resend the emails that were impacted during the impact window. Next update will be at 11:00 am PST or sooner.
-
monitoring Oct 15, 2024, 06:05 PM UTC
Engineering is identifying alternative methods to identify affected campaigns and will contact customers to confirm whether redelivery is needed. Additionally, the 3rd party vendor recovered as of 8:07 am PT, and there are no issues with send. Next update at 12 pm PT.
-
monitoring Oct 15, 2024, 06:11 PM UTC
We are continuing to monitor for any further issues.
-
monitoring Oct 15, 2024, 06:12 PM UTC
Engineering is identifying alternative methods to identify affected campaigns and will contact customers to confirm whether redelivery is needed. Additionally, the 3rd party vendor recovered as of 8:07 am PT, and there are no issues with send. Next update at 12 pm PT.
-
monitoring Oct 15, 2024, 07:22 PM UTC
Engineering is continuing to work on identifying alternative methods to determine affected campaigns. We will be reaching out to impacted customers to confirm whether redelivery is necessary. The 3rd party vendor recovered at 8:07 AM PT, and there are no ongoing issues with email sends.
-
resolved Oct 15, 2024, 09:32 PM UTC
As of 4:16 PM PST, all affected messages have been successfully processed, and the incident is now resolved. Our engineering team has confirmed that all campaigns have been reviewed, and there are no outstanding issues. The 3rd party vendor's recovery at 8:07 AM PT resolved the initial sending issue, and no further delays or interruptions are expected. We appreciate your patience during this time and apologize for any inconvenience caused. If you have any questions or need further assistance, please reach out to our support team at [email protected]
Read the full incident report →
- Detected by Pingoru
- Sep 24, 2024, 01:34 PM UTC
- Resolved
- Sep 24, 2024, 03:48 PM UTC
- Duration
- 2h 14m
Affected: User Updates, List Uploads
Timeline · 3 updates
-
investigating Sep 24, 2024, 01:34 PM UTC
We are experiencing data ingestion delays on c14, affecting bulk updates, list uploads, and custom events. However, no data is being dropped.
-
monitoring Sep 24, 2024, 03:08 PM UTC
Engineering has applied the recovery steps and is seeing ingestion begin to catch up. Customers might still see some slight delays while we fully recover. We will update next when ingestion has completely caught up.
-
resolved Sep 24, 2024, 03:48 PM UTC
This incident has been resolved.
Read the full incident report →
Notice September 10, 2024
- Detected by Pingoru
- Sep 10, 2024, 03:41 AM UTC
- Resolved
- Sep 10, 2024, 07:42 AM UTC
- Duration
- 4h
Affected: Global Web Application
Timeline · 2 updates
-
monitoring Sep 10, 2024, 03:41 AM UTC
We're receiving reports from APAC customers of difficulty accessing the Iterable application. The issue appears to be caused by network connectivity between internet providers in APAC. Impacted customers can use a VPN to connect to the Iterable application. The Iterable Engineering team has confirmed there are no issues impacting the operations of the Iterable platform.
-
resolved Sep 10, 2024, 07:42 AM UTC
Impacted APAC customers have confirmed that they can now access the Iterable application without any issues.
Read the full incident report →
- Detected by Pingoru
- Aug 28, 2024, 10:49 PM UTC
- Resolved
- Aug 28, 2024, 11:42 PM UTC
- Duration
- 52m
Timeline · 3 updates
-
identified Aug 28, 2024, 10:49 PM UTC
Around 2:20 PM PDT, we detected that journey processing has halted. During this time, journeys will not start consuming users, any running journeys will be stuck, and no campaigns in journeys will send until we resolve the issue. No data will be lost, and affected journeys will be backlogged. Our team is actively working to resolve this as soon as possible. Next update will be at or before 4:10 PM PDT.
-
monitoring Aug 28, 2024, 11:07 PM UTC
We have identified and fixed the problem that halted journey processing. Journeys are now processing through the backlog and are recovering with no data loss. If you triggered a journey during this time, it is not necessary to retrigger it. No action is needed to ensure your journey is processing again.
-
resolved Aug 28, 2024, 11:42 PM UTC
The backlog has been processed and journeys are processing normally. There is no action needed to ensure any journeys you triggered during this time have processed. If you still see any issues, please reach out to our Support team.
Read the full incident report →
- Detected by Pingoru
- Aug 15, 2024, 04:00 PM UTC
- Resolved
- Aug 15, 2024, 08:47 PM UTC
- Duration
- 4h 46m
Affected: Global Web Application
Timeline · 4 updates
-
investigating Aug 15, 2024, 04:00 PM UTC
We are currently investigating multiple reports of users unable to log in and slow responses in the Web UI. Next update before 9:30 AM.
-
investigating Aug 15, 2024, 04:03 PM UTC
We are currently investigating multiple reports of users unable to log in and slow responses in the Web UI. Next update before 9:30 AM.
-
monitoring Aug 15, 2024, 04:37 PM UTC
We believe the issue has been mitigated and that users should see access returning to normal. We are continuing to monitor. If you are still seeing issues, please contact Iterable support.
-
resolved Aug 15, 2024, 08:47 PM UTC
The platform has been stable, and users should no longer see any delays on page load or issues logging in. This incident is closed. If you are experiencing any issues, please reach out to Iterable support.
Read the full incident report →
- Detected by Pingoru
- Jul 19, 2024, 03:37 PM UTC
- Resolved
- Jul 19, 2024, 07:10 PM UTC
- Duration
- 3h 33m
Affected: Email Sends, Journey Processing, Global Links, Push Sends, SMS Sends, User Updates, List Updates, User Deletions
Timeline · 7 updates
-
investigating Jul 19, 2024, 03:37 PM UTC
Beginning around 6:20 AM PST, we were alerted to a spike in API errors across multiple endpoints impacting a number of specific customer clusters. All clusters numbered 100+ may be experiencing a spike in 5xx API errors across all endpoints. This may impact areas of the app such as scheduled and triggered Journeys, scheduled and triggered campaign sends, custom events, user updates, and more. Customers may also be experiencing web app performance degradation, including segmentation, list uploads, and viewing campaign details. Our engineering team is continuing work on identifying the underlying cause and exploring remediation options. Next update will be at 9 AM PST or sooner. If you have questions, please reach out to [email protected]
-
investigating Jul 19, 2024, 03:43 PM UTC
We are continuing to investigate this issue.
-
identified Jul 19, 2024, 04:16 PM UTC
Our engineers have identified the root cause of the issue and are actively working on deploying a fix. Our API endpoints have currently recovered, but in the meantime customers may still be experiencing slowness in scheduled and triggered Journeys, scheduled and triggered campaign sends, custom events, user updates, and more. Customers may also be experiencing web app performance degradation, including segmentation, list uploads, and viewing campaign details. To clarify, while the fix is being deployed, this issue is specifically impacting all clusters numbered 100+. Our next update will be at 10:00 AM PT or sooner.
-
monitoring Jul 19, 2024, 05:10 PM UTC
Web app and API endpoints have completely recovered at this point. However, there is still a subset of customers that may be experiencing an ingestion lag that is currently draining. These customers may still be seeing delays in user updates, event calls, and event-triggered journeys. We are continuing to monitor this and will provide our next update at 11 AM PT or sooner.
-
monitoring Jul 19, 2024, 05:38 PM UTC
We are continuing to monitor for any further issues.
-
monitoring Jul 19, 2024, 06:00 PM UTC
As of now, we have completely caught up on ingestion lag, with all services returning to normal. We will continue to monitor performance, with the next update at 12 PM PT or sooner.
-
resolved Jul 19, 2024, 07:10 PM UTC
We have fully recovered from this incident and are marking it resolved. If you have any further questions please reach out to [email protected]
Read the full incident report →
- Detected by Pingoru
- Jul 05, 2024, 08:44 PM UTC
- Resolved
- Jul 06, 2024, 01:35 AM UTC
- Duration
- 4h 50m
Affected: Global API Success, Global API Ingestion
Timeline · 11 updates
Read the full incident report →