Sauce Labs Outage History

Sauce Labs is up right now

Sauce Labs had 50 outages in the last 2 years totaling 42h 34m of downtime — averaging 2.1 incidents per month.

There were 50 Sauce Labs outages since October 30, 2024 totaling 42h 34m of downtime. Each is summarised below — incident details, duration, and resolution information.

Source: https://status.saucelabs.com

Notice September 19, 2025

2025-September-19 Resolved Service Incident

Detected by Pingoru
Sep 19, 2025, 08:54 AM UTC
Resolved
Sep 19, 2025, 05:30 AM UTC
Duration
Timeline · 1 update
  1. resolved Sep 19, 2025, 08:54 AM UTC

    Between 06:30 and 08:35 UTC we were seeing errors when selecting and starting Live and Automated Android Emulator tests in US-West-1 & EU-Central-1 Data Centers. This has now been resolved.

Read the full incident report →

Major September 3, 2025

2025-September-3 Service Incident

Detected by Pingoru
Sep 03, 2025, 04:34 PM UTC
Resolved
Sep 03, 2025, 05:10 PM UTC
Duration
35m
Affected: EU-CentralEU-CentralEU-Central
Timeline · 3 updates
  1. investigating Sep 03, 2025, 04:34 PM UTC

    We are currently seeing app storage upload issues in the EU datacenter. We are investigating.

  2. resolved Sep 03, 2025, 05:10 PM UTC

    After taking remedial action, the app storage service has been restored in the EU Data Center. All services are fully operational.

  3. postmortem Dec 11, 2025, 04:00 PM UTC

    ### **Dates:** Wednesday September 5th 2025, 13:50 UTC - 16:31 UTC ### **What happened:** The App Storage Service in the EU Central 1 data center experienced elevated response times, leading to timeout errors with uploading applications. ### **Why it happened:** The App Storage Service experienced connection issues with the backend. ### **How we fixed it:** The connection issues were resolved by restarting the affected application. ### **What we are doing to prevent it from happening again:** We have improved observability signals for the App Storage Service and implemented regular scheduled restarts of the application.

Read the full incident report →

Notice August 19, 2025

2025-August-19 Resolved Service Incident

Detected by Pingoru
Aug 19, 2025, 07:15 PM UTC
Resolved
Aug 19, 2025, 07:00 PM UTC
Duration
Timeline · 2 updates
  1. resolved Aug 19, 2025, 07:15 PM UTC

    Between 16:15 -16:55 UTC, automated Android Real Device tests were intermittently failing to start in our US West Data Center. We took remedial action and the problem was resolved. All services are fully operational.

  2. postmortem Sep 10, 2025, 02:45 PM UTC

    ### **Dates:** Friday August 19th 2025, 16:20 UTC - 16:55 UTC. ### **What happened:** During the times stated, automated Android real device tests were failing to start intermittently in the US-West-1 datacenter. ### **Why it happened:** A defect was introduced by a deployment which impacted Appium tests. ### **How we fixed it:** The deployment was rolled back. ### **What we are doing to prevent it from happening again:** We have improved our internal test cases to detect the conditions in which the defect occurred.

Read the full incident report →

Major July 22, 2025

2025-July-22 Service Incident

Detected by Pingoru
Jul 22, 2025, 08:50 AM UTC
Resolved
Jul 22, 2025, 01:19 PM UTC
Duration
4h 29m
Affected: US-WestUS-WestUS-WestUS-WestUS-West
Timeline · 5 updates
  1. investigating Jul 22, 2025, 08:50 AM UTC

    We are currently seeing intermittent errors accessing the US-West-1 Sauce Labs dashboard. We are investigating.

  2. investigating Jul 22, 2025, 09:53 AM UTC

    We are continuing to investigate this issue.

  3. monitoring Jul 22, 2025, 12:07 PM UTC

    We have been seeing intermittent errors accessing the US-West-1 Sauce Labs dashboard. We have identified a solution and have deployed a fix for this issue. We are monitoring.

  4. resolved Jul 22, 2025, 01:19 PM UTC

    After taking remedial action, the Sauce Labs dashboard is now stable and available. This incident is resolved.

  5. postmortem Aug 20, 2025, 10:02 AM UTC

    ### **Dates:** Tuesday July 22d 2025, 07:28 UTC - 15:07 UTC. ### **What happened:** The Sauce Labs dashboard for our US-West-1 datacenter was unavailable intermittently during the incident timeline. ### **Why it happened:** This issue was caused by Cache poisoning via a specific header. ### **How we fixed it:** The header settings were reviewed and fixed in our proxies. ### **What we are doing to prevent it from happening again:** We have added stricter rules to prevent further cache poisoning attempts.

Read the full incident report →

Critical July 2, 2025

2025-July-2 Service Incident

Detected by Pingoru
Jul 02, 2025, 10:11 AM UTC
Resolved
Jul 02, 2025, 08:11 PM UTC
Duration
9h 59m
Affected: EU-CentralEU-CentralEU-Central
Timeline · 6 updates
  1. investigating Jul 02, 2025, 10:11 AM UTC

    We are currently experiencing reduced availability of Real Devices in our EU-Central-1 datacenter. We are investigating.

  2. investigating Jul 02, 2025, 11:12 AM UTC

    We are continuing to see reduced availability of Real Devices in our EU-Central-1 datacenter. We are continuing to investigate.

  3. investigating Jul 02, 2025, 11:50 AM UTC

    We have identified a networking issue in the EU-Central-1 datacenter, resulting in a high error rate in EU Real Device tests. We are continuing to investigate.

  4. investigating Jul 02, 2025, 04:19 PM UTC

    We are actively working on a solution to resolve the networking issues being seen in the EU datacenter, which involves replacing failed hardware. A high error rate will continue to be seen with EU Real Device tests in the meantime.

  5. resolved Jul 02, 2025, 08:11 PM UTC

    Our Android devices are now fully operational in EU-Central Data Center. We are currently experiencing a slight reduction in available public and private iOS devices in the EU-Central Data Center, which is leading to degraded availability for our iOS user base. We have identified the root cause and implementing a final fix. We expect to restore full availability for all iOS users by tomorrow, Thursday, July 3, 2025, end of day

  6. postmortem Jul 16, 2025, 10:45 AM UTC

    ### **Dates:** Wednesday July 2nd 2025, 09:74 UTC - 17:19 UTC. ### **What happened:** Approximately half of our real devices in the EU-Central-1 datacenter became unavailable for testing. ### **Why it happened:** There was a critical failure with a device within our network infrastructure. ### **How we fixed it:** The issue was resolved by replacing the failed device. ### **What we are doing to prevent it from happening again:** We are reviewing the current network infrastructure strategy to improve resiliency.

Read the full incident report →

Major June 18, 2025

2025-June-18 Service Incident

Detected by Pingoru
Jun 18, 2025, 11:53 AM UTC
Resolved
Jun 18, 2025, 12:54 PM UTC
Duration
1h
Affected: EU-CentralEU-CentralEU-CentralEU-CentralEU-Central
Timeline · 3 updates
  1. investigating Jun 18, 2025, 11:53 AM UTC

    We are currently experiencing issues with our app storage service in the EU Data Center. We are investigating.

  2. resolved Jun 18, 2025, 12:54 PM UTC

    After taking remedial action, the app storage service has been restored in the EU Data Center. All services are fully operational.

  3. postmortem Jul 07, 2025, 08:56 AM UTC

    ### **Dates:** Wednesday June 18th 2025, 11:04 UTC - 12:23 UTC ### **What happened:** The service responsible for App storage and retrieval experienced long response times. ### **Why it happened:** The App storage service was experiencing connectivity issues. ### **How we fixed it:** The connectivity issues disappeared before any remediation actions could be taken.. ### **What we are doing to prevent it from happening again:** We have added tracing validations for the app storage service connectivity to give earlier visibility of any future occurrences.

Read the full incident report →

Major June 16, 2025

2025-June-16 Service Incident

Detected by Pingoru
Jun 16, 2025, 12:49 PM UTC
Resolved
Jun 16, 2025, 01:29 PM UTC
Duration
39m
Affected: EU-CentralEU-CentralEU-Central
Timeline · 3 updates
  1. investigating Jun 16, 2025, 12:49 PM UTC

    We are currently experiencing reduced availability of Real Devices in our EU-Central-1 datacenter. We are investigating.

  2. resolved Jun 16, 2025, 01:29 PM UTC

    After taking remedial action, we are seeing normal availability levels of Real Devices in our EU-Central-1 datacenter. This incident is resolved.

  3. postmortem Jun 20, 2025, 08:17 AM UTC

    ### **Dates:** Monday June 16th 2025, 12:25 UTC - 13:20 UTC ### **What happened:** Approximately 50% of RDC devices in the EU-Central-1 datacenter were unavailable. ### **Why it happened:** Maintenance on the datacenter power supply led to devices being shut down. ### **How we fixed it:** The affected devices were restarted. ### **What we are doing to prevent it from happening again:** We are looking to add more power redundancy and create established maintenance windows.

Read the full incident report →

Notice June 13, 2025

2025-June-13 Resolved Service Incident

Detected by Pingoru
Jun 13, 2025, 11:20 AM UTC
Resolved
Jun 13, 2025, 06:30 AM UTC
Duration
Timeline · 2 updates
  1. resolved Jun 13, 2025, 11:20 AM UTC

    Between 07:30 - 09:00 UTC, There was an issue with the Saucelabs Dashboard User Interface, this caused problems accessing apps and test results in the all datacenters. After remedial action this issue was resolved.

  2. postmortem Jun 20, 2025, 01:48 PM UTC

    ### **Dates:** Friday June 13th 2025, 07:47 UTC - 09:07 UTC ### **What happened:** A release impacted components of the Sauce Labs Web UI, making them unusable. ### **Why it happened:** Certain fonts were blocked from loading due to CORS issue. ### **How we fixed it:** The release was rolled back. ### **What we are doing to prevent it from happening again:** We are adding additional validation steps to prevent this from recurring.

Read the full incident report →

Minor May 14, 2025

2025-May-14 Service Incident 1

Detected by Pingoru
May 14, 2025, 04:49 PM UTC
Resolved
May 14, 2025, 05:16 PM UTC
Duration
27m
Affected: US-WestUS-West
Timeline · 4 updates
  1. investigating May 14, 2025, 04:49 PM UTC

    We are currently investigating an issue where video recordings for Android tests are intermittently missing in our US West Datacenter. The issue began on May 13th at approximately 13:00 CEST.

  2. investigating May 14, 2025, 05:16 PM UTC

    We are continuing to investigate this issue.

  3. resolved May 14, 2025, 05:16 PM UTC

    We have identified the root cause and deployed a fix for this issue. This incident is resolved.

  4. postmortem Jun 13, 2025, 10:44 AM UTC

    ### **Dates:** Thursday May 13th 2025, 10:26 UTC - Friday May 14th 2025 16:48 UTC ### **What happened:** Video recordings for Android tests intermittently failed in the US-West-1 datacenter. ### **Why it happened:** A service involved in capturing recordings entered a deadlocked state when receiving a specific combination of requests and options. ### **How we fixed it:** We cleared the deadlocked service. ### **What we are doing to prevent it from happening again:** We are introducing additional monitors and automatic recovery for the deadlocked condition.

Read the full incident report →

Critical May 14, 2025

2025-May-14 Service Incident

Detected by Pingoru
May 14, 2025, 09:10 AM UTC
Resolved
May 14, 2025, 09:40 AM UTC
Duration
30m
Affected: US-EastUS-EastUS-East
Timeline · 3 updates
  1. investigating May 14, 2025, 09:10 AM UTC

    We are currently seeing errors in uploading and installing apps in our US-East-4 Datacenter. We are investigating.

  2. resolved May 14, 2025, 09:40 AM UTC

    After taking remedial action, app uploads and testing are now both working correctly. This incident is resolved

  3. postmortem Jun 13, 2025, 10:40 AM UTC

    ### **Dates:** Wednesday May 14th 2025, 08:00 UTC - 09:03 UTC ### **What happened:** An authentication failure lead to Real Device live & automated app tests our in US-East-4 Datacenter. ### **Why it happened:** Connections from the App Storage service failed due to a database issue. ### **How we fixed it:** The database cluster required restarting. ### **What we are doing to prevent it from happening again:** Database clean-ups will be done only during maintenance windows.

Read the full incident report →

Notice May 6, 2025

2025-May-6 Resolved Service Incident

Detected by Pingoru
May 06, 2025, 09:16 AM UTC
Resolved
May 06, 2025, 02:30 AM UTC
Duration
Timeline · 2 updates
  1. resolved May 06, 2025, 09:16 AM UTC

    Between 03:38 - 03:58 UTC, Real Devices in our EU data center were unavailable. After taking remedial action, the issue has been resolved. All services are fully operational.

  2. postmortem May 17, 2025, 10:51 AM UTC

    ### **Dates:** Tuesday April 6th 2025, 03:38 UTC - 03:58 UTC ### **What happened:** RDC devices were unavailable in the EU-Central-1 Datacenter. ### **Why it happened:** There was a DNS caching failure during a third party provider’s maintenance. ### **How we fixed it:** Availability was restored automatically, after the instances completed their start-up. ### **What we are doing to prevent it from happening again:** We will update our DNS caching for better fault tolerance.

Read the full incident report →

Notice April 23, 2025

2025-April-23 Resolved Service Incident

Detected by Pingoru
Apr 23, 2025, 05:42 PM UTC
Resolved
Apr 23, 2025, 05:42 PM UTC
Duration
Timeline · 2 updates
  1. resolved Apr 23, 2025, 05:42 PM UTC

    Between 16:55 - 17:25 UTC, we experienced Android Real Device Test failures in our EU data center. After taking remedial action, the issue has been resolved. All services are fully operational.

  2. postmortem Jun 13, 2025, 10:36 AM UTC

    ### **Dates:** Wednesday April 23rd 2025, 16:50 UTC - 17:50 UTC ### **What happened:** Native Android tests failed in the EU-Central-1 datacenter. ### **Why it happened:** A large increase in simultaneous jobs with unique applications and cache misses, exhausted resources on the hosts responsible for installing Android native applications. ### **How we fixed it:** Resources were increased to cope with demand. ### **What we are doing to prevent it from happening again:** We have introduced additional controls on native application installs to prevent simultaneous downloads of the same app to a single host .

Read the full incident report →

Major April 2, 2025

2025-April-2 Service Incident

Detected by Pingoru
Apr 02, 2025, 09:53 AM UTC
Resolved
Apr 02, 2025, 03:23 PM UTC
Duration
5h 29m
Affected: US-WestUS-WestEU-CentralEU-Central
Timeline · 5 updates
  1. investigating Apr 02, 2025, 09:53 AM UTC

    We are seeing failing iOS simulator tests when using Sauce Connect in US-West-1 & EU-Central-1 Data Centers. We are investigating.

  2. investigating Apr 02, 2025, 11:03 AM UTC

    We are continuing to see iOS simulator test failures when using Sauce Connect in the US-West-1 & EU-Central-1 Data Centers. We are actively investigating.

  3. monitoring Apr 02, 2025, 01:48 PM UTC

    We have identified the root cause and have deployed a fix for this issue. We are monitoring.

  4. resolved Apr 02, 2025, 03:23 PM UTC

    After taking remedial action, all services are operating as normal. This incident is resolved.

  5. postmortem May 17, 2025, 10:45 AM UTC

    ### **Dates:** Tuesday April 1st 2025, 14:54 UTC - Wednesday April 2nd 2025, 13:45 UTC ### **What happened:** iOS tests using Sauce Connect failed to start. ### **Why it happened:** There was a mismatch in the SSL certificate's expiration dates. ### **How we fixed it:** Re-ordered new SSL certificates. ### **What we are doing to prevent it from happening again:** We are adding monitoring for the SSL certificates.

Read the full incident report →

Major March 27, 2025

2025-March-27 Service Incident

Detected by Pingoru
Mar 27, 2025, 02:23 PM UTC
Resolved
Mar 27, 2025, 02:35 PM UTC
Duration
12m
Affected: US-WestUS-WestUS-WestUS-WestUS-WestUS-WestMobile App Distribution Platform
Timeline · 3 updates
  1. investigating Mar 27, 2025, 02:23 PM UTC

    We are seeing failures due to issue with our storage service affecting Virtual and Real Device Cloud as well as any saucectl and Mobile App Distribution in our US-West Data Center. We are investigating

  2. resolved Mar 27, 2025, 02:35 PM UTC

    App storage errors have been resolved. All services are fully operational.

  3. postmortem May 17, 2025, 10:41 AM UTC

    ### **Dates:** Wednesday March 26th 2025, 19:15 - Thursday March 27th 2025, 14:30 UTC ### **What happened:** An internal cache for mobile applications in one datacenter was unable to reach its origin, causing artifacts to age out and no longer be available to be served. ### **Why it happened:** A change to the cache’s routing prevented it from reaching its origin. ### **How we fixed it:** The routing change was reverted. ### **What we are doing to prevent it from happening again:** Additional monitoring and alerting has been implemented to identify this issue more quickly in the future.

Read the full incident report →

Major March 6, 2025

2025-March-6 Service Incident

Detected by Pingoru
Mar 06, 2025, 12:18 PM UTC
Resolved
Mar 06, 2025, 12:20 PM UTC
Duration
2m
Affected: US-WestUS-WestEU-CentralEU-CentralUS-EastUS-East
Timeline · 3 updates
  1. investigating Mar 06, 2025, 12:18 PM UTC

    We are currently experiencing significantly reduced availability of Android Real Devices in all our datacenters. We are investigating.

  2. resolved Mar 06, 2025, 12:20 PM UTC

    After taking remedial action, Android Real Devices are now available in all our datacenters. This incident is resolved

  3. postmortem Apr 01, 2025, 04:34 PM UTC

    ### **Dates:** Thursday March 6th 2025, 12:00 - 12:22 UTC ### **What happened:** Customers were unable to start new Android Real device tests in all datacentres. ### **Why it happened:** During a deployment, a race condition occurred during the reallocation of devices from "old" device pools to "new" device pools which caused devices to become unavailable in both pools. ### **How we fixed it:** The "old" pools were shutdown releasing the device lock, allowing the "new" pools to acquire the lock. ### **What we are doing to prevent it from happening again:** We are improving our device allocation processes.

Read the full incident report →

Major February 22, 2025

2025-February-21 Service Incident

Detected by Pingoru
Feb 22, 2025, 01:33 AM UTC
Resolved
Feb 22, 2025, 05:20 AM UTC
Duration
3h 46m
Affected: US-WestEU-Central
Timeline · 5 updates
  1. investigating Feb 22, 2025, 01:33 AM UTC

    New Sauce Connect Tunnels are not able to be created on our US-West-1 & EU-Central-1 Datacenters. We are investigating.

  2. investigating Feb 22, 2025, 01:34 AM UTC

    We are continuing to investigate this issue.

  3. identified Feb 22, 2025, 02:25 AM UTC

    We have identified the issue and have taken a remedial action. We are monitoring.

  4. resolved Feb 22, 2025, 05:20 AM UTC

    All services are now operating as normal. This incident is resolved.

  5. postmortem May 17, 2025, 10:54 AM UTC

    ### **Dates:** Saturday February 22nd 2025, 00:00 - 04:46 UTC ### **What happened:** Customers were unable to create SauceConnect tunnels in our US-West-1 and EU-Central-1 regions. ### **Why it happened:** The SSL certificate for the Sauce Connect frontend had expired. ### **How we fixed it:** The certificate was renewed and deployed to the affected regions. ### **What we are doing to prevent it from happening again:** We are improving our alerting around SSL certificate expiration.

Read the full incident report →

Notice February 17, 2025

2025-Feb-17 Resolved Service Incident

Detected by Pingoru
Feb 17, 2025, 06:44 PM UTC
Resolved
Feb 17, 2025, 06:44 PM UTC
Duration
Timeline · 2 updates
  1. resolved Feb 17, 2025, 06:44 PM UTC

    Between 12:52 and 13:53 UTC, some Real Device tests in EU-Central-1 Data center experienced failures reaching internet destinations because of an issue on an upstream ISP network. We have identified the issue and taken remedial action.

  2. postmortem Apr 01, 2025, 04:25 PM UTC

    ### **Dates:** Wednesday February 17th 2023, 12:52 - 13:53 UTC **What happened:** Real Device tests in EU-Central-1 datacenter were experiencing issues reaching internet destinations. ### **Why it happened:** A new internet provider was introduced that was experiencing issues with their network.. ### **How we fixed it:** A rollback was executed to move traffic off of the affected internet provider. ### **What we are doing to prevent it from happening again:** We engaged the provider and they corrected configuration issues with their network.

Read the full incident report →

Major January 21, 2025

2025-January-21 Service Incident

Detected by Pingoru
Jan 21, 2025, 01:39 PM UTC
Resolved
Jan 21, 2025, 01:46 PM UTC
Duration
7m
Affected: US-WestUS-WestUS-WestUS-WestUS-WestEU-CentralEU-CentralEU-CentralEU-CentralEU-Central
Timeline · 3 updates
  1. investigating Jan 21, 2025, 01:39 PM UTC

    We are currently seeing App installation errors when trying to run iOS tests on our US-West-1 & EU-Central-1 Datacenter. We are investigating

  2. resolved Jan 21, 2025, 01:46 PM UTC

    After taking remedial action, apps are now installing correctly in all datacenters. This incident is resolved.

  3. postmortem Apr 01, 2025, 04:19 PM UTC

    ### Dates: Tuesday January 21st 2025, 12:40 - 13:44 UTC ### **What happened:** Tests using iOS Real Devices experienced failures to download and install apps for the eu-central01 and us-west-1 datacenters. **Why it happened:** A misconfiguration with the resigner service caused errors when communicating with the app storage. ### **How we fixed it:** A rollback was executed, restoring configuration to the previous working state. ### **What we are doing to prevent it from happening again:** * Improve canary deployments for each Real Device Cloud region. * Improve end to end testing for canary deployments. * Improve SLOs for the Real Device application installs

Read the full incident report →

Major January 8, 2025

2025-January-8 Service Incident

Detected by Pingoru
Jan 08, 2025, 11:40 AM UTC
Resolved
Jan 08, 2025, 12:54 PM UTC
Duration
1h 14m
Affected: US-WestUS-WestUS-WestUS-WestUS-WestUS-WestUS-WestUS-West
Timeline · 6 updates
  1. investigating Jan 08, 2025, 11:40 AM UTC

    We are currently seeing Live and automated test results are not being retained intermittently in our US-West-1 data centers. We are investigating.

  2. investigating Jan 08, 2025, 11:56 AM UTC

    We are currently seeing Live and Automated test results are not being retained for Virtual Device tests intermittently in our US-West-1 data centers. We are investigating.

  3. investigating Jan 08, 2025, 12:41 PM UTC

    We are continuing to investigate this issue.

  4. investigating Jan 08, 2025, 12:44 PM UTC

    We are currently seeing intermittent issues with test assets not being retained for Virtual device tests in our US-West-1 data center. We are investigating.

  5. resolved Jan 08, 2025, 12:54 PM UTC

    After taking remedial action, test assets are being retained successfully in all tests. This incident is resolved.

  6. postmortem Apr 01, 2025, 04:14 PM UTC

    ### **Dates:** Wednesday 8 January 2025, 11:00 - 12:25 UTC ### **What happened:** We were experiencing intermittent issues with Virtual device tests missing test assets in the US-West-1 region ### **Why it happened:** The asset uploader service was experiencing Http errors uploading assets to our backend storage. ### **How we fixed it:** The 3rd-party provider resolved an issue with their infrastructure. ### **What we are doing to prevent it from happening again:** We are improving the caching and retry logic of the asset uploader service to prevent further occurrence.

Read the full incident report →

Critical January 7, 2025

2025-January-7 Service Incident

Detected by Pingoru
Jan 07, 2025, 09:12 AM UTC
Resolved
Jan 07, 2025, 06:09 PM UTC
Duration
8h 56m
Affected: Mobile App Distribution PlatformMobile App Distribution UI
Timeline · 4 updates
  1. investigating Jan 07, 2025, 09:12 AM UTC

    We are currently seeing .testfairy.com Website pages throwing full-page red alert error messages. We are investigating.

  2. investigating Jan 07, 2025, 09:36 AM UTC

    There is currently an issue with accessing *.testfairy.com pages via Chromium-based browsers, preventing app distribution. Safari browsers can still access these sites. We are investigating.

  3. investigating Jan 07, 2025, 10:58 AM UTC

    There is currently an intermittent issue with accessing *.testfairy.com pages via Chromium-based browsers, preventing app distribution. Safari browsers can still access these sites. We are investigating.

  4. resolved Jan 07, 2025, 06:09 PM UTC

    Access to *.testfairy.com via Chromium-based browsers has been restored. This incident is resolved.

Read the full incident report →

Major January 3, 2025

2025-January-3 Service Incident

Detected by Pingoru
Jan 03, 2025, 11:06 AM UTC
Resolved
Jan 03, 2025, 11:10 AM UTC
Duration
3m
Affected: US-WestUS-West
Timeline · 3 updates
  1. investigating Jan 03, 2025, 11:06 AM UTC

    We are currently seeing an issue with Live and automated test results not being displayed on the test results page in our US-West-1 data center. We are investigating.

  2. resolved Jan 03, 2025, 11:10 AM UTC

    After taking remedial action, Test Results are now available again in all data centers. This incident is resolved.

  3. postmortem Apr 01, 2025, 04:05 PM UTC

    ### **Dates:** Friday 3 January 2025, 09:52 - 11:07 UTC ### **What happened:** Customers were unable to access test results in the Web UI for our US-West-1 datacenter. ### **Why it happened:** A defect was introduced during a product deployment. ### **How we fixed it:** A rollback was executed to the previous working version. ### **What we are doing to prevent it from happening again:** We are creating additional checks for the authentication method upgrades.

Read the full incident report →

Major December 18, 2024

2024-December-18 Service Incident

Detected by Pingoru
Dec 18, 2024, 10:43 AM UTC
Resolved
Dec 18, 2024, 11:57 AM UTC
Duration
1h 13m
Affected: US-WestEU-Central
Timeline · 3 updates
  1. investigating Dec 18, 2024, 10:43 AM UTC

    There is currently an issue with purchasing self-service plans from our dashboard, We are investigating.

  2. resolved Dec 18, 2024, 11:57 AM UTC

    Customers are able to purchase Self-Service plans from our dashboard again. This issue is resolved.

  3. postmortem Apr 01, 2025, 03:51 PM UTC

    ### **Dates:** Wednesday December 18th 2024, 09:05 - 11:31 UTC ### **What happened:** Customers were unable to purchase self-service plans through the billing page. ### **Why it happened:** Breaking changes were introduced between our billing service client SDK and the third-party billing provider. ### **How we fixed it:** The SDK for the third-party billing provider was updated in our service. ### **What we are doing to prevent it from happening again:** We’re working with the third-party billing provider to better understand their versioning practices to ensure we remain inline with them.

Read the full incident report →

Notice December 12, 2024

2024-December-12 Resolved Service Incident

Detected by Pingoru
Dec 12, 2024, 01:59 PM UTC
Resolved
Dec 12, 2024, 01:59 PM UTC
Duration
Affected: US-WestUS-WestUS-WestEU-CentralEU-CentralEU-CentralUS-EastUS-EastUS-East
Timeline · 2 updates
  1. resolved Dec 12, 2024, 01:59 PM UTC

    Between 12:40 UTC and 13:20 UTC, Video recordings were missing in test reports for Android tests, affected all regions . We have identified the issue and taken remedial action.

  2. postmortem Apr 01, 2025, 03:46 PM UTC

    ### **Dates:** Thursday December 12 2024, 12:40 - 13:30 UTC ### **What happened:** Video recordings for Android Real Device tests were not showing in test results. ### **Why it happened:** A code deployment contained a broken dependency. ### **How we fixed it:** A rollback was executed to the previous known working version. ### **What we are doing to prevent it from happening again:** A check for broken dependencies was added to the deployment.

Read the full incident report →

Major November 19, 2024

2024-November-19 Service Incident

Detected by Pingoru
Nov 19, 2024, 09:35 PM UTC
Resolved
Nov 19, 2024, 10:28 PM UTC
Duration
53m
Affected: US-WestUS-WestUS-WestEU-CentralEU-CentralEU-CentralUS-EastUS-EastUS-East
Timeline · 3 updates
  1. investigating Nov 19, 2024, 09:35 PM UTC

    Android real devices in US West 1, EU Central 1, and US East datacenter are unable to run automated Appium and Espresso app tests and manual app tests. We are investigating.

  2. resolved Nov 19, 2024, 10:28 PM UTC

    After taking remedial action all services are operating as normal. This incident is resolved.

  3. postmortem Apr 01, 2025, 03:54 PM UTC

    ### **Dates:** Wednesday November 19th 2024, 21:30 - 23:30 UTC ### **What happened:** Android real devices were unable to run app tests in all datacenters. ### **Why it happened:** An unexpected policy change by Google caused MDM managed devices to be locked down due to policy violations around accessibility services. ### **How we fixed it:** We rolled back enablement of the TalkBack accessibility service. ### **What we are doing to prevent it from happening again:** We opened a support case with Google to get further information on why this policy change happened, as well as began investigating running our own MDM solution.

Read the full incident report →

Major October 30, 2024

2024-October-30 Service Incident

Detected by Pingoru
Oct 30, 2024, 07:00 PM UTC
Resolved
Oct 30, 2024, 09:52 PM UTC
Duration
2h 52m
Affected: US-WestUS-WestUS-WestUS-WestUS-WestUS-WestUS-WestUS-WestVisual Testing HubUS-WestVisual Testing InfrastructureUS-West
Timeline · 4 updates
  1. investigating Oct 30, 2024, 07:00 PM UTC

    We are currently seeing intermittent issues with Sauce Connect tunnel startup and allocation in the US-West-1 data center. We are investigating.

  2. monitoring Oct 30, 2024, 08:53 PM UTC

    We have taken remedial action and are seeing an improvement with Sauce Connect tunnel startup and allocation in the US-West-1 data center. We are monitoring

  3. resolved Oct 30, 2024, 09:52 PM UTC

    After taking remedial action all services are operating as normal. This incident is resolved.

  4. postmortem Apr 01, 2025, 03:39 PM UTC

    ### **Dates:** Wednesday October 30th 2023, 03:45 - 21:06 UTC ### **What happened:** Intermittent errors occurred when starting or using Sauce Connect tunnel in the US West datacenter. ### **Why it happened:** A service responsible for creating new bindings between the tunnel endpoints and test VMs experienced timeouts on some hosts. ### **How we fixed it:** Service was restored after clearing bindings on the affected hosts. ### **What we are doing to prevent it from happening again:** Monitoring and alerting for when the service is unable to create new bindings has been improved. The root cause of the condition is under investigation.

Read the full incident report →