Crusoe Outage History

Crusoe is up right now

Crusoe had 36 outages in the last 2 years totaling 102h 38m of downtime — averaging 1.5 incidents per month.

There were 36 Crusoe outages since June 11, 2025 totaling 102h 38m of downtime. Each is summarised below — incident details, duration, and resolution information.

Source: https://status.crusoecloud.com

Major September 30, 2025

Partial Outage in eu-iceland1-a

Detected by Pingoru
Sep 30, 2025, 06:45 PM UTC
Resolved
Oct 01, 2025, 12:00 AM UTC
Duration
5h 14m
Affected: eu-iceland1
Timeline · 5 updates
  1. investigating Sep 30, 2025, 06:45 PM UTC

    We are currently investigating an issue impacting virtual machine accessibility in eu-iceland1-a region. Some users may be unable to access their VMs at this time. We appreciate your patience and will provide updates as more information becomes available.

  2. investigating Sep 30, 2025, 07:48 PM UTC

    We're making significant progress in resolving the virtual machine accessibility issues in the eu-iceland1-a region. We apologize for the continued disruption and thank you for your patience as we work to restore full service. We will provide another update as soon as more information is available or when services are fully restored. You can continue to contact our support team at [email protected], if there are any other concerns.

  3. identified Sep 30, 2025, 08:34 PM UTC

    We have identified and mitigated the issue. Although no further impact is expected, we are continuing to monitor. If you are still experiencing issues, please reach out to us through [email protected] and we will investigate further.

  4. monitoring Sep 30, 2025, 09:55 PM UTC

    A fix has been applied and the outage has been mitigated. Services have returned to normal operations in eu-iceland1-a region. We are continuing to monitor. Please contact [email protected] if you are still experiencing any issues.

  5. resolved Oct 01, 2025, 12:00 AM UTC

    This incident has been resolved. Please contact [email protected] if you are still experiencing any issues.

Read the full incident report →

Major September 30, 2025

Partial outage in us-southcentral

Detected by Pingoru
Sep 30, 2025, 10:22 AM UTC
Resolved
Sep 30, 2025, 10:49 AM UTC
Duration
27m
Affected: us-southcentral1
Timeline · 2 updates
  1. investigating Sep 30, 2025, 10:22 AM UTC

    We are currently investigating an issue with machine accessibility and performance issues in the us-southcentral1 region . Some users may be unable to access their VMs at this time or see performance degradation. We appreciate your patience and will provide updates as more information becomes available.

  2. resolved Sep 30, 2025, 10:49 AM UTC

    A fix has been implemented. This incident is now resolved

Read the full incident report →

Minor September 19, 2025

CMK Nodepool Quota Not Visible in UI

Detected by Pingoru
Sep 19, 2025, 09:53 PM UTC
Resolved
Sep 20, 2025, 01:14 AM UTC
Duration
3h 20m
Affected: UICrusoe Managed Kubernetes (CMK)
Timeline · 2 updates
  1. investigating Sep 19, 2025, 09:53 PM UTC

    Please be aware that the user interface is currently not showing the correct capacity for CMK nodepool creation, even when quota is available. This is a UI-specific bug. As a temporary workaround, please use the command-line interface (CLI) to successfully create nodepools. Our team is investigating the root cause and will deploy a fix as soon as possible.

  2. resolved Sep 20, 2025, 01:14 AM UTC

    The fix has been deployed, and the user interface now correctly displays the capacity for CMK nodepool creation. You can now use either the UI or the CLI to provision new nodepools.

Read the full incident report →

Major September 19, 2025

VMs in us-southcentral1-a region failing to start with Internal Server Error

Detected by Pingoru
Sep 19, 2025, 07:33 AM UTC
Resolved
Sep 19, 2025, 01:20 PM UTC
Duration
5h 46m
Affected: us-southcentral1
Timeline · 2 updates
  1. investigating Sep 19, 2025, 07:33 AM UTC

    We are currently investigating an issue impacting the ability to start Virtual Machines (VMs) in our us-southcentral1-a region. Our engineering team has identified a potential issue and is actively working on a resolution. Thank you for your patience as we work to restore full functionality.

  2. resolved Sep 19, 2025, 01:20 PM UTC

    A fix has been implemented. This incident is now resolved

Read the full incident report →

Major August 20, 2025

Service Degradation in us-east1-a Region due to Power Disruption

Detected by Pingoru
Aug 20, 2025, 05:09 PM UTC
Resolved
Aug 21, 2025, 11:35 PM UTC
Duration
1d 6h
Affected: us-east1Infiniband Networks
Timeline · 4 updates
  1. investigating Aug 20, 2025, 05:09 PM UTC

    We're investigating a service degradation in our us-east1-a region, triggered by a facility power disruption at our data center. The primary impact is to the Infiniband networking fabric, which may cause intermittent errors or failures for multi-node, distributed workloads. Some customers may also experience individual virtual machines becoming unavailable. Our teams are working to identify all affected resources. Our engineering teams are actively working to stabilize the affected systems and mitigate the risk of further disruption. We are coordinating with our data center provider to support their remediation efforts and restore full service resiliency as quickly as possible. We apologize for any impact this is causing.

  2. investigating Aug 21, 2025, 02:32 AM UTC

    We are continuing to investigate and mitigate the service degradation affecting our us-east1-a region, following a facility power disruption at our data center. Our teams remain in close coordination with the data center provider as they work to fully restore services. Recovery of critical systems remains our top priority. We sincerely apologize for the ongoing impact and appreciate your continued patience as we work to resolve the issue.

  3. monitoring Aug 21, 2025, 07:59 PM UTC

    We have successfully mitigated the issue affecting the us-east1-a region. The facility power disruption has been addressed, and the impacted Infiniband networking fabric and associated systems have returned to normal operation. All affected services are now stable, and full functionality has been restored. Our teams will continue to monitor the region closely to ensure continued stability. We appreciate your patience during this incident and apologize again for any disruption it may have caused.

  4. resolved Aug 21, 2025, 11:35 PM UTC

    This incident has been resolved.

Read the full incident report →

Major August 1, 2025

VM creation and networking failure for A100 Infiniband type VMs in us-east region

Detected by Pingoru
Aug 01, 2025, 03:18 AM UTC
Resolved
Aug 01, 2025, 06:21 AM UTC
Duration
3h 2m
Affected: us-east1
Timeline · 4 updates
  1. investigating Aug 01, 2025, 03:18 AM UTC

    We have identified an issue that is preventing new or restarted Virtual Machines from booting successfully on our A100 Infiniband hardware fleet. Any new VM provisioning request for this hardware type will also fail. Additionally, any existing VM on an A100 Infiniband node that is stopped and started (or rebooted) will also fail to come back online. Existing, currently running VMs are not affected and will continue to operate normally. We advise customers to avoid rebooting critical workloads on this hardware until a resolution is in place. Our engineering teams are actively investigating the root cause and are working to restore normal provisioning operations as quickly as possible.

  2. identified Aug 01, 2025, 03:44 AM UTC

    The issue has been identified, and we have tested a fix internally. We are working on rolling out the fix to our A100 Infiniband type servers now.

  3. monitoring Aug 01, 2025, 05:06 AM UTC

    A fix has been implemented, and we are monitoring the environment for now.

  4. resolved Aug 01, 2025, 06:21 AM UTC

    This incident is now resolved

Read the full incident report →

Major July 24, 2025

Issues affecting persistent storage in eu-iceland1

Detected by Pingoru
Jul 24, 2025, 08:30 AM UTC
Resolved
Jul 24, 2025, 11:36 AM UTC
Duration
3h 6m
Affected: Persistent Storage
Timeline · 4 updates
  1. investigating Jul 24, 2025, 08:30 AM UTC

    We are currently investigating an issue affecting VM's in eu-iceland1

  2. identified Jul 24, 2025, 10:26 AM UTC

    We have identified an issue with our persistent disk storage system for a subset of VM's in eu-iceland1. Our engineers are now working to restore service.

  3. monitoring Jul 24, 2025, 10:52 AM UTC

    A fix has been implemented for the persistent disk storage system. We are observing recovery, and affected virtual machines in eu-iceland1 are returning to a healthy state. Our team will continue to monitor the platform to ensure full service restoration and stability. We will provide the next update once the incident is fully resolved. We appreciate your understanding and support. If you experience any issues, please don't hesitate to contact us at [email protected].

  4. resolved Jul 24, 2025, 11:36 AM UTC

    This incident has been resolved.

Read the full incident report →

Major July 11, 2025

Investigating Control Plane Failures in US-Southcentral

Detected by Pingoru
Jul 11, 2025, 02:28 PM UTC
Resolved
Jul 11, 2025, 04:21 PM UTC
Duration
1h 52m
Affected: API
Timeline · 4 updates
  1. investigating Jul 11, 2025, 02:28 PM UTC

    We're currently investigating an issue impacting our control plane in the US-Southcentral region. Customers may experience errors or timeouts when making certain API calls, including those for creating new virtual machines. Our engineering team is investigating the underlying cause with the highest priority. We appreciate your patience and will provide another update as soon as we have more information.

  2. identified Jul 11, 2025, 03:11 PM UTC

    We've identified a potential issue on the control plane network and we're working to address the issue currently.

  3. monitoring Jul 11, 2025, 03:32 PM UTC

    A fix has been applied and the outage has been mitigated. Services have returned to normal operations in US-Southcentral region. We are continuing to monitor the situation and validate preventive measures. Please contact [email protected] if you are still experiencing any issues.

  4. resolved Jul 11, 2025, 04:21 PM UTC

    This incident has been resolved.

Read the full incident report →

Minor June 30, 2025

Certain VM's in us-northcentral1-a are unavailable

Detected by Pingoru
Jun 30, 2025, 06:05 PM UTC
Resolved
Jul 02, 2025, 05:21 PM UTC
Duration
1d 23h
Affected: Persistent Storage
Timeline · 8 updates
  1. investigating Jun 30, 2025, 06:05 PM UTC

    We are currently investigating a potential service interruption that may impact VMs to be unreachable in the us-northcentral1-a region. If you are experiencing issues in this region, please reach out to [email protected]

  2. investigating Jun 30, 2025, 10:01 PM UTC

    Our teams are working diligently and believe we have identified the root cause and we are working on implementing a resolution. We will continue to provide updates on this page as further developments occur.

  3. investigating Jul 01, 2025, 01:34 AM UTC

    We are continuing our in-depth investigation into the intermittent storage connectivity issues impacting VM reachability in the us-northcentral1-a region. We will provide more information as it becomes available.

  4. investigating Jul 01, 2025, 05:21 AM UTC

    Our team is continuing its in-depth investigation into the intermittent storage connectivity issues impacting VM reachability in the us-northcentral1-a region, in collaboration with our vendor. We will keep you posted on further developments

  5. investigating Jul 01, 2025, 10:06 AM UTC

    We are continuing our dedicated investigation into the intermittent storage connectivity issues impacting VM reachability in us-northcentral1-a. Our teams are working in close collaboration with our storage vendor to analyze the underlying cause. We recognize the impact this is having and have all necessary resources allocated to resolving this.

  6. investigating Jul 01, 2025, 02:14 PM UTC

    We are seeing signs of recovery, and some previously impacted VMs in the us-northcentral1-a region are now available. Our teams are closely monitoring the environment and continuing their work with our vendor to restore full service for all remaining affected instances.

  7. monitoring Jul 01, 2025, 08:57 PM UTC

    A fix has been applied and the outage has been mitigated. Services have returned to normal operations in us-northcentral1-a region. We are continuing to monitor the situation and validate preventive measures. Please contact [email protected] if you are still experiencing any issues.

  8. resolved Jul 02, 2025, 05:21 PM UTC

    This incident has been resolved.

Read the full incident report →

Major June 12, 2025

Control Plane Service Outage

Detected by Pingoru
Jun 12, 2025, 07:55 PM UTC
Resolved
Jun 12, 2025, 09:35 PM UTC
Duration
1h 40m
Affected: APIUIus-east1us-northcentral1us-southcentral1eu-iceland1
Timeline · 6 updates
  1. investigating Jun 12, 2025, 07:55 PM UTC

    We are currently experiencing an outage with our 3rd party service provider, which is intermittently impacting the ability to provision and manage Virtual Machines (VMs) across all of our regions.

  2. investigating Jun 12, 2025, 08:03 PM UTC

    We are continuing to investigate this issue.

  3. investigating Jun 12, 2025, 08:06 PM UTC

    We are continuing to investigate this issue.

  4. investigating Jun 12, 2025, 08:16 PM UTC

    We are expanding the scope of this incident to include the Crusoe console being inaccessible. This means that users are currently unable to log in, view, or manage resources through the Crusoe console interface. We are investigating this alongside the existing issues.

  5. monitoring Jun 12, 2025, 09:08 PM UTC

    We're starting to see signs of recovery across affected services, and our team is monitoring the situation very closely.

  6. resolved Jun 12, 2025, 09:35 PM UTC

    All Crusoe Cloud services have been restored and are fully operational. We will continue to monitor our platform. If you notice any issues please reach out to [email protected]

Read the full incident report →

Notice June 11, 2025

Partial outage in us-east1

Detected by Pingoru
Jun 11, 2025, 03:10 AM UTC
Resolved
Jun 11, 2025, 03:36 AM UTC
Duration
25m
Affected: us-east1
Timeline · 3 updates
  1. investigating Jun 11, 2025, 03:10 AM UTC

    We are currently investigating an issue impacting virtual machine accessibility. Some users may be unable to stop and start Virtual Machines.

  2. monitoring Jun 11, 2025, 03:20 AM UTC

    We have identified the issue and a fix has been implemented. Currently, there is no further impact expected to virtual machine accessibility. We are currently monitoring the environment.

  3. resolved Jun 11, 2025, 03:36 AM UTC

    We are marking this incident as resolved. Please reach out to [email protected] if you experience any issues.

Read the full incident report →