Aptible Outage History

Aptible is up right now

Aptible had 28 outages in the last 2 years totaling 158h 18m of downtime — averaging 1.2 incidents per month.

There were 28 Aptible outages since June 14, 2024 totaling 158h 18m of downtime. Each is summarised below — incident details, duration, and resolution information.

Source: https://status.aptible.com

Notice May 8, 2026

Aptible response to CVE-2026-43284

Detected by Pingoru
May 08, 2026, 05:02 AM UTC
Resolved
May 08, 2026, 03:35 PM UTC
Duration
10h 32m
Timeline · 2 updates
  1. monitoring May 08, 2026, 05:02 AM UTC

    Aptible is aware of “Dirty Frag,” a recently disclosed Linux kernel vulnerability that may allow local privilege escalation under certain conditions [0]. We have taken action across the Aptible platform to protect all customer workloads against this vulnerability. No customer action is required at this time. We will continue to monitor upstream kernel, distribution, and security vendor guidance related to this vulnerability and will take any additional action if needed. [0] https://www.openwall.com/lists/oss-security/2026/05/07/8

  2. resolved May 08, 2026, 03:35 PM UTC

    This incident has been resolved.

Read the full incident report →

Notice May 8, 2026

Instability in us-east-1

Detected by Pingoru
May 08, 2026, 01:05 AM UTC
Resolved
May 09, 2026, 03:35 AM UTC
Duration
1d 2h
Affected: Aptible Deploy
Timeline · 4 updates
  1. investigating May 08, 2026, 01:05 AM UTC

    Due to an underlying issue in AWS we are experiencing instability in a certain availability zone in us-east-1. We are investigating this issue and will follow up with recommendations shortly

  2. identified May 08, 2026, 01:39 AM UTC

    We have blocked operations from provisioning into the specified region

  3. monitoring May 08, 2026, 02:02 PM UTC

    We have unblocked operations from the specified region as we have seen successful provisioning and maintenance of new instance in that availability zone. Instances that were affected have not yet seen recovery, and the upstream AWS alert indicates they may still take hours of recovery.

  4. resolved May 09, 2026, 03:35 AM UTC

    An upstream AWS issue affected infrastructure in a single Availability Zone in us-east-1. During the event, we blocked provisioning into the affected zone and monitored AWS’s recovery efforts. AWS has since recovered enough of the affected availability zone for the impacted Aptible infrastructure to recover. We have also completed recovery of any Aptible-managed resources that required follow-up action on our side. Any customers with directly affected resources have been notified of resolution. We are continuing normal platform monitoring, but we are marking this incident as resolved on our end.

Read the full incident report →

Notice April 29, 2026

Aptible response to CVE-2026-31431

Detected by Pingoru
Apr 29, 2026, 10:40 PM UTC
Resolved
Apr 29, 2026, 10:40 PM UTC
Duration
Timeline · 1 update
  1. resolved Apr 29, 2026, 10:40 PM UTC

    Aptible is aware of CVE-2026-31431, also known as “Copy Fail,” a recently disclosed Linux kernel vulnerability that may allow local privilege escalation under certain conditions [0]. We have taken action across the Aptible platform to protect all customer workloads against this vulnerability. No customer action is required at this time. We will continue to monitor upstream kernel, distribution, and security vendor guidance related to this CVE and will take any additional action if needed. [0] https://copy.fail/

Read the full incident report →

Notice April 19, 2026

Service degradation affecting api.aptible.com

Detected by Pingoru
Apr 19, 2026, 02:03 AM UTC
Resolved
Apr 19, 2026, 03:10 AM UTC
Duration
1h 6m
Affected: api.aptible.com
Timeline · 3 updates
  1. investigating Apr 19, 2026, 02:03 AM UTC

    We are investigating an incident impacting api.aptible.com. Viewing current resources and running operations are currently impacted, however running applications and databases are unaffected

  2. monitoring Apr 19, 2026, 02:54 AM UTC

    A fix has been implemented and operations that were failing to enqueue during the availability outage have been retried. We will continue monitoring the situation.

  3. resolved Apr 19, 2026, 03:10 AM UTC

    This incident has been resolved.

Read the full incident report →

Minor March 17, 2026

us-east-1 Intermittent DNS Resolution Errors

Detected by Pingoru
Mar 17, 2026, 10:33 PM UTC
Resolved
Mar 17, 2026, 11:33 PM UTC
Duration
59m
Affected: AWS EC2 (us-east-1 — Virginia)
Timeline · 4 updates
  1. investigating Mar 17, 2026, 10:33 PM UTC

    We are currently observing intermittent DNS resolution failures affecting infrastructure in us-east-1. These resolver errors may cause some Aptible Deploy operations (such as deploys, restarts, SSH, or container provisioning) to fail or time out. Applications running on Aptible Deploy that rely on network hostname resolution may also experience intermittent connection failures related to DNS resolution. We are actively working with our infrastructure providers to investigate the issue and will provide updates here as they are available.

  2. monitoring Mar 17, 2026, 11:02 PM UTC

    We have observed DNS resolution error rates return to normal levels, and all dependent activities should be functioning normally. We will continue to monitor for an additional period to ensure the issue is fully resolved before closing the incident. Further updates will be posted here if we observe any recurrence.

  3. resolved Mar 17, 2026, 11:33 PM UTC

    This incident has been resolved.

  4. postmortem Mar 18, 2026, 04:42 PM UTC

    Additional detail about timeline: we observed a relative increase in DNS resolver errors in us-east-1 beginning at approximately 2026-03-17 18:30 UTC. The elevated error rate was intermittent rather than constant, with the most visible clusters occurring from roughly 18:30-18:55 UTC and again from about 20:55-21:40 UTC. Error rates returned to normal shortly after 21:40 UTC.

Read the full incident report →

Critical December 31, 2025

Performance issues on auth.aptible.com

Detected by Pingoru
Dec 31, 2025, 10:48 AM UTC
Resolved
Dec 31, 2025, 02:49 PM UTC
Duration
4h
Affected: auth.aptible.com
Timeline · 3 updates
  1. investigating Dec 31, 2025, 10:48 AM UTC

    We are aware of degraded performance affecting auth.aptible.com. This affects all actions that interact with our APIs including login, viewing and managing resources, etc.

  2. monitoring Dec 31, 2025, 11:33 AM UTC

    Performance has improved since our mitigations. We are still monitoring the situation for continued degradation.

  3. resolved Dec 31, 2025, 02:49 PM UTC

    This incident has been resolved.

Read the full incident report →

Minor November 5, 2025

Platform API Performance Degradation

Detected by Pingoru
Nov 05, 2025, 03:59 PM UTC
Resolved
Nov 05, 2025, 04:07 PM UTC
Duration
7m
Affected: api.aptible.comAptible Deploy
Timeline · 3 updates
  1. identified Nov 05, 2025, 03:59 PM UTC

    We are investigating an issue causing degraded performance for the Aptible Platform API. As part of the mitigation, a temporary operation block across all regions is in place which prevents new operations from starting. This includes deployments, scaling actions, SSH sessions, and database tunnels. Running apps and databases are not affected. Next update We will provide an update by 11:30 ET or sooner.

  2. monitoring Nov 05, 2025, 04:05 PM UTC

    A fix has been implemented and we are monitoring the results. The operation block has been lifted.

  3. resolved Nov 05, 2025, 04:07 PM UTC

    This incident has been resolved.

Read the full incident report →

Minor October 29, 2025

Operations Impacted by Host Provisioning Failures

Detected by Pingoru
Oct 29, 2025, 09:17 PM UTC
Resolved
Oct 29, 2025, 09:50 PM UTC
Duration
33m
Affected: Aptible Deploy
Timeline · 3 updates
  1. investigating Oct 29, 2025, 09:17 PM UTC

    We are currently experiencing failures when provisioning new hosts across all regions. This impacts operations that require new capacity, including but not limited to: app deploys, scaling events, and new database creation. Existing running workloads are not affected. Our team is actively investigating and working to restore normal operation performance. We will provide an update by no later than 5:45pm EST.

  2. monitoring Oct 29, 2025, 09:33 PM UTC

    A fix has been implemented and we are monitoring the results.

  3. resolved Oct 29, 2025, 09:50 PM UTC

    This incident has been resolved.

Read the full incident report →

Notice October 28, 2025

Delays and failures for provisioning-related operations in us-east-1

Detected by Pingoru
Oct 28, 2025, 05:09 PM UTC
Resolved
Oct 28, 2025, 09:08 PM UTC
Duration
3h 59m
Affected: AWS EC2 (us-east-1 — Virginia)
Timeline · 5 updates
  1. investigating Oct 28, 2025, 05:09 PM UTC

    We are seeing significantly increased provisioning times in the AWS us-east-1 region. This is affecting operations that require new capacity, including deployments, scaling, and creation of new resources. These operations may time out and fail. Our team is actively investigating and working to mitigate the impact. We will provide our next update by 2 pm EST.

  2. monitoring Oct 28, 2025, 05:32 PM UTC

    AWS has confirmed the issue with EC2 provisioning issues in us-east-1. Meanwhile, we have been able to implement a mitigation for affected customers. We will continue to leave this issue open and monitor its progress, and will provide any relevant updates as needed.

  3. identified Oct 28, 2025, 05:53 PM UTC

    We are now seeing additional broader EC2 provisioning issues in the AWS us-east-1 region that are outside the scope of our mitigation. Additional delays/failures on operations that require new capacity in us-east-1 may still occur at this time. We will provide our next update by 2:30 pm EST.

  4. monitoring Oct 28, 2025, 06:35 PM UTC

    We are not currently seeing provisioning issues in AWS us-east-1. We are modifying the incident status to Monitoring, and will provide any relevant updates as needed.

  5. resolved Oct 28, 2025, 09:08 PM UTC

    This incident has been resolved.

Read the full incident report →

Minor October 20, 2025

Operation failures to to AWS API incident

Detected by Pingoru
Oct 20, 2025, 07:47 AM UTC
Resolved
Oct 20, 2025, 09:46 AM UTC
Duration
1h 58m
Affected: AWS EC2 (us-east-1 — Virginia)Aptible Deploy
Timeline · 4 updates
  1. identified Oct 20, 2025, 07:47 AM UTC

    AWS is experiencing an issue with their APIs in us-east-1, and many services are impacted at this time. This may lead to failed operations, but there appears to be no impact to availability for apps and databases running on Aptible at this time.

  2. monitoring Oct 20, 2025, 08:31 AM UTC

    Since the AWS issues seem serious, and to avoid any operation failures causing resources to get stuck in a transitional state, we are taking action to block all new operations on the platform at this time. We have identified no impact to acceibilty or preformance of running apps and database containers at this time.

  3. monitoring Oct 20, 2025, 09:30 AM UTC

    AWS has indicated a fix has been applied, and we are re-enabling operations. We will continue to monitor for additional impact, as both Dockerhub and Quay.io were offline due to the AWS issue.

  4. resolved Oct 20, 2025, 09:46 AM UTC

    This incident has been resolved.

Read the full incident report →

Notice October 12, 2025

Delays and queued operations in us-east-1

Detected by Pingoru
Oct 12, 2025, 02:30 AM UTC
Resolved
Oct 12, 2025, 03:40 AM UTC
Duration
1h 10m
Timeline · 4 updates
  1. investigating Oct 12, 2025, 02:30 AM UTC

    One of our internal queuing systems observed a failure that has affected several operations enqueued in us-east-1. Our Reliability Team is restoring the queueing system and investigating the root cause of the failure.

  2. monitoring Oct 12, 2025, 02:42 AM UTC

    A fix has been implemented and we are monitoring the results.

  3. monitoring Oct 12, 2025, 03:40 AM UTC

    No operation queueing issues have been observed since the last update. If you observed any operations that failed or were stuck in a "pending" state, please retry the operations and they should run without errors.

  4. resolved Oct 12, 2025, 03:40 AM UTC

    This incident has been resolved.

Read the full incident report →

Notice October 10, 2025

Redis Security Advisory (CVE-2025-49844)

Detected by Pingoru
Oct 10, 2025, 03:53 PM UTC
Resolved
Oct 10, 2025, 03:53 PM UTC
Duration
Timeline · 1 update
  1. resolved Oct 10, 2025, 03:53 PM UTC

    Aptible has reviewed and addressed the Redis vulnerability described in https://redis.io/blog/security-advisory-cve-2025-49844/. We have confirmed that all internal Redis instances used in delivering the Aptible platform are secure and have no exploitation path related to this CVE. For Aptible customers who do not have their Redis databases publicly exposed, a path to exploitation is similarly unlikely. Additionally, for customers using Redis 6.2 databases on Aptible, a patched version will be available by Oct 10th at 5pm EST, and additional releases (7.2) are upcoming; see aptible.com/changelog for ongoing release updates. To ensure you are running the latest minor version when available, please run the cli command: aptible db:reload for each Redis database. For more information on using this command, visit https://www.aptible.com/docs/reference/aptible-cli/cli-commands/cli-db-reload. If you have any specific questions or concerns related to this CVE, please contact us.

Read the full incident report →

Notice August 7, 2025

Aptible Documentation Site Unavailable

Detected by Pingoru
Aug 07, 2025, 01:43 PM UTC
Resolved
Aug 07, 2025, 02:21 PM UTC
Duration
38m
Timeline · 2 updates
  1. investigating Aug 07, 2025, 01:43 PM UTC

    Our online documentation at aptible.com/docs is temporarily unavailable. We are working with our upstream provider to resolve the issue and will update this incident when it is resolved.

  2. resolved Aug 07, 2025, 02:21 PM UTC

    This incident has been resolved.

Read the full incident report →

Notice June 5, 2025

Increased error rate

Detected by Pingoru
Jun 05, 2025, 07:34 PM UTC
Resolved
Jun 05, 2025, 08:32 PM UTC
Duration
57m
Affected: api.aptible.comAptible Deploy
Timeline · 3 updates
  1. investigating Jun 05, 2025, 07:34 PM UTC

    We are investigating an increased error rate in our API which may be causing failed operations

  2. monitoring Jun 05, 2025, 08:03 PM UTC

    The errors have been resolved, however we are still monitoring

  3. resolved Jun 05, 2025, 08:32 PM UTC

    This incident has been resolved.

Read the full incident report →

Minor May 29, 2025

Route53 increased propagation delays

Detected by Pingoru
May 29, 2025, 10:04 PM UTC
Resolved
May 29, 2025, 10:24 PM UTC
Duration
20m
Affected: Aptible Deploy
Timeline · 2 updates
  1. monitoring May 29, 2025, 10:04 PM UTC

    We've noticed that some Operations are failing due to Route53 record changes not propagating within the 10 minute time limit allowed by our platform. Running App and Databases are not impacted, but creation or deletion of Databases or Endpoints, as well as scaling services to/from zero containers may be impacted. We'll continue to monitor the situation and provide updates as we have any additional information to shre.

  2. resolved May 29, 2025, 10:24 PM UTC

    Route 53 record propagation appears to have returned to normal.

Read the full incident report →

Minor April 4, 2025

Delayed Operations in eu-central-1

Detected by Pingoru
Apr 04, 2025, 10:54 PM UTC
Resolved
Apr 04, 2025, 11:26 PM UTC
Duration
31m
Affected: Aptible Deploy
Timeline · 2 updates
  1. identified Apr 04, 2025, 10:54 PM UTC

    We are currently experiencing issues with operations being delayed for stacks hosted in eu-central-1. Our Engineering team is currently working to restore normal functionality.

  2. resolved Apr 04, 2025, 11:26 PM UTC

    This incident has been resolved.

Read the full incident report →

Minor March 26, 2025

Delayed Operations

Detected by Pingoru
Mar 26, 2025, 04:53 PM UTC
Resolved
Mar 26, 2025, 05:10 PM UTC
Duration
17m
Affected: api.aptible.comAptible Deploy
Timeline · 3 updates
  1. investigating Mar 26, 2025, 04:53 PM UTC

    We are currently experiencing issues with operations being delayed. Our Engineering team is currently investigating.

  2. monitoring Mar 26, 2025, 05:02 PM UTC

    A fix has been implemented and we are monitoring the results.

  3. resolved Mar 26, 2025, 05:10 PM UTC

    This incident has been resolved.

Read the full incident report →

Notice February 20, 2025

App and Database operation failures

Detected by Pingoru
Feb 20, 2025, 12:01 AM UTC
Resolved
Feb 20, 2025, 02:37 AM UTC
Duration
2h 35m
Timeline · 2 updates
  1. monitoring Feb 20, 2025, 12:01 AM UTC

    We are experiencing intermittent failures in App and Database operations due to issues with an upstream provider. This issue only affects Apps and Databases with endpoints. Retrying the operation may resolve the issue. We are actively monitoring the situation and will provide updates once the problem is fully resolved.

  2. resolved Feb 20, 2025, 02:37 AM UTC

    This incident has been resolved.

Read the full incident report →

Major February 13, 2025

Operations blocked - Route 53 propagation delays

Detected by Pingoru
Feb 13, 2025, 11:51 PM UTC
Resolved
Feb 14, 2025, 12:48 AM UTC
Duration
57m
Affected: Aptible Deploy
Timeline · 3 updates
  1. identified Feb 13, 2025, 11:51 PM UTC

    We've noticed that some Operations are failing due to Route53 record changes not propagating within the 10 minute time limit allowed by our platform. In order to prevent Apps and Databases DNS records from reaching an inconsistent state, we are temporarily blocking Operations. Performance and reachability of existing Apps and Database is not impacted.

  2. monitoring Feb 14, 2025, 12:26 AM UTC

    We are noticing Route 53 record requests succeeding in a normal time frame, and are lifting the operation block at this time. We'll continue to observe running operations to ensure stability.

  3. resolved Feb 14, 2025, 12:48 AM UTC

    This incident has been resolved.

Read the full incident report →

Minor February 5, 2025

Database provision errors

Detected by Pingoru
Feb 05, 2025, 08:02 PM UTC
Resolved
Feb 05, 2025, 08:13 PM UTC
Duration
10m
Affected: Aptible Deploy
Timeline · 3 updates
  1. identified Feb 05, 2025, 08:02 PM UTC

    We've identified an error blocking the creation of new Databases on the platform, and our team is applying a fix. Reachability of your existing databases, and the ability to scale or restart them is not impacted.

  2. identified Feb 05, 2025, 08:03 PM UTC

    We are continuing to work on a fix for this issue.

  3. resolved Feb 05, 2025, 08:13 PM UTC

    This incident has been resolved.

Read the full incident report →

Major October 22, 2024

Long load balancer registration times

Detected by Pingoru
Oct 22, 2024, 06:27 PM UTC
Resolved
Oct 22, 2024, 06:51 PM UTC
Duration
24m
Affected: api.aptible.com
Timeline · 2 updates
  1. identified Oct 22, 2024, 06:27 PM UTC

    We are experiencing longer than usual Route53 change times, and some operations are unable to Rollback gracefully. In order to prevent resources from reaching a failed state where the DNS is not properly configured, we are blocking creation of new operations on the platform. We will update soon with additional information.

  2. resolved Oct 22, 2024, 06:51 PM UTC

    AWS has indicated that the underlying issue has been resolved, and our monitoring indicates it is safe to run operations again. All inconsistencies impacting customer apps or databases (there were only 4 impacted resources) have been resolved.

Read the full incident report →

Notice October 16, 2024

Limited Availability Incident in shared-us-west-1

Detected by Pingoru
Oct 16, 2024, 03:23 AM UTC
Resolved
Oct 16, 2024, 03:23 AM UTC
Duration
Timeline · 1 update
  1. resolved Oct 16, 2024, 03:23 AM UTC

    On 2024-10-16, between 00:20 and 02:38 UTC, some customer apps and databases in a single shared stack, shared-us-west-1, experienced an availability incident as a result of a problem encountered with planned maintenance. Service has been restored to those affected apps and databases, and this incident is considered resolved at this time.

Read the full incident report →