Okta experienced a major incident on November 17, 2025 affecting okta.com cell 14 and Core Platform, lasting 7d 5h. The incident has been resolved; the full update timeline is below.
Affected components
Update timeline
- resolved Nov 17, 2025, 06:34 PM UTC
We detected a service interruption that impacted all users for approximately 10 minutes at 10:34 AM PT. Users may have encountered intermittent delays, connection errors, or 503/504 error messages. The issue is no longer active. We are monitoring for residual effects and will provide an update in the next 30 minutes.
- resolved Nov 17, 2025, 07:44 PM UTC
After monitoring for an extended period, we confirmed that the issue has not reoccurred. A root cause analysis (RCA) will be posted here within five business days. We sincerely apologize for the impact this had on you and your users.
- resolved Nov 25, 2025, 12:21 AM UTC
We sincerely apologize for any impact this incident has caused to you, your business, and your customers. At Okta, trust and transparency are our top priorities. Outlined below are the facts regarding this incident. We are committed to implementing improvements to the service to prevent future occurrences of this incident. Detection and Impact: On November 17th at 10:35 AM (PT), Okta received alerts regarding resource access errors in US Cell 14. During this time, customers in US Cell 14 experienced errors, connection timeouts, and slow responses while attempting to access resources. This incident was isolated to their cell, and normal processing levels were restored at 10:45 AM (PT). Root Cause Summary: Okta determined the root cause was a substantial increase in request traffic volume, which resulted in errors accessing the cell. Remediation Steps: The system automatically initiated service protection within 1 minute of a traffic increase, followed by additional resource scaling actions. These automated actions addressed the issue, and the team confirmed that response times and rates had fully returned to normal by 10:45 AM(PT). Incident response teams continued to review and analyze the incident, applying additional service protection configurations to improve service resiliency immediately after the service was confirmed to be stable. Preventative Actions: In order to prevent similar incidents from recurring, Okta is working in collaboration with our service provider to explore opportunities for enhanced scalability and faster mitigation action thresholds where necessary. We are fully committed to improving our infrastructure to ensure the reliability of our services. Our priority remains the continuous optimization of our systems to safeguard against future events. Duration (# of minutes): 11 minutes Actual Time: 10:34 AM - 10:45 AM PST