PortKey Outage History

PortKey had 12 outages in the last 2 years totaling 32h 20m of downtime — averaging 0.5 incidents per month.

There were 12 PortKey outages since May 24, 2024 totaling 32h 20m of downtime. Each is summarised below — incident details, duration, and resolution information.

Source: https://status.portkey.ai

Minor May 8, 2026

SaaS AI Gateway recovered

Detected by Pingoru: May 08, 2026, 10:29 AM UTC
Resolved: May 08, 2026, 10:29 AM UTC
Duration: —

Timeline · 1 update

resolved May 08, 2026, 10:29 AM UTC

SaaS AI Gateway recovered

Read the full incident report →

Minor April 30, 2026

Brief degradation in budget...

Detected by Pingoru: Apr 30, 2026, 06:20 AM UTC
Resolved: Apr 30, 2026, 08:00 AM UTC
Duration: 1h 40m

Affected: Portkey (SaaS AI Gateway)

Timeline · 3 updates

investigating Apr 30, 2026, 06:20 AM UTC

An upstream code change impacted the budget functonality for a few minutes.
investigating Apr 30, 2026, 06:20 AM UTC

An upstream code change caused a brief degradation in Portkey’s budget-related API functionality. During this window, requests depending on budget evaluation or enforcement may have experienced unexpected behavior. Core API routing remained operational, and Logs were not affected. The change has been identified and mitigated. We are continuing to monitor API behavior to ensure full stability.
resolved Apr 30, 2026, 08:00 AM UTC

The brief degradation affecting budget-related API functionality has been resolved. The issue was caused by an upstream code change and affected requests relying on budget evaluation or enforcement. Core API routing remained available throughout, and Logs were not affected. We have mitigated the change, verified recovery, and will continue monitoring to ensure stability.

Read the full incident report →

Minor March 5, 2026

Resolved: Sign-in Issue (Ja...

Detected by Pingoru: Mar 05, 2026, 09:51 AM UTC
Resolved: Mar 05, 2026, 10:45 AM UTC
Duration: 54m

Affected: Portkey (SaaS Control Plane)

Timeline · 2 updates

investigating Mar 05, 2026, 09:51 AM UTC

We had a brief authentication issue on the control plane today. What happened: Between 9:50 AM - 10:45 AM UTC users who had signed out couldn't sign back in using email + password. New signups were also affected during this window. SSO and other auth methods continued to work normally.
resolved Mar 05, 2026, 10:45 AM UTC

Current status: Fully resolved. Email/password auth and signups are working as expected.

Read the full incident report →

Minor January 20, 2026

Dashboard and Control Plane...

Detected by Pingoru: Jan 20, 2026, 08:53 PM UTC
Resolved: Jan 20, 2026, 10:08 PM UTC
Duration: 1h 15m

Affected: Portkey (SaaS Control Plane)

Timeline · 2 updates

investigating Jan 20, 2026, 08:53 PM UTC

We experienced degraded performance affecting the Portkey Dashboard and Prompt Render API between 2:23 AM - 3:38 AM IST on January 22, 2026. Impact: Dashboard timeouts and slow loading for some users. The Prompt Render API experienced intermittent failures. What is NOT affected: The AI Gateway remained fully operational throughout. All inference requests and model calls continued to work normally with no failures.
resolved Jan 20, 2026, 10:08 PM UTC

Dashboard and /prompt/render APIs are up now. Root cause: An unusually high volume of SCIM provisioning updates created unexpected load on our control plane database. Resolution: The issue has been resolved and all services are operating normally. We are implementing additional rate limiting and scaling measures to prevent similar occurrences.

Read the full incident report →

Minor January 15, 2026

Logs Visibility Issue (Jan ...

Detected by Pingoru: Jan 15, 2026, 08:54 AM UTC
Resolved: Jan 16, 2026, 08:30 AM UTC
Duration: 23h 36m

Affected: Portkey (SaaS Control Plane)

Timeline · 2 updates

investigating Jan 15, 2026, 08:54 AM UTC

A code update introduced a new metric that occasionally resulted in failed Clickhouse inserts, causing logs to not appear in the UI.
resolved Jan 16, 2026, 08:30 AM UTC

Logs are working as expected now.

Read the full incident report →

Major November 18, 2025

Service Outage

Detected by Pingoru: Nov 18, 2025, 11:29 AM UTC
Resolved: Nov 18, 2025, 02:00 PM UTC
Duration: 2h 31m

Affected: Portkey (SaaS AI Gateway)Portkey (SaaS Control Plane)

Timeline · 2 updates

investigating Nov 18, 2025, 11:29 AM UTC

Portkey is impacted by the global Cloudflare outage. SaaS Gateway and the control plane are facing intermittent failures. We are actively investigating the issue and working to get the system back up.
resolved Nov 18, 2025, 02:00 PM UTC

Service is back up now.

Read the full incident report →

Minor July 22, 2025

Cache Collision (Now Resolved)

Detected by Pingoru: Jul 22, 2025, 11:32 PM UTC
Resolved: Jul 22, 2025, 11:32 PM UTC
Duration: —

Affected: Portkey (SaaS AI Gateway)

Timeline · 1 update

resolved Jul 22, 2025, 11:32 PM UTC

While working to fix some caching issues yesterday, we inadvertently introduced a bug that caused erroneous cache collisions. This issue may have affected your requests that had cache enabled. If you were using different metadata or cache namespaces, your requests were properly isolated and unaffected from this bug. Please Note: - The collisions did not happen between cache partitions - The collisions did not happen across accounts The issue has been fully reverted as of Jul 23, 2025 3:23 UTC. Our team is actively monitoring the system to ensure stability. We are sorry for this!

Read the full incident report →

Major June 12, 2025

Broad Service Outage

Detected by Pingoru: Jun 12, 2025, 06:20 PM UTC
Resolved: Jun 12, 2025, 08:34 PM UTC
Duration: 2h 14m

Affected: Portkey (SaaS AI Gateway)Portkey (SaaS Control Plane)

Timeline · 2 updates

investigating Jun 12, 2025, 06:20 PM UTC

We wish it was April 1st, but the internet is just not feeling it today. We're experiencing an outage on the UI due to an ongoing outage with GCE and Firebase. The AI gateway requests are also experiencing increased latencies due to cloudflare outages. We're continuously monitoring our services and working to get them back soon.
resolved Jun 12, 2025, 08:34 PM UTC

Latencies have reduced significantly and services seem up. We're continuing to monitor services and requests. Please reach out on Discord or [email protected] if you're facing issues.

Read the full incident report →

Major March 13, 2025

SaaS AI Gateway is down

Detected by Pingoru: Mar 13, 2025, 08:10 PM UTC
Resolved: Mar 13, 2025, 08:10 PM UTC
Duration: 22s

Affected: Portkey (SaaS AI Gateway)

Timeline · 2 updates

investigating Mar 13, 2025, 08:10 PM UTC

API went down.
resolved Mar 13, 2025, 08:10 PM UTC

API recovered.

Read the full incident report →

Major October 26, 2024

API is down

Detected by Pingoru: Oct 26, 2024, 10:48 PM UTC
Resolved: Oct 26, 2024, 10:49 PM UTC
Duration: 26s

Affected: Portkey (SaaS AI Gateway)

Timeline · 2 updates

investigating Oct 26, 2024, 10:48 PM UTC

API went down.
resolved Oct 26, 2024, 10:49 PM UTC

API recovered.

Read the full incident report →

Major July 19, 2024

API services were slow/down

Detected by Pingoru: Jul 19, 2024, 06:18 AM UTC
Resolved: Jul 19, 2024, 06:18 AM UTC
Duration: —

Affected: Portkey (SaaS AI Gateway)Portkey (SaaS Control Plane)

Timeline · 1 update

resolved Jul 19, 2024, 06:18 AM UTC

We deployed an authorization layer update to Portkey which caused our APIs to return 401 to a majority requests. The service came back after 2 minutes of downtime.

Read the full incident report →

Minor May 24, 2024

Logs screen is intermittent...

Detected by Pingoru: May 24, 2024, 09:34 AM UTC
Resolved: May 24, 2024, 09:44 AM UTC
Duration: 10m

Affected: Portkey (SaaS Control Plane)

Timeline · 2 updates

investigating May 24, 2024, 09:34 AM UTC

Writes to our metrics cluster are delayed and the logs screen is intermittently failing. We're investigating the issue.
resolved May 24, 2024, 09:44 AM UTC

We have fixed the logs for now and are looking at the dropped logs. We'll replay them within the next 4 hours.

Read the full incident report →