- Detected by Pingoru
- Jun 16, 2026, 09:49 AM UTC
- Resolved
- Jun 16, 2026, 08:42 PM UTC
- Duration
- 10h 53m
Affected: India - Egress
Timeline · 2 updates
-
monitoring Jun 16, 2026, 09:49 AM UTC
We identified an issue in the India region where a small number of active sessions may have ended earlier than expected. Affected sessions could experience egress recordings stopping mid-session. We've deployed a mitigation and are monitoring to confirm resolution. Other regions are unaffected.
-
resolved Jun 16, 2026, 08:42 PM UTC
We have not seen further instances of this issue since the mitigation was applied. We will follow up with a postmortem as soon as possible.
Read the full incident report →
- Detected by Pingoru
- Jun 11, 2026, 05:52 PM UTC
- Resolved
- Jun 11, 2026, 11:47 PM UTC
- Duration
- 5h 54m
Affected: Global Inference
Timeline · 3 updates
-
investigating Jun 11, 2026, 05:52 PM UTC
We are currently investigating intermittent 429 errors for Google Gemini models routed through LiveKit Inference.
-
identified Jun 11, 2026, 07:09 PM UTC
We have identified the cause as an upstream limit on our Google Gemini API account. Automatic failover to an alternate Gemini deployment is in place and has reduced the impact, but a subset of requests to Google Gemini models may still intermittently return errors when failover capacity is exceeded. We are actively engaged with Google support to restore full capacity and will share further updates as we have them. Other models and providers remain unaffected.
-
resolved Jun 11, 2026, 11:47 PM UTC
This incident has been resolved. The errors were caused by an upstream issue at our model provider (Google) that incorrectly triggered an account-level usage cap on our Gemini API access, returning rate-limit errors for a subset of Gemini requests routed through LiveKit Inference. Google identified the cause, rolled back the change on their side, and raised our account limits to prevent recurrence. Gemini requests are now serving normally and error rates have returned to baseline. Other models and providers were unaffected throughout.
Read the full incident report →
- Detected by Pingoru
- Jun 11, 2026, 03:30 PM UTC
- Resolved
- Jun 11, 2026, 03:30 PM UTC
- Duration
- —
Timeline · 1 update
-
resolved Jun 11, 2026, 03:50 PM UTC
Between 13:00 and 15:05 UTC, a subset of requests to certain Google Gemini preview models (gemini-3.1-flash-lite and gemini-3-flash-preview) routed through LiveKit Inference returned errors due to a project-level misconfiguration. These preview models did not yet have model failover configured; all other models and providers were unaffected. This was resolved by routing the traffic via an alternate provider and fixing the misconfiguration. As resolution measures, we are extending model-level failover to cover these models and adding increased monitoring for such failures in the future.
Read the full incident report →
- Detected by Pingoru
- Jun 11, 2026, 01:00 PM UTC
- Resolved
- Jun 11, 2026, 01:00 PM UTC
- Duration
- —
Timeline · 1 update
-
resolved Jun 17, 2026, 09:53 PM UTC
On 2026-06-11 at 13:00 UTC, a serialization bug was introduced via frontend components which rendered some services unavailable via the dashboard for some users. These services included SIP trunk and dispatch rule creation/management as well as several advanced features in Agent Builder. A fix was pushed on 2026-06-17 at 17:43 UTC which resolved the issue.
Read the full incident report →
- Detected by Pingoru
- Jun 05, 2026, 09:06 AM UTC
- Resolved
- Jun 05, 2026, 11:08 AM UTC
- Duration
- 2h 1m
Affected: India - SIPIndia - Ingress
Timeline · 3 updates
-
investigating Jun 05, 2026, 09:06 AM UTC
We're investigating elevated latency affecting Ingress and SIP services in our Hyderabad region.
-
investigating Jun 05, 2026, 10:05 AM UTC
As of 09:21 UTC, we've applied a mitigation by routing traffic away from the Hyderabad region. Failure rates are returning to normal levels. We're continuing to monitor the issue.
-
resolved Jun 05, 2026, 11:08 AM UTC
We identified a connectivity problem in the Hyderabad cluster. As mitigation, we've drained the cluster to route traffic away from it. Mumbai traffic is unaffected and serving normally. We'll undrain Hyderabad once we're confident it's functioning normally.
Read the full incident report →
- Detected by Pingoru
- Jun 03, 2026, 08:00 PM UTC
- Resolved
- Jun 03, 2026, 08:00 PM UTC
- Duration
- —
Timeline · 1 update
Read the full incident report →
- Detected by Pingoru
- Jun 02, 2026, 02:05 PM UTC
- Resolved
- Jun 02, 2026, 03:18 PM UTC
- Duration
- 1h 12m
Affected: US Central - SIP
Timeline · 5 updates
Read the full incident report →
- Detected by Pingoru
- May 28, 2026, 02:53 PM UTC
- Resolved
- May 28, 2026, 08:13 PM UTC
- Duration
- 5h 20m
Affected: Global Real Time CommunicationUS East - Real Time Communication
Timeline · 7 updates
Read the full incident report →
- Detected by Pingoru
- May 26, 2026, 07:53 PM UTC
- Resolved
- May 26, 2026, 09:32 PM UTC
- Duration
- 1h 38m
Affected: Cloud Dashboard (cloud.livekit.io)
Timeline · 4 updates
-
investigating May 26, 2026, 07:22 PM UTC
We are investigating reports of Agent Observability not accessible for some sessions.
-
identified May 26, 2026, 07:53 PM UTC
We have identified an issue causing partial data unavailability in the LiveKit Cloud Dashboard: For some users, Agent Observability is temporarily unavailable for some sessions. Some sessions in the 24-hour filter are showing as Active even though they have ended. A fix has been identified and we are backfilling the missing data.
-
monitoring May 26, 2026, 08:13 PM UTC
A fix has been implemented and we are seeing no more data outage for newly created sessions. We are monitoring the fix and are currently working on backfilling the missing data.
-
resolved May 26, 2026, 09:32 PM UTC
All missing data has now been backfilled and the incident is resolved.
Read the full incident report →
- Detected by Pingoru
- May 22, 2026, 08:26 PM UTC
- Resolved
- May 23, 2026, 04:36 PM UTC
- Duration
- 20h 10m
Timeline · 3 updates
-
investigating May 22, 2026, 08:26 PM UTC
We have observed network instability impacting cross-region traffic from Frankfurt beginning at 18:15 UTC. Traffic has been redirected from Frankfurt as of 19:55 UTC. Participants originally connecting via Frankfurt may see slightly increased latency until the region is restored.
-
monitoring May 23, 2026, 12:17 AM UTC
We have re-enabled our Frankfurt cluster and are monitoring for further disruptions.
-
resolved May 23, 2026, 04:36 PM UTC
We have been monitoring the clusters for the past 16 hours and have not found any further disruptions.
Read the full incident report →
- Detected by Pingoru
- May 15, 2026, 04:57 PM UTC
- Resolved
- May 16, 2026, 02:50 AM UTC
- Duration
- 9h 53m
Timeline · 4 updates
Read the full incident report →
- Detected by Pingoru
- May 07, 2026, 09:15 PM UTC
- Resolved
- May 07, 2026, 09:15 PM UTC
- Duration
- —
Timeline · 1 update
-
resolved May 08, 2026, 08:25 PM UTC
Between ~17:16 and 17:21 UTC on May 7, 2026, inter-region connectivity between our US East region and other regions was briefly degraded due to a network connectivity issue in our underlying cloud provider's data center. During this window, sessions that relied on cross-region track relay through US East may have experienced brief connection failures or reconnections for some participants. Our system recovered automatically once the cloud provider's data center network connectivity was restored.
Read the full incident report →
- Detected by Pingoru
- May 06, 2026, 07:46 PM UTC
- Resolved
- May 06, 2026, 11:04 PM UTC
- Duration
- 3h 17m
Affected: Global SIP
Timeline · 5 updates
Read the full incident report →
- Detected by Pingoru
- May 06, 2026, 04:01 AM UTC
- Resolved
- May 06, 2026, 09:41 AM UTC
- Duration
- 5h 39m
Affected: US East - Cloud Agents
Timeline · 9 updates
Read the full incident report →
- Detected by Pingoru
- Apr 24, 2026, 06:12 PM UTC
- Resolved
- Apr 29, 2026, 02:41 PM UTC
- Duration
- 4d 20h
Affected: India - Real Time CommunicationAustralia - Real Time CommunicationSingapore - Real Time CommunicationUS Central - Real Time CommunicationSouth Africa - Real Time CommunicationUS East - Real Time Communication
Timeline · 3 updates
-
investigating Apr 24, 2026, 06:12 PM UTC
We received a single report of CreateSIPParticipant timeouts in Singapore. While investigating this, we discovered an increased rate of signalling connection errors on a very small minority of requests, mainly in India. We haven’t received any other reports, but are proactively creating this incident in case users encounter increased connection latency. We are continuing to investigate and will update here once we know more.
-
identified Apr 24, 2026, 11:20 PM UTC
A fix is being implemented. We still have not received any further reports and believe the impact to be minor, but we're continuing to monitor for further issues.
-
resolved Apr 29, 2026, 02:41 PM UTC
We are closing this incident as we have not received further reports and the signalling error rate has dropped back to baseline. We made some fine-tuning adjustments to our stack to improve performance and will continue to seek out opportunities to keep latency low.
Read the full incident report →
- Detected by Pingoru
- Apr 22, 2026, 01:00 PM UTC
- Resolved
- Apr 22, 2026, 01:00 PM UTC
- Duration
- —
Timeline · 1 update
Read the full incident report →
- Detected by Pingoru
- Apr 15, 2026, 11:23 PM UTC
- Resolved
- Apr 16, 2026, 04:09 AM UTC
- Duration
- 4h 45m
Affected: Global Real Time Communication
Timeline · 8 updates
Read the full incident report →
- Detected by Pingoru
- Apr 08, 2026, 04:18 PM UTC
- Resolved
- Apr 08, 2026, 04:18 PM UTC
- Duration
- —
Affected: Cloud Dashboard (cloud.livekit.io)
Timeline · 1 update
-
resolved Apr 08, 2026, 04:18 PM UTC
We have identified a bug where LiveKit Cloud projects created after April 1 at 20:41 UTC which enabled Agent Insights encountered a bug where their observability data (recordings, traces, and agent logs) was not saved correctly. We have resolved the issue on April 8 at 03:19 UTC and Agent Insights should be fully operational for all new and existing projects. No action is required from impacted users. We will follow up with a postmortem as soon as possible.
Read the full incident report →
- Detected by Pingoru
- Apr 03, 2026, 05:13 PM UTC
- Resolved
- Apr 03, 2026, 05:23 PM UTC
- Duration
- 9m
Affected: Cloud Dashboard (cloud.livekit.io)
Timeline · 3 updates
Read the full incident report →
- Detected by Pingoru
- Apr 01, 2026, 10:37 AM UTC
- Resolved
- Apr 01, 2026, 01:01 PM UTC
- Duration
- 2h 23m
Affected: Europe Central - TURNEurope Central - Real Time Communication
Timeline · 2 updates
-
monitoring Apr 01, 2026, 10:37 AM UTC
We identified degraded connection errors affecting a small percentage of requests to Real Time Communication and TURN services in our EU (Frankfurt) region. Automatic retries prevented applications from experiencing failures. A fix has been implemented and we are actively monitoring to confirm stability.
-
resolved Apr 01, 2026, 01:01 PM UTC
This incident has been resolved.
Read the full incident report →
- Detected by Pingoru
- Mar 30, 2026, 11:26 AM UTC
- Resolved
- Mar 30, 2026, 09:01 PM UTC
- Duration
- 9h 34m
Affected: Cloud Dashboard (cloud.livekit.io)
Timeline · 6 updates
-
investigating Mar 30, 2026, 11:26 AM UTC
We are currently investigating the issue. Only dashboard updates are affected (real time communication is not affected).
-
identified Mar 30, 2026, 03:01 PM UTC
The issue has been identified. We are working on a fix.
-
identified Mar 30, 2026, 06:46 PM UTC
A fix has been deployed. We expect the service to be restored shortly and will provide another update once it is back online.
-
monitoring Mar 30, 2026, 08:12 PM UTC
The fix has been applied and real-time processing for the Cloud Dashboard is back online globally. We are monitoring the pipeline.
-
monitoring Mar 30, 2026, 08:55 PM UTC
We have initiated the process of backfilling delayed analytics data.
-
resolved Mar 30, 2026, 09:01 PM UTC
Real-time analytics updates on Cloud Dashboard is now fully functional, and delayed data are in the process of being backfilled.
Read the full incident report →
- Detected by Pingoru
- Mar 30, 2026, 09:09 AM UTC
- Resolved
- Mar 30, 2026, 09:45 AM UTC
- Duration
- 36m
Affected: Cloud Dashboard (cloud.livekit.io)
Timeline · 4 updates
-
investigating Mar 30, 2026, 09:09 AM UTC
Users may be unable to access the dashboard. Our team is actively investigating.
-
investigating Mar 30, 2026, 09:09 AM UTC
We are continuing to investigate this issue.
-
monitoring Mar 30, 2026, 09:20 AM UTC
A fix has been implemented and we are monitoring the result
-
resolved Mar 30, 2026, 09:45 AM UTC
This incident has been resolved.
Read the full incident report →
- Detected by Pingoru
- Mar 23, 2026, 06:00 PM UTC
- Resolved
- Mar 23, 2026, 06:00 PM UTC
- Duration
- —
Timeline · 1 update
-
resolved Mar 24, 2026, 01:17 AM UTC
Between 15:00–17:25 UTC, a subset of SIP outbound calls in the US East region experienced failures due to an internal DNS resolution issue on a single host. Approximately 1.22% of calls within the region were affected. The affected host was taken out of rotation and workloads were redistributed. Call failure rates returned to baseline following remediation.
Read the full incident report →
- Detected by Pingoru
- Mar 18, 2026, 06:30 PM UTC
- Resolved
- Mar 18, 2026, 06:30 PM UTC
- Duration
- —
Timeline · 1 update
-
resolved Mar 18, 2026, 09:04 PM UTC
A dependency update in our latest docs deployment caused all requests to the MCP server at docs.livekit.io/mcp/ to return 500 errors. The issue began at 18:50 UTC and ended at 19:33 UTC. During this roughly 45 minute window, MCP clients - including IDE integrations and the LiveKit CLI - were unable to connect to the docs MCP server. The issue was caused by a package upgrade that introduced dependencies incompatible with our serverless hosting environment. We rolled back the change and confirmed the endpoint was fully operational at 19:33 UTC. No other services were affected.
Read the full incident report →
- Detected by Pingoru
- Mar 18, 2026, 11:25 AM UTC
- Resolved
- Mar 18, 2026, 11:25 AM UTC
- Duration
- —
Timeline · 1 update
-
resolved Mar 18, 2026, 11:25 AM UTC
We experienced degraded networking affecting core real time communication services in Mumbai region. The issue began at 06:06 UTC and was resolved by 06:22 UTC on March 18, 2026. All services are now restored and we are continuing to monitor.
Read the full incident report →