Retell AI Outage History

Retell AI had 44 outages in the last 2 years totaling 24h 48m of downtime — averaging 1.8 incidents per month.

There were 44 Retell AI outages since March 14, 2025 totaling 24h 48m of downtime. Each is summarised below — incident details, duration, and resolution information.

Source: https://status.retellai.com

Minor June 12, 2026

Platform voice Grace and Nico not working

Detected by Pingoru: Jun 12, 2026, 07:30 AM UTC
Resolved: Jun 12, 2026, 04:44 PM UTC
Duration: 9h 13m

Affected: Web CallPhone Call

Timeline · 2 updates

investigating Jun 12, 2026, 04:31 PM UTC

The platform voice, Grace and Nico, are not working
resolved Jun 12, 2026, 04:44 PM UTC

The issue is resolved now

Read the full incident report →

Minor May 28, 2026

Certain calls are not being picked up in time

Detected by Pingoru: May 28, 2026, 03:00 PM UTC
Resolved: May 28, 2026, 04:30 PM UTC
Duration: 1h 29m

Affected: Web CallPhone Call

Timeline · 2 updates

investigating May 28, 2026, 04:37 PM UTC

Many calls are experiencing high latency, no pickup, or being dropped due to web socket connection issues. We are investigating now.
resolved May 28, 2026, 05:52 PM UTC

This incident has been resolved. It was due to a connection issue in one of the data centers for the telephony provider. We are adding more fallback routes there.

Read the full incident report →

Major May 25, 2026

Calls returning concurrency_limit_reached

Detected by Pingoru: May 25, 2026, 11:00 PM UTC
Resolved: May 25, 2026, 11:41 PM UTC
Duration: 40m

Affected: Web CallPhone Call

Timeline · 1 update

investigating May 25, 2026, 11:00 PM UTC

Due to an error in the code, the concurrency of orgs are incorrectly calculated and returning concurrency_limit_reached for many calls.

Read the full incident report →

Major May 25, 2026

Call/chat history, analytics and QA dashboard down

Detected by Pingoru: May 25, 2026, 09:09 PM UTC
Resolved: May 25, 2026, 09:40 PM UTC
Duration: 30m

Affected: AlertingAnalyticsDashboardQA

Timeline · 5 updates

investigating May 25, 2026, 09:09 PM UTC

Due to a database outage, call/chat history, analytics and QA dashboard down are currently completely down. We are working with the database team to fix the issue.
investigating May 25, 2026, 09:12 PM UTC

We are continuing to investigate this issue.
investigating May 25, 2026, 09:13 PM UTC

We are continuing to investigate this issue.
identified May 25, 2026, 09:33 PM UTC

We are rolling out a fix to direct the traffic to our database replica while the provider is investigating the issue. Note that no call or chat data were lost.
resolved May 25, 2026, 09:40 PM UTC

The issue has been fixed and all services are back to working.

Read the full incident report →

Major May 8, 2026

Agent reading and publishing broken

Detected by Pingoru: May 08, 2026, 03:00 AM UTC
Resolved: May 08, 2026, 03:00 AM UTC
Duration: —

Timeline · 1 update

resolved May 08, 2026, 04:50 PM UTC

agent publishing was broken for ~1 hour - returned server error. Calling was not affected

Read the full incident report →

Minor April 13, 2026

KB Retrieval and QA Failure

Detected by Pingoru: Apr 13, 2026, 11:47 PM UTC
Resolved: Apr 14, 2026, 12:15 AM UTC
Duration: 28m

Affected: QAKnowledge Base

Timeline · 2 updates

monitoring Apr 13, 2026, 11:47 PM UTC

The KB retrieval and QA experienced an issue due to an unintended code change. The issue has been resolved, and we are actively monitoring the systems to ensure continued stability.
resolved Apr 14, 2026, 12:15 AM UTC

This incident has been resolved.

Read the full incident report →

Minor March 16, 2026

Batch calls not sent

Detected by Pingoru: Mar 16, 2026, 06:41 PM UTC
Resolved: Mar 16, 2026, 07:21 PM UTC
Duration: 40m

Affected: Phone Call

Timeline · 3 updates

investigating Mar 16, 2026, 06:41 PM UTC

Starting ~13:32 UTC (8:32 AM PT), batch calls were not being sent due to an internal issue that was triggered in our system. We've identified the root cause and are actively working on a fix.
monitoring Mar 16, 2026, 07:14 PM UTC

A fix is being deployed and we are monitoring to ensure recovery. As a side effect, batch calls that were missed are being marked as "sent".
resolved Mar 16, 2026, 07:21 PM UTC

This incident has been resolved.

Read the full incident report →

Minor March 16, 2026

Custom LLM calls failing to start

Detected by Pingoru: Mar 16, 2026, 04:39 AM UTC
Resolved: Mar 16, 2026, 04:54 AM UTC
Duration: 14m

Affected: Web CallPhone Call

Timeline · 2 updates

identified Mar 16, 2026, 04:39 AM UTC

19:38 PST: custom LLM calls started failing to start due to a bad release. Other types of calls are not impacted. 21:35 PST: a fix is being rolled out, we are keep monitoring the issue.
resolved Mar 16, 2026, 04:54 AM UTC

This incident has been resolved.

Read the full incident report →

Notice March 4, 2026

Platform/MiniMax TTS issue

Detected by Pingoru: Mar 04, 2026, 10:34 PM UTC
Resolved: Mar 04, 2026, 10:34 PM UTC
Duration: —

Affected: Web CallPhone Call

Timeline · 1 update

resolved Mar 04, 2026, 10:34 PM UTC

Between 1:25–2:08pm PT, a provider-side rate-limit configuration issue caused some TTS requests to be silently dropped. This issue has been resolved, and we are adding detection to make sure a fallback will be triggered if this happens again.

Read the full incident report →

Notice March 4, 2026

Small portion of community voice not working

Detected by Pingoru: Mar 04, 2026, 06:21 PM UTC
Resolved: Mar 04, 2026, 02:30 PM UTC
Duration: —

Timeline · 1 update

resolved Mar 04, 2026, 06:21 PM UTC

There was an accidental change in the logic of account cleanup that led to the deletion of small portion of community voice resources, which caused those agents with the voice having issues. We have fixed the logic, and have been running a backfill to detect and add back those resources.

Read the full incident report →

Minor February 20, 2026

Telynx - SIP Trunking Service Degradation

Detected by Pingoru: Feb 20, 2026, 09:43 PM UTC
Resolved: Feb 21, 2026, 12:00 AM UTC
Duration: 2h 16m

Affected: Phone Call

Timeline · 2 updates

identified Feb 20, 2026, 09:43 PM UTC

We are currently experiencing service degradation affecting SIP trunking through our upstream provider, Telnyx. This may result in call connection issues. https://status.telnyx.com/
resolved Feb 21, 2026, 02:01 AM UTC

This incident has been resolved.

Read the full incident report →

Major February 10, 2026

Calls failing to start and concurrency stuck

Detected by Pingoru: Feb 10, 2026, 10:12 PM UTC
Resolved: Feb 10, 2026, 10:18 PM UTC
Duration: 5m

Affected: Phone Call

Timeline · 2 updates

identified Feb 10, 2026, 10:12 PM UTC

There were call initialization issues between 1:35pm and 1:43pm PST. Calls are back online now, but concurrency for some users may be stuck. We are working on a fix.
resolved Feb 10, 2026, 10:18 PM UTC

Calls and concurrency should be back to normal.

Read the full incident report →

Minor January 27, 2026

Scheduled batch calls not running

Detected by Pingoru: Jan 27, 2026, 10:30 PM UTC
Resolved: Jan 27, 2026, 10:30 PM UTC
Duration: —

Timeline · 1 update

resolved Jan 28, 2026, 02:52 AM UTC

Scheduled batch calls did not run from 2:35 PM to 5:15 PM PST. The service has now recovered. Any batch calls whose scheduling window still included 5:15 PM were executed at that time. Batch calls whose scheduling window was fully missed were not executed.

Read the full incident report →

Major January 22, 2026

Multilingual transcription issues

Detected by Pingoru: Jan 22, 2026, 11:37 PM UTC
Resolved: Jan 22, 2026, 11:48 PM UTC
Duration: 11m

Affected: Web CallPhone Call

Timeline · 2 updates

investigating Jan 22, 2026, 11:37 PM UTC

We're aware of issues affecting multilingual transcription and are actively investigating.
resolved Jan 22, 2026, 11:48 PM UTC

The issue appears to be resolved and multilingual transcription is recovering. We'll keep monitoring to ensure stability.

Read the full incident report →

Major January 9, 2026

Outbound Calling Service Degradation

Detected by Pingoru: Jan 09, 2026, 08:30 AM UTC
Resolved: Jan 09, 2026, 10:26 AM UTC
Duration: 1h 56m

Affected: Phone Call

Timeline · 4 updates

investigating Jan 09, 2026, 09:26 AM UTC

Outbound calls are currently experiencing issues. Some users report receiving multiple calls from a single dial attempt, or agents being unable to hear audio. We are investigating the root cause.
identified Jan 09, 2026, 09:58 AM UTC

The root cause has been traced to our telephony stack provider. The issue has been escalated to their support team, who are actively triaging the incident.
monitoring Jan 09, 2026, 10:17 AM UTC

The issue should have been resolved, we are closely monitoring the status.
resolved Jan 09, 2026, 10:26 AM UTC

The issue has been fixed

Read the full incident report →

Major December 30, 2025

ASR configuration bug

Detected by Pingoru: Dec 30, 2025, 09:20 PM UTC
Resolved: Dec 30, 2025, 10:04 PM UTC
Duration: 44m

Timeline · 4 updates

investigating Dec 30, 2025, 09:20 PM UTC

A recent code change introduced an issue in the ASR module that may cause some agents to go offline. We are working on a fix actively.
investigating Dec 30, 2025, 09:20 PM UTC

We are continuing to investigate this issue.
monitoring Dec 30, 2025, 10:00 PM UTC

The fix is being pushed to the production, we are observing and monitoring the current system status.
resolved Dec 30, 2025, 10:04 PM UTC

This incident has been resolved.

Read the full incident report →

Major December 16, 2025

Call and API failures due to Stripe outage

Detected by Pingoru: Dec 16, 2025, 03:10 PM UTC
Resolved: Dec 16, 2025, 03:40 PM UTC
Duration: 30m

Affected: APIWeb CallPhone Call

Timeline · 2 updates

monitoring Dec 16, 2025, 04:59 PM UTC

For around 30 minutes, calls and API requests failed due to an issue with Stripe. The outage occurred between 7:10 and 7:40 PST. This has been identified as an outage from our payment provider (Stripe). We are continuing to monitor the situation.
resolved Dec 16, 2025, 07:56 PM UTC

This incident has been resolved.

Read the full incident report →

Notice December 10, 2025

Transcription error for non English traffic

Detected by Pingoru: Dec 10, 2025, 08:41 PM UTC
Resolved: Dec 10, 2025, 07:30 PM UTC
Duration: —

Timeline · 1 update

resolved Dec 10, 2025, 08:41 PM UTC

For around 20 minutes, there was a spike of transcription error for certain non English traffic (notably multilingual, Spanish, etc). English traffic was not impacted. This has been identified as outages from underlying providers. We are working on adding more fallback routes to improve the stability of the platform.

Read the full incident report →

Minor November 26, 2025

Telnyx calls encountering telephony provider permission denied

Detected by Pingoru: Nov 26, 2025, 06:48 PM UTC
Resolved: Nov 26, 2025, 06:52 PM UTC
Duration: 3m

Affected: Phone Call

Timeline · 2 updates

investigating Nov 26, 2025, 06:48 PM UTC

We have observed that calls through retell Telnyx numbers are having issues.
resolved Nov 26, 2025, 06:52 PM UTC

This has been resolved now. This is related to an elastic IP issue, which is mitigated now.

Read the full incident report →

Notice October 28, 2025

Intermittent Call Connection Failures

Detected by Pingoru: Oct 28, 2025, 07:28 PM UTC
Resolved: Oct 28, 2025, 07:28 PM UTC
Duration: —

Affected: Phone Call

Timeline · 1 update

resolved Oct 28, 2025, 07:28 PM UTC

From 11:10 AM – 11:34 AM PST, we observed that inbound calls were experiencing connection issues and timeouts. We have identified the issue as being caused by a brief outage with our telephony provider. The issue has since been resolved and all services are operating normally.

Read the full incident report →

Minor October 20, 2025

AWS outage causing some login issues, and history and analytics issue. Calls are NOT impacted.

Detected by Pingoru: Oct 20, 2025, 07:02 PM UTC
Resolved: Oct 21, 2025, 12:42 AM UTC
Duration: 5h 40m

Affected: Dashboard

Timeline · 3 updates

identified Oct 20, 2025, 07:02 PM UTC

Starting from Oct 20 1am PST, the AWS outage has caused some login issues, and call history and analytics issues. Regular calls are NOT impacted. Once the AWS outage is over, we will backfill the analytics.
monitoring Oct 20, 2025, 11:51 PM UTC

The AWS outage has been resolved. We are going to backfill analytics, and keep a close eye on it.
resolved Oct 21, 2025, 12:42 AM UTC

This incident has been resolved.

Read the full incident report →

Notice October 9, 2025

Connection issues with calls

Detected by Pingoru: Oct 09, 2025, 09:40 PM UTC
Resolved: Oct 09, 2025, 09:40 PM UTC
Duration: —

Affected: Web CallPhone Call

Timeline · 1 update

resolved Oct 09, 2025, 09:40 PM UTC

From 1:14pm - 1:47PM PST, we observed that some calls were running into connection issues, and experience timeouts on operations. We identified the issue to be AWS blocking the auto scaling of the instances, causing our call servers got overloaded for a while. We have been working with AWS team on identifying the root cause to ensure this gets fixed. This issue has been resolved now.

Read the full incident report →

Notice October 2, 2025

7% calls got abnormally high latency

Detected by Pingoru: Oct 02, 2025, 08:27 PM UTC
Resolved: Oct 02, 2025, 01:30 PM UTC
Duration: —

Timeline · 1 update

resolved Oct 02, 2025, 08:27 PM UTC

Incident range: 6:30am - 11am PST impact: around 7% of the calls are having abnormally high latency post mortem: Some AWS automatic patches to our transcription clusters caused the container to lost GPU access, and used CPU for transcription, causing extra long latency there. Most calls are routed to the backup endpoint which was working fine, but around 7% did not trigger the fallback there. We are updating the containers to ensure it does not get impacted with the automatic patches.

Read the full incident report →

Minor August 26, 2025

Temporary Outbound Call Issue

Detected by Pingoru: Aug 26, 2025, 05:26 PM UTC
Resolved: Aug 26, 2025, 05:26 PM UTC
Duration: —

Timeline · 1 update

resolved Aug 26, 2025, 05:26 PM UTC

We experienced a temporary disruption where some outbound calls did not connect as expected. This was due to a telephony sip server provider issue related to elevated system load. The issue was transient and lasted from 8:00 AM to 9:30 AM PDT. The SIP server provider implemented a fix, and inbound calls have been operating normally since. We are going to roll out locally hosted SIP stack to boost reliability going onwards.

Read the full incident report →

Notice August 25, 2025

Concurrency values were not resetting as expected.

Detected by Pingoru: Aug 25, 2025, 07:15 PM UTC
Resolved: Aug 25, 2025, 07:15 PM UTC
Duration: —

Timeline · 1 update

resolved Aug 25, 2025, 07:15 PM UTC

Starting from 10:25am to 11:23am, for some customers concurrency was not resetting correctly, causing some calls to fail to connect. We are mitigating the issue by manually resetting concurrency, so the dashboard may not reflect the actual current value. We identified the root cause to be a case where audio file manipulation led to a full disk usage under a corner case scenario, and a fix is deployed.

Read the full incident report →