Elevated Error Rates
Timeline · 1 update
- resolved Apr 22, 2026, 02:47 PM UTC
Some customers may be experienced elevated API error rates. We have resolved the issue and are continuing to monitor the Zep service.
Zep Software had 18 outages in the last 2 years totaling 26h 31m of downtime — averaging 0.7 incidents per month.
There were 18 Zep Software outages since August 1, 2025 totaling 26h 31m of downtime. Each is summarised below — incident details, duration, and resolution information.
Some customers may be experienced elevated API error rates. We have resolved the issue and are continuing to monitor the Zep service.
We are currently investigating this issue.
Resolved and monitoring.
Elevated API error rates for some operations occurred over a 3 minute period. We've identified the issue and resolved it. We continue to monitor the service for stability.
We are currently investigating this issue.
A fix has been implemented and we are monitoring the results.
This incident has been resolved.
We're investigating elevated API error rates.
We have identified the issue and implementing mitigations.
Mitigated. We're monitoring.
Service is nominal. We're closing this issue.
We are currently investigating this issue.
This incident has been resolved.
We are currently investigating this issue.
A fix has been deployed and error rates have dropped to nominal. We'll continue monitoring the service for the next while.
This incident has been resolved.
We are currently investigating this issue.
This incident has been resolved.
We are currently investigating this issue.
The issue has been identified and a fix is being implemented.
This incident has been resolved.
Some customers may be experiencing delayed episode ingestion. We’ve identified the issue and are working on a fix.
The issue has been fixed. Queues are draining and they may take an hour or two to clear. We're continuing to monitor.
We've caught up on ingestion queues. This issue has been resolved.
UPDATE: This incident has been resolved. Zep context retrieval is currently impacted by an outage at our upstream search provider. We are in contact with the provider and they're rolling out a fix to address the outage.
The issue has been identified and a fix is being implemented.
This incident has been resolved.
We're currently experiencing sporadic latency spikes on our retrieval endpoints. We're working with an upstream provider to mitigate this issue.
This incident has been resolved.
We are currently investigating this issue.
This incident has been resolved.
The issue has been identified and a fix is being implemented.
We are continuing to work on a fix for this issue.
This incident has been resolved.
The issue has been identified and a fix is being implemented.
This incident has been resolved.
We are currently investigating this issue.
This incident has been resolved.
We experienced a temporary disruption to our memory service due to a database schema migration. Customers may have encountered errors when adding messages and graph data to Zep. The issue has been resolved and all data sent to Zep has been processed successfully.