Arista CloudVision Outage History

Arista CloudVision has had 39 outages since December 2, 2024, totaling 28h 38m of downtime. Over the last 2 years that averages 1.6 incidents per month. Each outage is summarised below with incident details, duration, and resolution information.
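The headline figures can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the "last 2 years" window means 24 months; the helper name `parse_downtime` is hypothetical and only handles the `Xh Ym` duration format used on this page.

```python
import re


def parse_downtime(text: str) -> int:
    """Convert a duration such as '28h 38m' or '22m' into whole minutes."""
    match = re.fullmatch(r"(?:(\d+)h)?\s*(?:(\d+)m)?", text.strip())
    if not match or not any(match.groups()):
        raise ValueError(f"unrecognised duration: {text!r}")
    hours, minutes = (int(g) if g else 0 for g in match.groups())
    return hours * 60 + minutes


incidents = 39
window_months = 24  # assumption: "last 2 years" is treated as 24 months
total_minutes = parse_downtime("28h 38m")

per_month = incidents / window_months
avg_minutes = total_minutes / incidents

print(f"{per_month:.1f} incidents per month")
print(f"{avg_minutes:.0f} minutes of downtime per incident on average")
```

The per-incident average is only a rough figure, since several entries below are listed without a duration.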

Source: https://status.arista.io

Notice June 16, 2025

[Informational] CloudVision as-a-Service will no longer officially support devices running EOS 4.23, 4.24, or 4.25 trains starting January 1st, 2026

Detected by Pingoru: Jun 16, 2025, 05:03 PM UTC
Resolved: Jun 16, 2025, 04:59 PM UTC
Duration: —

Timeline · 1 update

resolved Jun 16, 2025, 05:03 PM UTC

CloudVision as-a-Service will no longer officially support devices running EOS 4.23, 4.24, or 4.25 trains starting January 1st, 2026. This is in line with our CloudVision Lifecycle Policy (https://www.arista.com/en/support/product-documentation/cloudvision-life-cycle-policy). We won't remove existing support, but new CloudVision releases won't be tested with these older EOS trains. While most CloudVision features will likely continue to function, any support for these older trains will be on a best-effort basis. We strongly recommend upgrading your EOS devices via CloudVision to a supported release.

Notice June 12, 2025

Service disruption on all CVaaS regions

Detected by Pingoru: Jun 12, 2025, 06:46 PM UTC
Resolved: Jun 13, 2025, 02:11 AM UTC
Duration: 7h 24m

Affected: Core Platform, Network Provisioning - Studios, Events, Login

Timeline · 6 updates

Notice June 12, 2025

Platform Slowness

Detected by Pingoru: Jun 12, 2025, 12:22 PM UTC
Resolved: Jun 12, 2025, 12:45 PM UTC
Duration: 22m

Affected: Core Platform

Timeline · 2 updates

investigating Jun 12, 2025, 12:22 PM UTC

Hello CVaaS Users, We have been experiencing temporary platform slowness since 2025-06-12 12:15 UTC. During this time, "Devices Stopped Streaming" events may fire.

resolved Jun 12, 2025, 12:45 PM UTC

This incident has been resolved.

Notice May 9, 2025

Event Notification system issue

Detected by Pingoru: May 09, 2025, 08:14 PM UTC
Resolved: May 09, 2025, 08:14 PM UTC
Duration: —

Affected: Events

Timeline · 2 updates

Minor May 5, 2025

Gap in data derived from streaming telemetry for subset of features

Detected by Pingoru: May 05, 2025, 05:41 PM UTC
Resolved: May 02, 2025, 12:00 PM UTC
Duration: —

Timeline · 1 update

Notice March 13, 2025

Known Impact on CVaaS

Detected by Pingoru: Mar 13, 2025, 10:29 PM UTC
Resolved: Mar 13, 2025, 11:21 PM UTC
Duration: 51m

Affected: Core Platform

Timeline · 2 updates

identified Mar 13, 2025, 10:29 PM UTC

We have discovered an issue that may impact a subset of users on CVaaS. We have identified the issue and are working on mitigation. We will provide an RCA as soon as possible. Thank you for your patience.

resolved Mar 13, 2025, 11:21 PM UTC

We have resolved the known issue and are reaching out to impacted customers directly. If you do not hear from us in the next hour, you are not impacted by this incident. We are prioritizing working with impacted customers first, and will circle back with an RCA update.

Notice February 27, 2025

Performance degradation

Detected by Pingoru: Feb 27, 2025, 03:35 PM UTC
Resolved: Feb 27, 2025, 05:17 PM UTC
Duration: 1h 41m

Affected: Core Platform, Network Provisioning - Studios, Events, Login

Timeline · 2 updates

identified Feb 27, 2025, 03:35 PM UTC

We're aware of transient spikes in processing latency affecting all regions following an upgrade. We have identified the root cause and are working on mitigation. Thank you for your patience.

resolved Feb 27, 2025, 05:17 PM UTC

The fix has been rolled out and this incident has been resolved.

Major February 21, 2025

Temporary "DeadlineExceeded" Errors seen on various APIs

Detected by Pingoru: Feb 21, 2025, 06:40 PM UTC
Resolved: Feb 21, 2025, 06:40 PM UTC
Duration: —

Affected: Network Provisioning - Studios, Events, Login

Timeline · 1 update

resolved Feb 21, 2025, 06:40 PM UTC

Hello CVaaS users, You may have experienced various red toast errors with messages such as "DeadlineExceeded.." when viewing, interacting with, or using various UI elements or APIs. We sincerely apologize for this issue. The offending change has been identified and rolled back, and should no longer be present. If you are still experiencing these red toast errors, please reach out to [email protected]. Thank you.

Major February 20, 2025

Elevated error rate

Detected by Pingoru: Feb 20, 2025, 10:52 AM UTC
Resolved: Feb 20, 2025, 11:24 AM UTC
Duration: 32m

Affected: Core Platform

Timeline · 3 updates

investigating Feb 20, 2025, 10:52 AM UTC

We are currently observing an elevated error rate on this cluster.

monitoring Feb 20, 2025, 10:59 AM UTC

Our reverse proxy was throwing errors. We've mitigated the issue and are investigating the root cause.

resolved Feb 20, 2025, 11:24 AM UTC

This incident has been resolved.

Major January 24, 2025

ChangeControls failing to execute

Detected by Pingoru: Jan 24, 2025, 06:16 PM UTC
Resolved: Jan 24, 2025, 07:13 PM UTC
Duration: 57m

Affected: Network Provisioning - Studios

Timeline · 3 updates

investigating Jan 24, 2025, 06:16 PM UTC

We are aware that change controls are failing to execute. We are currently investigating the cause of this issue and working on a resolution. Thank you for your patience.

monitoring Jan 24, 2025, 06:56 PM UTC

We've rolled back the change that was causing this issue. We'll continue to monitor change control usage and provide updates.

resolved Jan 24, 2025, 07:13 PM UTC

This incident has been resolved.

Critical January 21, 2025

New events not processing

Detected by Pingoru: Jan 21, 2025, 07:20 PM UTC
Resolved: Jan 22, 2025, 01:23 AM UTC
Duration: 6h 3m

Affected: Events

Timeline · 4 updates

investigating Jan 21, 2025, 07:20 PM UTC

We are currently experiencing issues with events processing. No new events are being generated. We will update as soon as we have more information.

monitoring Jan 21, 2025, 10:22 PM UTC

The issue has been identified and a mitigation has been implemented. We're monitoring the incident to ensure the issue is fully resolved.

monitoring Jan 21, 2025, 11:01 PM UTC

All expected event generation should be functional. We're continuing to monitor the incident.

resolved Jan 22, 2025, 01:23 AM UTC

This incident has been resolved.
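The duration listed for this incident spans midnight UTC, and it can be reproduced from the detected and resolved timestamps. The following is a minimal sketch, assuming the timestamp layout shown on this page and an English locale for month names; `parse_utc` and `format_duration` are hypothetical helper names. Because the page shows timestamps only to the minute, some other entries differ from this calculation by one minute.

```python
from datetime import datetime, timezone

# Timestamp layout used on this page, e.g. "Jan 21, 2025, 07:20 PM UTC"
FORMAT = "%b %d, %Y, %I:%M %p UTC"


def parse_utc(stamp: str) -> datetime:
    """Parse a page timestamp into a timezone-aware UTC datetime."""
    return datetime.strptime(stamp, FORMAT).replace(tzinfo=timezone.utc)


def format_duration(start: datetime, end: datetime) -> str:
    """Render the elapsed time between two datetimes as 'Xh Ym' or 'Ym'."""
    total_minutes = int((end - start).total_seconds() // 60)
    hours, minutes = divmod(total_minutes, 60)
    return f"{hours}h {minutes}m" if hours else f"{minutes}m"


detected = parse_utc("Jan 21, 2025, 07:20 PM UTC")
resolved = parse_utc("Jan 22, 2025, 01:23 AM UTC")
print(format_duration(detected, resolved))  # prints "6h 3m"
```

The sketch does not handle entries where the resolved time is not later than the detected time; the page lists those without a duration.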

Minor January 14, 2025

Platform slowness

Detected by Pingoru: Jan 14, 2025, 05:19 PM UTC
Resolved: Jan 14, 2025, 11:29 PM UTC
Duration: 6h 10m

Affected: Core Platform

Timeline · 6 updates

monitoring Jan 14, 2025, 05:19 PM UTC

Some dashboards can be slow. We expect that it will improve in 30 min.

monitoring Jan 14, 2025, 06:19 PM UTC

Some services still may be slow. We expect improvement in the next hour.

monitoring Jan 14, 2025, 07:04 PM UTC

We are continuing to monitor for any further issues.

monitoring Jan 14, 2025, 10:00 PM UTC

The us-central1-c region has recovered. We are still mitigating the platform slowness on us-central1-a.

monitoring Jan 14, 2025, 10:34 PM UTC

Both us-central1-a and us-central1-c regions have recovered. We are continuing to monitor.

resolved Jan 14, 2025, 11:29 PM UTC

This incident has been resolved.

Critical January 13, 2025

Data ingestion is blocked

Detected by Pingoru: Jan 13, 2025, 06:18 AM UTC
Resolved: Jan 13, 2025, 10:23 AM UTC
Duration: 4h 5m

Affected: Core Platform, Network Provisioning - Studios, Events, Login

Timeline · 5 updates

investigating Jan 13, 2025, 06:18 AM UTC

One of our databases is in read-only mode. We are investigating the issue.

identified Jan 13, 2025, 08:24 AM UTC

The system is catching up now. The ETA to catch up is approximately 1 hour.

identified Jan 13, 2025, 09:04 AM UTC

We are continuing to work on a fix for this issue.

monitoring Jan 13, 2025, 09:37 AM UTC

Everything is back to normal and the backlogged data has been ingested. We're continuing to monitor the situation and will then proceed with our postmortem analysis.

resolved Jan 13, 2025, 10:23 AM UTC

The incident has been resolved. We're still working on our postmortem analysis and will follow up with a detailed RCA soon.

Major December 2, 2024

Ingest processing delay

Detected by Pingoru: Dec 02, 2024, 08:51 PM UTC
Resolved: Dec 02, 2024, 09:20 PM UTC
Duration: 29m

Affected: Core Platform

Timeline · 3 updates

investigating Dec 02, 2024, 08:51 PM UTC

Telemetry and application data being published to the platform is currently being processed slowly. We're investigating.

monitoring Dec 02, 2024, 09:04 PM UTC

Processing has caught up. We've traced the root cause to a disruptive configuration change. We're monitoring the situation and following up on the root cause and possible next steps to prevent this from recurring.

resolved Dec 02, 2024, 09:20 PM UTC

This incident has been resolved.
