Trigger.dev Outage History

Trigger.dev is up right now

Trigger.dev had 18 outages in the last 2 years totaling 97h 43m of downtime — averaging 0.7 incidents per month.

There were 18 Trigger.dev outages since December 1, 2025 totaling 97h 43m of downtime. Each is summarised below — incident details, duration, and resolution information.

Source: https://status.trigger.dev

Minor April 21, 2026

DNS in us-east-1 is degraded

Detected by Pingoru
Apr 21, 2026, 03:40 PM UTC
Resolved
Apr 23, 2026, 05:15 PM UTC
Duration
2d 1h
Affected: Current status by service (Trigger.dev cloud)Current status by service (Trigger.dev OpenTelemetry)
Timeline · 3 updates

Read the full incident report →

Minor April 1, 2026

Realtime is behind

Detected by Pingoru
Apr 01, 2026, 12:00 PM UTC
Resolved
Apr 01, 2026, 06:57 PM UTC
Duration
6h 57m
Affected: Current status by service (Realtime)
Timeline · 2 updates
  1. investigating Apr 01, 2026, 12:00 PM UTC

    Realtime metadata updates and streaming v1 are not live, they've fallen behind. We're trying to remediate this.

  2. resolved Apr 01, 2026, 06:57 PM UTC

    Realtime is back to live. We're really sorry for this extended period of large delays. The service couldn't keep up the number of runs being processed and was falling further behind. We have made some configuration changes and upgraded it so it can cope with a higher throughput of runs. If you were using our React hooks that just did streaming, they were unimpacted by this.

Read the full incident report →

Minor March 16, 2026

Intermittent DNS issues in ...

Detected by Pingoru
Mar 16, 2026, 09:20 PM UTC
Resolved
Mar 17, 2026, 12:07 AM UTC
Duration
2h 47m
Affected: Current status by service (Trigger.dev cloud)
Timeline · 2 updates
  1. investigating Mar 16, 2026, 09:20 PM UTC

    From user runs we're seeing an increase in DNS related issues like: Error: getaddrinfo ENOTFOUND Error: getaddrinfo EAI_AGAIN We're investigating why this is happening.

  2. resolved Mar 17, 2026, 12:07 AM UTC

    DNS service is now back to fully operational. Increased traffic combined with a routine infrastructure rollout caused intermittent DNS resolution failures. We've tuned our DNS configuration to resolve the issue and are working on longer-term improvements to prevent recurrence.

Read the full incident report →

Minor March 6, 2026

Dashboard and telemetry deg...

Detected by Pingoru
Mar 06, 2026, 02:16 PM UTC
Resolved
Mar 06, 2026, 03:15 PM UTC
Duration
59m
Affected: Current status by service (Trigger.dev cloud)Current status by service (Trigger.dev OpenTelemetry)
Timeline · 2 updates
  1. investigating Mar 06, 2026, 02:16 PM UTC

    The runs list and detail pages in the dashboard are currently degraded due to an ongoing issue with our ClickHouse DB. We're also observing some logs and span ingestion failures. We're currently investigating. Run executions are not impacted.

  2. resolved Mar 06, 2026, 03:15 PM UTC

    The issue has been resolved. Dashboard and telemetry are now fully operational.

Read the full incident report →

Minor March 1, 2026

Elevated dequeue times in u...

Detected by Pingoru
Mar 01, 2026, 01:39 AM UTC
Resolved
Mar 01, 2026, 02:36 AM UTC
Duration
57m
Affected: Current status by service (Trigger.dev cloud)
Timeline · 2 updates
  1. investigating Mar 01, 2026, 01:39 AM UTC

    Dequeues are slower than normal in us-east-1. Runs are still executing, but they are slower to start. We’re investigating the issue.

  2. resolved Mar 01, 2026, 02:36 AM UTC

    The issue is now resolved and dequeue times are back to normal. Mainly free-tier runs were affected. This was caused by a spike in the free-tier run volume.

Read the full incident report →

Minor January 23, 2026

Intermittent DNS failures a...

Detected by Pingoru
Jan 23, 2026, 01:37 AM UTC
Resolved
Jan 23, 2026, 04:19 AM UTC
Duration
2h 42m
Affected: Current status by service (Trigger.dev cloud)
Timeline · 2 updates
  1. investigating Jan 23, 2026, 01:37 AM UTC

    We are experiencing intermittent issues that may cause some task runs to fail. Automatic retries are in place and should recover most affected runs. Our team is actively working on resolution.

  2. resolved Jan 23, 2026, 04:19 AM UTC

    Full service has been restored. Task execution is back to normal. If you experienced failures between 01:37 and 04:19 UTC, those runs can be retried successfully now. What happened: During a period of high activity, a backlog of completed runs built up faster than our cleanup processes could handle, which put pressure on internal services and caused intermittent failures. What we did: We spun up additional cleanup capacity to clear the backlog and restore normal operation. What we're doing next: We're increasing resource limits on critical internal services and adding better alerting so we can catch this earlier if it happens again.

Read the full incident report →

Major January 21, 2026

Some schedules have stopped...

Detected by Pingoru
Jan 21, 2026, 10:56 AM UTC
Resolved
Jan 21, 2026, 11:15 AM UTC
Duration
19m
Affected: Current status by service (Trigger.dev cloud)
Timeline · 2 updates
  1. investigating Jan 21, 2026, 10:56 AM UTC

    We had a brief outage earlier which affected a subset of schedules. We are working on a fix to get them going again.

  2. resolved Jan 21, 2026, 11:15 AM UTC

    All schedules have been fully restored.

Read the full incident report →

Minor January 16, 2026

Issue with task logs

Detected by Pingoru
Jan 16, 2026, 05:03 PM UTC
Resolved
Jan 16, 2026, 05:44 PM UTC
Duration
41m
Affected: Current status by service (Trigger.dev OpenTelemetry)
Timeline · 2 updates
  1. investigating Jan 16, 2026, 05:03 PM UTC

    Our task log storage system is currently overloaded and we are working on bringing up additional capacity, but in the meantime some logs may be lost.

  2. resolved Jan 16, 2026, 05:44 PM UTC

    We have finally been able to provision additional capacity and logs are working again. A full post-mortem will follow.

Read the full incident report →

Minor January 2, 2026

Dashboard runs list is delayed

Detected by Pingoru
Jan 02, 2026, 06:17 PM UTC
Resolved
Jan 02, 2026, 09:16 PM UTC
Duration
2h 59m
Affected: Current status by service (Trigger.dev cloud)
Timeline · 2 updates
  1. investigating Jan 02, 2026, 06:17 PM UTC

    Our run sync to clickhouse process is currently delayed. The runs list in the dashboard will be behind but runs are executing as normal.

  2. resolved Jan 02, 2026, 09:16 PM UTC

    The runs list is now up to do and syncing live updates again.

Read the full incident report →

Major January 1, 2026

Batches are slow to process

Detected by Pingoru
Jan 01, 2026, 10:00 AM UTC
Resolved
Jan 01, 2026, 03:10 PM UTC
Duration
5h 10m
Affected: Current status by service (Trigger.dev API)
Timeline · 2 updates
  1. investigating Jan 01, 2026, 10:00 AM UTC

    There is a backlog in processing batchTrigger and batchTriggerAndWait calls. This means runs are being created slower than normal for these. We're investigating why this is happening

  2. resolved Jan 01, 2026, 03:10 PM UTC

    The new batch concurrency processing defaults have brought the processing queue down to zero

Read the full incident report →

Minor December 17, 2025

Runs list is delayed

Detected by Pingoru
Dec 17, 2025, 03:12 PM UTC
Resolved
Dec 17, 2025, 03:30 PM UTC
Duration
18m
Affected: Current status by service (Trigger.dev cloud)
Timeline · 2 updates
  1. investigating Dec 17, 2025, 03:12 PM UTC

    Runs are not syncing to our clickhouse instances fast enough and so there is a delay in data in the runs list dashboard. Runs are operating normally.

  2. resolved Dec 17, 2025, 03:30 PM UTC

    Runs are now syncing live and the dashboard is back to normal.

Read the full incident report →

Major December 17, 2025

Realtime streams v2 is degr...

Detected by Pingoru
Dec 17, 2025, 07:28 AM UTC
Resolved
Dec 17, 2025, 08:16 AM UTC
Duration
48m
Affected: Current status by service (Realtime)
Timeline · 2 updates
  1. investigating Dec 17, 2025, 07:28 AM UTC

    Writes and reads to Realtime streams v2 are currently suffering an outage and we're investigating.

  2. investigating Dec 17, 2025, 08:16 AM UTC

    Fix has been applied and realtime streams v2 is fully operational.

Read the full incident report →

Minor December 16, 2025

Dashboard issues due to Cli...

Detected by Pingoru
Dec 16, 2025, 09:02 PM UTC
Resolved
Dec 16, 2025, 10:42 PM UTC
Duration
1h 40m
Affected: Current status by service (Trigger.dev cloud)
Timeline · 2 updates
  1. investigating Dec 16, 2025, 09:02 PM UTC

    We’re seeing a percentage of queries failing from ClickHouse Cloud which powers some pages in the dashboard, like Tasks graphs, Runs page and the logs. We’re talking to their team to try resolve this.

  2. investigating Dec 16, 2025, 10:42 PM UTC

    Operations have returned to normal, we're continuing to investigate the root cause and will provide more detail as we know more.

Read the full incident report →

Minor December 16, 2025

Dashboard is degraded

Detected by Pingoru
Dec 16, 2025, 10:43 AM UTC
Resolved
Dec 16, 2025, 11:05 AM UTC
Duration
22m
Affected: Current status by service (Trigger.dev cloud)Current status by service (Trigger.dev API)Current status by service (Trigger.dev OpenTelemetry)
Timeline · 2 updates
  1. investigating Dec 16, 2025, 10:43 AM UTC

    The dashboard is currently degraded due to an ongoing issue with our ClickHouse DB. We're currently investigating further. Run executions are not impacted.

  2. resolved Dec 16, 2025, 11:05 AM UTC

    The issue in ClickHouse is now resolved. The dashboard is back to being fully operational. The root cause was a faulty node in the ClickHouse cluster which we couldn't kill. We're speaking to the ClickHouse Cloud team to find out why it happened.

Read the full incident report →

Minor December 4, 2025

Runs list is lagging behind

Detected by Pingoru
Dec 04, 2025, 03:16 PM UTC
Resolved
Dec 04, 2025, 03:35 PM UTC
Duration
19m
Affected: Current status by service (Trigger.dev cloud)
Timeline · 2 updates
  1. investigating Dec 04, 2025, 03:16 PM UTC

    The runs list is currently showing stale data. Runs are executing like normal. Our replication process from postgresql to our Clickhouse instance is falling behind and so the dashboard will be showing stale run data. We're investigating.

  2. investigating Dec 04, 2025, 03:35 PM UTC

    The runs list has all caught up and the dashboard is no longer displaying stale data. We're continuing to investigate the root cause of this issue

Read the full incident report →

Minor December 1, 2025

open telemetry logs and spa...

Detected by Pingoru
Dec 01, 2025, 02:08 PM UTC
Resolved
Dec 02, 2025, 11:18 AM UTC
Duration
21h 10m
Affected: Current status by service (Trigger.dev OpenTelemetry)
Timeline · 2 updates
  1. investigating Dec 01, 2025, 02:08 PM UTC

    We are currently having issues with our ingestion of open telemetry logs and spans after rolling out a fix for the issue that was happening over the weekend with clickhouse. We're investigating

  2. investigating Dec 02, 2025, 11:18 AM UTC

    We've published a full post-mortem on this incident here: https://trigger.dev/blog/clickhouse-too-many-parts-postmortem

Read the full incident report →