DEVOPS

Auto-rollback any live Worker version on an Axiom error spike

Continuously watches Axiom for a sudden error-rate spike on the live Worker version and, when one is confirmed against a rolling baseline, instantly reverts the traffic split…

Category: DevOps
Engine: sim
Difficulty: intermediate
Trigger: schedule
Steps: 5
Setup: ~15 min

How it runs

The automated pipeline, trigger to output.

  • Trigger: Schedule, every 2 min
  • Action: Query Axiom for live rate vs. baseline (Axiom)
  • Logic: Confirm spike over two consecutive checks
  • Action: Revert split to last-good version (Cloudflare)
  • Output: Page team in Discord with details (Discord)

What it does

Acts as a safety net for whatever Worker version is currently live, canary or not. Every 2 minutes it pulls the live version's error rate from Axiom and compares it to a rolling trailing-hour baseline. A confirmed spike triggers an immediate revert of the Cloudflare traffic split to the recorded last-good version, so a bad deploy or upstream failure is contained within seconds of the spike being confirmed.
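As a rough illustration of the revert step, the sketch below only builds the HTTP request that would send 100% of traffic to the last-good version; it does not send anything. It assumes the Cloudflare Workers deployments endpoint and body shape shown here, which should be checked against the current Cloudflare API reference. The function name, the account, script, and version values, and the annotation text are all hypothetical.

```python
import json

API_BASE = "https://api.cloudflare.com/client/v4"


def build_revert_request(account_id: str, script_name: str,
                         last_good_version_id: str) -> dict:
    """Describe a request that routes all traffic to one Worker version.

    Assumption: deployments are created with a POST to the script's
    /deployments endpoint using a percentage strategy.
    """
    url = (f"{API_BASE}/accounts/{account_id}"
           f"/workers/scripts/{script_name}/deployments")
    body = {
        "strategy": "percentage",
        # A single entry at 100% removes the failing version from the split.
        "versions": [{"version_id": last_good_version_id, "percentage": 100}],
        # Hypothetical message so the rollback is traceable later.
        "annotations": {"workers/message": "auto-rollback: Axiom error spike"},
    }
    return {"method": "POST", "url": url, "body": json.dumps(body)}


if __name__ == "__main__":
    request = build_revert_request("ACCOUNT_ID", "my-worker", "LAST_GOOD_VERSION_ID")
    print(request["method"], request["url"])
    print(request["body"])
```

Keeping request construction separate from sending makes the revert easy to inspect in a dry run before any credentials are attached.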

When to use it

Use it as always-on protection independent of any specific rollout. It catches regressions that slip past a canary gate, plus error storms caused by config drift or a failing dependency rather than the code itself.

How it works

  1. A schedule fires every 2 minutes.
  2. Axiom returns the live version's current error rate and the trailing-hour baseline.
  3. A logic step flags a spike only if the rate exceeds the baseline by a configured multiplier on two consecutive checks, which suppresses single-blip noise.
  4. On a confirmed spike, Cloudflare reverts the split to the last-good version ID.
  5. The team is paged in Discord with the before/after error rates and the reverted version.
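The confirmation logic in step 3 can be sketched as a small counter that resets whenever a check comes back healthy. This is a minimal sketch, not the workflow's actual implementation: the class name, the default multiplier of 3.0, and the minimum-rate floor are assumptions added for illustration. In a scheduled workflow the counter would also need to be persisted between runs, which is not shown.

```python
from dataclasses import dataclass


@dataclass
class SpikeDetector:
    multiplier: float = 3.0    # hypothetical default; tune to your traffic
    min_rate: float = 0.01     # hypothetical floor so a near-zero baseline
                               # does not turn tiny blips into breaches
    required_checks: int = 2   # consecutive breaches needed to confirm
    consecutive: int = 0       # breaches seen in a row so far

    def is_breach(self, live_rate: float, baseline_rate: float) -> bool:
        """True when the live rate exceeds the scaled baseline (or the floor)."""
        threshold = max(baseline_rate * self.multiplier, self.min_rate)
        return live_rate > threshold

    def observe(self, live_rate: float, baseline_rate: float) -> bool:
        """Record one scheduled check; return True once a spike is confirmed."""
        if self.is_breach(live_rate, baseline_rate):
            self.consecutive += 1
        else:
            self.consecutive = 0  # a healthy check clears the streak
        return self.consecutive >= self.required_checks


if __name__ == "__main__":
    detector = SpikeDetector()
    # Rates are fractions of requests that errored, e.g. 0.20 means 20%.
    for live, baseline in [(0.20, 0.02), (0.01, 0.02), (0.22, 0.02), (0.25, 0.02)]:
        print(live, baseline, detector.observe(live, baseline))
```

Resetting on any healthy check means an isolated blip never accumulates toward a rollback, at the cost of taking at least two schedule intervals to confirm a real spike.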

Set it up

What you configure once, before turning it on.

  1. Connect Axiom: log streams, queries, dashboards.
  2. Connect Cloudflare: Workers, Pages, R2, KV (the edge stack).
  3. Connect Discord: community channels, voice, and bots.
  4. Set each agent's model: we leave models unset so you pick the tier, fast and cheap or top-quality.
  5. Tune it to your data: edit the prompts, filters, and field mappings so it matches how your team works.
  6. Test, then turn it on: run once against a sample, confirm the output, then enable the trigger.
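For the test run in the last step, one option is to render the Discord page from a sample event and inspect it before enabling the trigger. The sketch below assumes the page is delivered through a Discord webhook, which accepts a JSON body with a `content` field of up to 2000 characters. The function name, message wording, and sample values are hypothetical.

```python
import json
from typing import Optional

DISCORD_CONTENT_LIMIT = 2000  # Discord caps message content at 2000 characters


def build_page(bad_version: str, last_good_version: str,
               rate_before: float, rate_after: Optional[float] = None) -> dict:
    """Build a webhook payload summarising a rollback.

    Rates are fractions (0.20 means 20%). rate_after may be unknown at the
    moment of the revert, in which case it is reported as pending.
    """
    after = "pending" if rate_after is None else f"{rate_after:.2%}"
    content = (
        f"Auto-rollback: reverted {bad_version} to {last_good_version}.\n"
        f"Error rate before: {rate_before:.2%}\n"
        f"Error rate after: {after}"
    )
    # Truncate defensively so an unexpectedly long ID cannot break the post.
    return {"content": content[:DISCORD_CONTENT_LIMIT]}


if __name__ == "__main__":
    # Dry run against a made-up sample event: print instead of posting.
    sample = build_page("version-bad", "version-good", 0.20, 0.02)
    print(json.dumps(sample, indent=2))
```

Printing the payload rather than posting it lets you confirm the wording and the before/after figures against a known sample without paging anyone.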
