DEVOPS

Edge Canary: PagerDuty-Triggered Instant Worker Rollback

When a PagerDuty incident fires during an active canary, instantly rolls Cloudflare traffic back to the last known-good Worker version.

CategoryDevOps
Enginesim
Difficultyadvanced
Triggerwebhook
Steps6
Setup~25 min

How it runs

The automated pipeline, trigger to output.

  • TriggerPagerDuty incident triggered webhookPagerDutyPagerDuty
  • LogicIs a canary deployment active?
  • ActionRoll Cloudflare traffic back to stable versionCloudflareCloudflare
  • ActionVerify burn rate dropped in HoneycombHoneycomb
  • LogicRecovered or still burning?
  • OutputUpdate PagerDuty incident and notify SlackPagerDutyPagerDuty

What it does

Turns a PagerDuty page into an automatic edge rollback. If an incident is triggered while a canary deployment is live, this workflow immediately shifts Cloudflare traffic back to 100% on the previously stable Worker version, then verifies in Honeycomb that the error budget burn actually stops before declaring recovery on the incident.

When to use it

Use for high-blast-radius edge services where the right reaction to any page during a canary is "revert first, investigate later." It compresses the revert-and-confirm loop into seconds without a human at the keyboard.

How it works

  1. 1A PagerDuty webhook fires on a triggered incident tagged to the edge service.
  2. 2A logic branch confirms a canary deployment is currently active (otherwise it exits).
  3. 3The workflow calls Cloudflare to roll the gradual deployment back to 100% stable version.
  4. 4It waits, then queries Honeycomb to confirm the burn rate has dropped post-rollback.
  5. 5A second logic branch decides recovered versus still-burning.
  6. 6It posts a status note back to the PagerDuty incident and mirrors the outcome to Slack.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect PagerDutyIncidents, on-call, escalations.
  2. 2
    Connect CloudflareWorkers, Pages, R2, KV — the edge stack.
  3. 3
    Connect HoneycombDistributed traces and queries.
  4. 4
    Connect SlackChannels, DMs, threads, mentions.
  5. 5
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  6. 6
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  7. 7
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.