
Origin-Cost Spike from Cache Drop to PagerDuty Escalation

Detects when a Cloudflare cache hit-ratio decline drives an origin egress/cost spike and escalates a fully-contextualized incident to PagerDuty with the suspect config change…

Category: DevOps
Engine: sim
Difficulty: intermediate
Trigger: schedule
Steps: 6
Setup: ~15 min

How it runs

The automated pipeline, trigger to output.

  • Trigger: Every 30 minutes
  • Action: Read hit ratio + origin volume/bandwidth (Cloudflare)
  • Logic: Ratio drop AND origin-cost surge over threshold?
  • Action: Identify probable-cause edge-config commit (GitLab)
  • Action: Open enriched PagerDuty incident (PagerDuty)
  • Output: Mirror summary to cost-watch channel (Slack)
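
The logic gate in this pipeline pages only when both conditions hold at once. A minimal Python sketch of that check follows, assuming each 30-minute sample has already been reduced to a hit ratio and an estimated origin cost. The `Window` type, the function name, and the default thresholds are illustrative placeholders, not values taken from the workflow.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Window:
    """One 30-minute sample. Field names are hypothetical."""
    hit_ratio: float        # cache hit ratio, 0.0 to 1.0
    origin_cost_usd: float  # estimated origin cost for the window


def should_escalate(baseline: Window, current: Window,
                    min_ratio_drop: float = 0.05,
                    min_cost_surge_usd: float = 50.0) -> bool:
    """Return True only when the ratio dropped AND the cost surged.

    Both thresholds are placeholder defaults; tune them to your traffic.
    """
    ratio_drop = baseline.hit_ratio - current.hit_ratio
    cost_surge = current.origin_cost_usd - baseline.origin_cost_usd
    return ratio_drop >= min_ratio_drop and cost_surge >= min_cost_surge_usd
```

Requiring both conditions is what keeps a ratio dip on low-traffic content from paging anyone: the ratio moves, but the origin bill does not.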

What it does

This workflow watches the downstream consequence of a cache regression (origin requests and egress cost), not just the ratio itself. Every 30 minutes it reads the cache hit ratio plus origin request volume and bandwidth from Cloudflare. When a ratio drop coincides with an origin-cost surge above your dollar threshold, it opens a PagerDuty incident enriched with the cost delta, the ratio change, and the GitLab commit most likely responsible, then mirrors a summary to the cost-watch Slack channel.
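
To compare against a dollar threshold, origin volume and bandwidth first have to be turned into a cost figure. One way to do that is sketched below, assuming a simple linear pricing model. The function names and both rate parameters are hypothetical; substitute the rates from your own origin or cloud provider's bill.

```python
def origin_cost_usd(origin_bytes: int, origin_requests: int,
                    usd_per_gb: float,
                    usd_per_million_requests: float) -> float:
    """Estimate origin cost for one window under assumed linear pricing."""
    egress_gb = origin_bytes / 1e9  # decimal gigabytes
    request_millions = origin_requests / 1_000_000
    return egress_gb * usd_per_gb + request_millions * usd_per_million_requests


def cost_delta_usd(baseline_usd: float, current_usd: float) -> float:
    """Cost delta reported in the incident: current minus baseline."""
    return current_usd - baseline_usd
```

For example, 500 GB of origin egress and 2 million origin requests at illustrative rates of $0.09 per GB and $0.40 per million requests comes to roughly $45.80 for the window.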

When to use it

Use it when the business impact you care about is the origin bill and SLA, and you want paging reserved for regressions that actually cost money — filtering out harmless ratio dips that don't move origin load.

How it works

  1. Schedule fires every 30 minutes.
  2. Cloudflare returns cache hit ratio plus origin request volume and bandwidth.
  3. A logic step confirms both a ratio drop and an origin-cost surge crossing the threshold.
  4. GitLab identifies the most recent edge-config commit as the probable cause.
  5. PagerDuty opens an incident with the cost delta, ratio change, and suspect commit.
  6. Slack mirrors the incident summary to the cost-watch channel.
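
Steps 4 and 5 can be sketched in Python as below. This is an illustration under assumptions, not the workflow's actual implementation. It assumes the commits arrive as dictionaries with `id`, `title`, and `committed_date` fields (as the GitLab commits API returns them, with a `+00:00` style offset) and that the incident is raised through a PagerDuty Events API v2 style trigger event. The function names, the `source` string, the `dedup_key`, and the severity choice are hypothetical.

```python
from datetime import datetime
from typing import Optional


def pick_suspect_commit(commits: list[dict],
                        detected_at: datetime) -> Optional[dict]:
    """Return the most recent commit at or before detection, else None.

    detected_at must be timezone-aware so it compares with the parsed dates.
    """
    candidates = [
        c for c in commits
        if datetime.fromisoformat(c["committed_date"]) <= detected_at
    ]
    return max(candidates,
               key=lambda c: datetime.fromisoformat(c["committed_date"]),
               default=None)


def build_incident_event(routing_key: str, ratio_before: float,
                         ratio_after: float, cost_delta_usd: float,
                         suspect: Optional[dict]) -> dict:
    """Build a trigger event carrying the cost delta, ratio change, commit."""
    summary = (f"Origin cost up ${cost_delta_usd:,.2f} per window; "
               f"cache hit ratio {ratio_before:.0%} -> {ratio_after:.0%}")
    return {
        "routing_key": routing_key,
        "event_action": "trigger",
        "dedup_key": "origin-cost-spike",      # hypothetical; groups repeats
        "payload": {
            "summary": summary,
            "source": "cloudflare-cache-watch",  # hypothetical source name
            "severity": "error",
            "custom_details": {
                "cost_delta_usd": cost_delta_usd,
                "hit_ratio_before": ratio_before,
                "hit_ratio_after": ratio_after,
                "suspect_commit": suspect,       # None if nothing matched
            },
        },
    }
```

Restricting candidates to commits made at or before detection is an added assumption; it prevents blaming a change that landed after the regression was already visible.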

Set it up

What you configure once, before turning it on.

  1. Connect Cloudflare: Workers, Pages, R2, KV (the edge stack).
  2. Connect GitLab: Repos, MRs, pipelines, registry.
  3. Connect PagerDuty: Incidents, on-call, escalations.
  4. Connect Slack: Channels, DMs, threads, mentions.
  5. Set each agent's model: We leave models unset so you pick the tier (fast and cheap, or top-quality).
  6. Tune it to your data: Edit the prompts, filters, and field mappings so it matches how your team works.
  7. Test, then turn it on: Run once against a sample, confirm the output, then enable the trigger.
