IT OPS

Critical DNS Record Drift to PagerDuty Incident

Frequently checks a small allowlist of business-critical DNS records against the manifest and, if a protected record is changed or deleted, opens a PagerDuty incident…

CategoryIT Ops
Enginesim
Difficultyadvanced
Triggerschedule
Steps6
Setup~25 min

How it runs

The automated pipeline, trigger to output.

  • TriggerShort-interval schedule fires
  • ActionFetch watched critical records from CloudflareCloudflareCloudflare
  • ActionRead expected values from GitHub manifestGitHubGitHub
  • LogicCompare protected records; exit if all match
  • ActionOpen PagerDuty incident for drifted recordPagerDutyPagerDuty
  • OutputFile GitHub issue for trackingGitHubGitHub

What it does

This workflow watches only the DNS records you cannot afford to lose, such as apex A records, mail MX, and SPF/DKIM TXT records. It checks them often against the manifest and, on any unauthorized change, escalates immediately as a paging incident rather than a passive report.

When to use it

Use it for high-blast-radius records where drift means an outage or a deliverability failure. The hourly audit covers everything quietly; this one is the tripwire that wakes someone up. The narrow allowlist keeps it noise-free so a page always means something genuinely critical moved.

How it works

  1. 1A short-interval schedule triggers the critical-record check.
  2. 2The flow fetches the watched records from Cloudflare by name and type.
  3. 3It reads the expected values for those same records from the GitHub manifest.
  4. 4A logic step compares them and exits if every protected record matches.
  5. 5On a mismatch, an action opens a PagerDuty incident with the drifted record and its expected value.
  6. 6A final action files a GitHub issue capturing the drift for tracking and postmortem.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect CloudflareWorkers, Pages, R2, KV — the edge stack.
  2. 2
    Connect GitHubRepos, issues, pull requests, actions.
  3. 3
    Connect PagerDutyIncidents, on-call, escalations.
  4. 4
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  5. 5
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  6. 6
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.