IT OPS

Auto-Remediate Cloudflare DNS Drift via Manifest PR

On a schedule it detects DNS records that drifted from the manifest, and for low-risk record types it reverts Cloudflare back to the manifest value.

CategoryIT Ops
Enginesim
Difficultyadvanced
Triggerschedule
Steps6
Setup~25 min

How it runs

The automated pipeline, trigger to output.

  • TriggerSchedule fires for remediation pass
  • ActionFetch live records (Cloudflare) and manifest (GitHub)CloudflareCloudflare
  • LogicSplit drift into auto-revertible vs human-review
  • ActionRevert low-risk records in Cloudflare to manifest valueCloudflareCloudflare
  • ActionOpen GitHub PR documenting correctionsGitHubGitHub
  • OutputPost PR link and held-back items to SlackSlack

What it does

This workflow closes the loop on drift instead of just reporting it. It finds where the live Cloudflare zone diverges from the manifest, automatically restores the manifest-defined value for safe-to-revert record types, and records every correction in a GitHub pull request for review and history.

When to use it

Use it once you trust your manifest as the true source of record and want self-healing DNS for low-risk records (such as CNAME and TXT verification records), while keeping risky changes human-gated. It removes the toil of manually re-applying values after an out-of-band edit, while leaving an auditable paper trail of what the automation touched.

How it works

  1. 1A schedule fires to start the remediation pass.
  2. 2The flow fetches live records from Cloudflare and reads the manifest from GitHub.
  3. 3A logic step computes drift and splits it into auto-revertible records versus changes that require human review.
  4. 4For auto-revertible records, an action updates Cloudflare back to the manifest value.
  5. 5A second action opens a GitHub PR summarizing each reverted record and listing the human-review items.
  6. 6The PR link and any held-back changes are posted to Slack.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect CloudflareWorkers, Pages, R2, KV — the edge stack.
  2. 2
    Connect GitHubRepos, issues, pull requests, actions.
  3. 3
    Connect SlackChannels, DMs, threads, mentions.
  4. 4
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  5. 5
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  6. 6
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.