CONTENT CREATION

Broken and Missing Docs Screenshot Detector

Crawls all published doc pages, checks that every screenshot asset actually loads from the CDN, and pages the on-call docs owner when images return 404 or fail to render.

CategoryContent Creation
Enginesim
Difficultyintermediate
Triggerschedule
Steps5
Setup~15 min

How it runs

The automated pipeline, trigger to output.

  • TriggerSchedule triggers the docs crawl
  • ActionCrawl docs and collect screenshot asset URLsFirecrawl
  • ActionValidate each asset from the CDNCloudflareCloudflare
  • LogicKeep only assets that fail validation
  • OutputOpen a PagerDuty incident for broken imagesPagerDutyPagerDuty

What it does

Guards against the most visible kind of screenshot rot: images that have gone missing entirely. It crawls every published doc page, requests each screenshot asset from the CDN, and verifies it returns a valid image rather than a 404, an error page, or a zero-byte file.

When to use it

Use it after CDN migrations, bucket renames, or large docs restructures, or simply as a standing safety net. A broken screenshot looks far worse to readers than a slightly outdated one, so missing assets warrant an immediate alert.

How it works

  1. 1A schedule triggers the crawl, or it can be kicked off manually after a migration.
  2. 2The flow crawls the published docs site and collects every screenshot asset URL.
  3. 3Each asset is requested from the Cloudflare-served CDN and checked for status code, content type, and non-empty body.
  4. 4A logic step collects only the assets that failed validation.
  5. 5If any are broken, the flow opens a PagerDuty incident routed to the docs on-call with the failing URLs and the pages that reference them.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect FirecrawlCrawl, scrape, structured extract.
  2. 2
    Connect CloudflareWorkers, Pages, R2, KV — the edge stack.
  3. 3
    Connect PagerDutyIncidents, on-call, escalations.
  4. 4
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  5. 5
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  6. 6
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.