CONTENT CREATION

Generate Annotated Callout Screenshots for Doc Steps

Captures a clean UI screenshot per documented step with Browserbase, then uses image generation to overlay numbered callouts and highlight boxes before publishing the annotated…

CategoryContent Creation
Enginesim
Difficultyintermediate
Triggermanual
Steps5
Setup~15 min

How it runs

The automated pipeline, trigger to output.

  • TriggerManual or scheduled run for a doc guide
  • ActionRead step definitions and selectors from ReadMeReadMeReadMe
  • ActionCapture base screenshot and element bounds per stepBrowserbase
  • ActionGenerate annotated overlay image per stepImage generation
  • OutputPublish annotated images to ReadMe pagesReadMeReadMe

What it does

Turns plain screen captures into polished, annotated walkthrough images. For each step in a documented procedure, it shoots the live UI, then generates an overlay with numbered markers, highlight boxes, and a clean caption, producing the kind of annotated screenshot that takes a designer an afternoon to build by hand.

When to use it

Use it for onboarding guides, how-to articles, and tutorials where readers need visual cues pointing at the exact element to click. Ideal when you regenerate these guides often and re-annotating each screenshot manually is the bottleneck.

How it works

  1. 1A schedule or manual run kicks off for a chosen doc guide.
  2. 2The flow reads the guide's step definitions and target UI selectors from ReadMe.
  3. 3Browserbase navigates to each step's screen and captures a base screenshot plus the bounding box of the highlighted element.
  4. 4An image generation step composes the annotated version — numbered callout, highlight ring, and caption — from the base capture and coordinates.
  5. 5The annotated images are uploaded back to the matching ReadMe page, replacing the prior versions.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect ReadMeAPI docs, changelog, auth.
  2. 2
    Connect BrowserbaseHeadless browsers, sessions, replays.
  3. 3
    Connect Image generationManaged Nano Banana image renders, metered per image.
  4. 4
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  5. 5
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  6. 6
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.