On-call agent: Scheduled runbook drill with safe shell rehearsal

On a weekly schedule, an agent rehearses a chosen runbook by running its read-only steps against staging and reports drift, broken commands, and stale instructions to Slack.

Category: AI Agents
Engine: sim
Difficulty: intermediate
Trigger: schedule
Steps: 5
Setup: ~15 min

How it runs

The automated pipeline, trigger to output.

  • Trigger: Weekly schedule selects next runbook
  • Action: Parse runbook, mark rehearsal-safe steps
  • Action: Run safe shell steps against staging (Shell)
  • Logic: Compare results to expected, detect drift
  • Output: Post freshness report to Slack

What it does

Keeps runbooks honest. On a schedule, the agent picks a runbook, executes only its read-only and dry-run-safe steps against staging, and flags anything that errored, drifted, or no longer matches reality.
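
As an illustration of what "read-only and dry-run-safe" could mean in practice, here is a minimal Python sketch of an allowlist check. The command lists, the dry-run flags, and the function name is_rehearsal_safe are all assumptions made for this example, not the workflow's actual rules; the agent's real classification is driven by its prompt and may differ.

```python
import shlex

# Hypothetical allowlist: single commands treated as read-only.
READ_ONLY_COMMANDS = {"cat", "ls", "grep", "head", "tail", "df", "uptime"}

# Hypothetical (command, subcommand) pairs treated as read-only.
READ_ONLY_SUBCOMMANDS = {
    ("kubectl", "get"),
    ("kubectl", "describe"),
    ("git", "status"),
    ("systemctl", "status"),
}

# Hypothetical dry-run flags that make an otherwise mutating command safe.
DRY_RUN_FLAGS = {"--dry-run", "--dry-run=client", "--dry-run=server"}

# Shell metacharacters that could chain, redirect, or substitute commands.
SHELL_METACHARACTERS = (";", "|", "&", ">", "<", "`", "$(")


def is_rehearsal_safe(command: str) -> bool:
    """Return True only if the command is clearly read-only or a dry run."""
    # Deliberately conservative: any metacharacter anywhere rejects the step,
    # even inside quotes, so some harmless commands are skipped too.
    if any(ch in command for ch in SHELL_METACHARACTERS):
        return False
    try:
        tokens = shlex.split(command)
    except ValueError:
        # Unbalanced quotes and similar parse errors are treated as unsafe.
        return False
    if not tokens:
        return False
    if any(token in DRY_RUN_FLAGS for token in tokens):
        return True
    if tokens[0] in READ_ONLY_COMMANDS:
        return True
    return len(tokens) > 1 and (tokens[0], tokens[1]) in READ_ONLY_SUBCOMMANDS
```

The sketch defaults to rejecting anything it does not recognize, so a step is skipped rather than run when in doubt.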

When to use it

Use it to catch rotten runbooks before an incident does. Great for teams whose docs silently go stale between pages.

How it works

  1. A weekly cron schedule triggers the workflow with the next runbook in rotation.
  2. The agent parses the runbook into discrete steps and marks which are safe to rehearse.
  3. It runs the safe shell steps against the staging target and captures results.
  4. It compares observed behavior against the runbook's documented expectations and detects drift.
  5. It compiles a freshness report listing broken commands, changed outputs, and stale notes.
  6. The report is posted to the platform Slack channel with a per-step pass or fail.
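
The comparison and reporting steps above can be sketched in Python as follows. This is a rough illustration under stated assumptions: the StepResult fields, the substring-based notion of "expected output", and the report layout are all hypothetical, and the sketch only builds the message text rather than posting it to Slack.

```python
from dataclasses import dataclass


@dataclass
class StepResult:
    """Outcome of one rehearsed step (hypothetical shape for this sketch)."""
    command: str
    exit_code: int
    output: str
    expected_substring: str  # assumed: text the runbook says should appear


def classify(result: StepResult) -> str:
    """Label a step as 'broken', 'drifted', or 'pass'."""
    if result.exit_code != 0:
        return "broken"  # the command itself no longer works
    if result.expected_substring not in result.output:
        return "drifted"  # it ran, but the output no longer matches the doc
    return "pass"


def freshness_report(runbook: str, results: list[StepResult]) -> str:
    """Build a plain-text report with a per-step pass or fail line."""
    lines = [f"Runbook drill: {runbook}"]
    failed = 0
    for number, result in enumerate(results, start=1):
        status = classify(result)
        if status == "pass":
            label = "PASS"
        else:
            label = f"FAIL ({status})"
            failed += 1
        lines.append(f"{number}. {label} {result.command}")
    lines.append(f"{failed} of {len(results)} steps failed")
    return "\n".join(lines)
```

Separating "broken" from "drifted" keeps the two kinds of staleness distinct in the report: a command that errors needs fixing, while a changed output may only need the documented expectation updated.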

Set it up

What you configure once, before turning it on.

  1. Connect Shell. Run sandboxed commands inside the workspace.
  2. Connect Slack. Channels, DMs, threads, mentions.
  3. Set each agent's model. We leave models unset so you pick the tier: fast and cheap, or top-quality.
  4. Tune it to your data. Edit the prompts, filters, and field mappings so it matches how your team works.
  5. Test, then turn it on. Run once against a sample, confirm the output, then enable the trigger.
