AI AGENTS

On-call agent: PagerDuty incident diagnose and dry-run plan

On a new PagerDuty incident, an agent runs read-only shell diagnostics, builds a step-by-step remediation plan, and posts it as an actionable plan for the responder to approve.

CategoryAI Agents
Enginepaperclip
Difficultyintermediate
Triggerevent
Steps5
Setup~15 min

How it runs

The automated pipeline, trigger to output.

  • TriggerPagerDuty incident createdPagerDutyPagerDuty
  • ActionRun read-only shell diagnostics from allowlistShell
  • LogicCorrelate findings, rank likely causes
  • ActionBuild numbered remediation plan with dry-run
  • OutputPost plan to PagerDuty notes and SlackPagerDutyPagerDuty

What it does

When PagerDuty pages, the agent does the first 10 minutes of triage for you. It runs only safe, read-only shell checks, correlates them against the linked runbook, and produces a numbered remediation plan with a clear dry-run of each step.

When to use it

Use it for noisy services where responders waste time gathering the same diagnostics every page. The agent never mutates state — it proposes; you decide.

How it works

  1. 1A PagerDuty incident webhook delivers the alert, service, and severity.
  2. 2The agent selects the matching runbook and runs a fixed allowlist of read-only shell commands (logs, disk, process, health endpoints).
  3. 3It summarizes findings and ranks likely causes.
  4. 4It assembles a numbered remediation plan, marking each step as the exact command that would run.
  5. 5The plan and dry-run are posted to the incident's PagerDuty notes and the responder Slack channel.
  6. 6The responder approves or edits before any mutating action is taken elsewhere.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect PagerDutyIncidents, on-call, escalations.
  2. 2
    Connect ShellRun sandboxed commands inside the workspace.
  3. 3
    Connect SlackChannels, DMs, threads, mentions.
  4. 4
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  5. 5
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  6. 6
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.