agent hive

AI & RAG

Agentic incident investigator over the runbook wiki

When a PagerDuty incident fires, an agent reads the alert, retrieves and reasons across multiple runbook pages, correlates recent GitHub changes.

CategoryAI & RAG
Enginepaperclip
Difficultyadvanced
Triggerevent
Steps5
Setup~25 min

How it runs

The automated pipeline, trigger to output.

  • TriggerPagerDuty incident triggers the investigatorPagerDutyPagerDuty
  • ActionIteratively retrieve relevant runbook pages from pgvectorPostgreSQLPostgres
  • ActionPull recent deploys and merges from GitHubGitHubGitHub
  • LogicReason over evidence and assemble cited plan with OpenAIOpenAI
  • OutputPost grounded first-response plan to incident channelSlack

What it does

Spins up an autonomous first responder. Given a fresh incident, the agent searches the versioned runbook wiki, follows cross-references between pages, pulls in recent related code changes, and synthesizes a step-by-step mitigation plan grounded in cited sources rather than a single nearest-neighbor lookup.

When to use it

Use it for high-severity incidents where the right runbook is spread across several pages and the responder benefits from an agent that can chain retrieval, weigh freshness, and connect the alert to a likely recent deploy.

How it works

  1. 1A PagerDuty incident-triggered webhook delivers the alert payload.
  2. 2The agent issues iterative retrievals against the runbook vector store in Postgres, expanding from the alert symptoms.
  3. 3It queries GitHub for deploys and merges in the affected service over the last 24 hours.
  4. 4The agent reasons over the gathered evidence with OpenAI, citing each runbook page and commit it relied on.
  5. 5It posts a grounded first-response plan, with confidence and sources, to the incident's Slack channel.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect PagerDutyIncidents, on-call, escalations.
  2. 2
    Connect PostgresAny Postgres URL — query, write, migrate.
  3. 3
    Connect GitHubRepos, issues, pull requests, actions.
  4. 4
    Connect OpenAIModels, embeddings, files.
  5. 5
    Connect SlackChannels, DMs, threads, mentions.
  6. 6
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  7. 7
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  8. 8
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.