AI & RAG

Agentic on-call investigator with live telemetry plus postmortems

An agent investigates a remediation question by searching past postmortems and pulling live metrics from Datadog.

CategoryAI & RAG
Enginepaperclip
Difficultyadvanced
Triggerchat
Steps6
Setup~25 min

How it runs

The automated pipeline, trigger to output.

  • TriggerResponder asks the agent to investigate a symptom in SlackSlack
  • ActionRetrieve relevant postmortems from the Postgres indexPostgreSQLPostgres
  • ActionQuery Datadog for current metrics cited in the postmortemsDatadogDatadog
  • LogicDecide if historical remediation applies or needs adaptation
  • ActionOpen a Linear ticket with recommendation and evidenceLinearLinear
  • OutputReply in Slack with cited recommendation and ticket linkSlack

What it does

Goes beyond document retrieval: an agent reasons over both the postmortem knowledge base and current system telemetry to recommend a remediation tailored to the present state, cites the postmortems it relied on, and files a tracking ticket so the action isn't lost.

When to use it

Use it for thornier incidents where the right fix depends on what the system is doing right now, not just what worked last time. Best when responders want a single agent that correlates 'how we fixed it before' with 'what the dashboards show now'.

How it works

  1. 1A responder asks the agent in Slack to investigate a symptom.
  2. 2The agent retrieves relevant postmortems from the Confluence-backed Postgres index.
  3. 3It queries Datadog for the current metrics named in those postmortems to confirm the failure mode matches.
  4. 4It reasons over both sources, deciding whether the historical remediation still applies or needs adaptation.
  5. 5It opens a Linear ticket capturing the recommended remediation and evidence.
  6. 6It replies in Slack with the cited recommendation and the ticket link.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect SlackChannels, DMs, threads, mentions.
  2. 2
    Connect PostgresAny Postgres URL — query, write, migrate.
  3. 3
    Connect DatadogMetrics, traces, log search.
  4. 4
    Connect LinearIssues, projects, cycles, triage.
  5. 5
    Connect ConfluenceSpaces, pages, blueprints.
  6. 6
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  7. 7
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  8. 8
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.