AI & RAG

Nightly Confluence Runbook Indexer for Datadog Knowledge Base

On a schedule, syncs Confluence runbook pages into a Postgres pgvector store and links them to Datadog monitor names so the spike-explainer flows always retrieve fresh.

CategoryAI & RAG
Enginesim
Difficultyintermediate
Triggerschedule
Steps6
Setup~15 min

How it runs

The automated pipeline, trigger to output.

  • TriggerNightly schedule fires the runbook sync
  • ActionList Confluence runbook pages changed since last runConfluenceConfluence
  • LogicSkip unchanged pages, chunk the rest
  • ActionEmbed chunks and extract referenced metric/monitor namesOpenAI
  • ActionUpsert chunks and prune stale rows in pgvectorPostgreSQLPostgres
  • OutputPost index summary to Slack ops channelSlack

What it does

Keeps the runbook knowledge base current. Every night it pulls changed runbook pages from Confluence, splits them into searchable chunks, generates embeddings, and upserts them into Postgres — tagging each chunk with the Datadog monitors and metrics it references so retrieval stays precise.

When to use it

Run this alongside any of the spike-answering flows. Use it when runbooks change often and you need the RAG layer to reflect the latest fix steps without anyone re-indexing by hand. It is the data-prep backbone for the rest of the catalog.

How it works

  1. 1A nightly schedule triggers the sync.
  2. 2The flow lists Confluence pages in the runbook space updated since the last run.
  3. 3Each page is chunked, and a logic step skips unchanged pages to save embedding cost.
  4. 4An LLM embeds each new chunk and extracts referenced metric and monitor names.
  5. 5Chunks and metadata are upserted into the Postgres pgvector table, with stale rows for deleted pages removed.
  6. 6A short summary of pages added, updated, and removed is posted to the team's Slack ops channel.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect ConfluenceSpaces, pages, blueprints.
  2. 2
    Connect PostgresAny Postgres URL — query, write, migrate.
  3. 3
    Connect OpenAIModels, embeddings, files.
  4. 4
    Connect SlackChannels, DMs, threads, mentions.
  5. 5
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  6. 6
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  7. 7
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.