AI & RAG

Re-index ReadMe corpus into a vector store on every doc publish

When a ReadMe page is published or updated, it re-chunks and re-embeds that page and upserts the vectors into Postgres so the answer endpoint always grounds on the latest docs.

CategoryAI & RAG
Enginesim
Difficultyintermediate
Triggerwebhook
Steps5
Setup~15 min

How it runs

The automated pipeline, trigger to output.

  • TriggerReadMe page published/updated eventHTTP webhook
  • ActionFetch current page content from ReadMeReadMeReadMe
  • ActionChunk and embed page sectionsOpenAI
  • LogicDecide upsert vs prune removed sections
  • OutputUpsert vectors and metadata into PostgresPostgreSQLPostgres

What it does

Keeps your retrieval index in sync with ReadMe. On every publish or edit event, it pulls the changed page, splits it into section-level chunks, generates embeddings, and upserts them into the Postgres vector table that your answer endpoint queries.

When to use it

Run this so your grounded answer endpoint never serves stale documentation. Essential when your docs change frequently and you cannot tolerate the index drifting from what developers actually read.

How it works

  1. 1A ReadMe publish/update event fires the webhook with the affected page identifier.
  2. 2The current page content is fetched from ReadMe so you index exactly what shipped.
  3. 3The page is split into section-level chunks preserving heading anchors for later citation.
  4. 4Each chunk is embedded by the model.
  5. 5A logic step decides upsert vs delete: removed sections are pruned, changed sections overwritten.
  6. 6The vectors and section metadata are upserted into Postgres, keeping the corpus current.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect ReadMeAPI docs, changelog, auth.
  2. 2
    Connect OpenAIModels, embeddings, files.
  3. 3
    Connect PostgresAny Postgres URL — query, write, migrate.
  4. 4
    Connect HTTP webhookTrigger any URL on agent actions.
  5. 5
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  6. 6
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  7. 7
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.