AI & RAG

Index New Dropbox Contracts into a Clause Vector Store

When a new or updated agreement lands in a Dropbox folder, it extracts the text, splits it into clause-level chunks, embeds them with OpenAI.

CategoryAI & RAG
Enginesim
Difficultyintermediate
Triggerevent
Steps5
Setup~15 min

How it runs

The automated pipeline, trigger to output.

  • TriggerFile added or updated in Dropbox folderDropboxDropbox
  • ActionDownload and extract document textDropboxDropbox
  • LogicChunk into clauses and tag metadata
  • ActionEmbed chunks with OpenAIOpenAI
  • OutputUpsert vectors into Postgres storePostgreSQLPostgres

What it does

This is the ingestion pipeline that powers every contract-question bot. Drop a signed PDF or DOCX into your Dropbox agreements folder and it becomes searchable clause-by-clause within minutes, with metadata for document name, type, and effective date.

When to use it

Run this as the always-on backbone behind any MSA retrieval workflow. Use it whenever legal or ops adds, renews, or amends an agreement and you need the knowledge base to reflect it without a manual re-index.

How it works

  1. 1A file-added or file-modified event in the watched Dropbox folder triggers the flow.
  2. 2An action downloads the document and extracts its full text.
  3. 3A logic step splits the text into clause-sized chunks and tags each with source metadata (filename, contract type, party names).
  4. 4OpenAI generates an embedding for every chunk.
  5. 5The final step upserts the chunks and vectors into the Postgres store, replacing prior versions of the same document so stale clauses never resurface.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect DropboxFiles and folders.
  2. 2
    Connect OpenAIModels, embeddings, files.
  3. 3
    Connect PostgresAny Postgres URL — query, write, migrate.
  4. 4
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  5. 5
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  6. 6
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.