DOCUMENT OPS
Extract Invoice Fields from Dropbox PDFs into Supabase
Watches a Dropbox folder for new PDF invoices, pulls structured fields with an OpenAI vision model, and writes clean rows to Supabase.
How it runs
The automated pipeline, trigger to output.
- TriggerNew PDF added to Dropbox folderDropbox
- ActionDownload PDF file contentsDropbox
- ActionExtract invoice fields with OpenAI structured outputOpenAI
- LogicValidate and normalize fields, route failures to review
- OutputInsert clean row into Supabase invoices tableSupabase
What it does
This workflow turns a Dropbox folder into a hands-off invoice intake pipeline. Whenever a new PDF lands in the watched folder, it downloads the file, sends it to an OpenAI model with structured-output instructions, and parses out the fields you care about — vendor name, invoice number, issue and due dates, line items, subtotal, tax, and total. The validated result is inserted as a typed row in a Supabase table, ready for reporting or reconciliation. Files that fail extraction are flagged rather than silently dropped.
When to use it
Reach for this when finance or ops receives invoices as PDFs and someone is currently retyping them into a spreadsheet or ERP. It is ideal for accounts-payable teams, bookkeeping firms handling many clients, or any back office that needs every incoming document captured as queryable data within seconds of arrival. It also works well as the first stage of a larger approval or payment flow, since the Supabase row becomes the single source of truth other automations can build on.
How it works
The workflow is triggered by Dropbox's file-added event on a specific folder. The new file is downloaded as binary, then passed to OpenAI with a JSON schema describing the invoice shape, so the model returns strict structured output instead of free text. A logic step validates required fields (invoice number and total must be present and numeric) and normalizes dates and currency. Valid records are inserted into a Supabase `invoices` table via the service-role client; records that fail validation are written to a `invoices_review` table with the raw extraction so a human can correct them. Because it runs on Hive's Sim engine, each colony processes its own documents in isolation with no shared state.
Set it up
What you configure once, before turning it on.
- 1Connect DropboxFiles and folders.
- 2Connect OpenAIModels, embeddings, files.
- 3Connect SupabaseTables, auth, storage, edge functions.
- 4Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
- 5Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
- 6Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.
More Document Ops workflows
Narrate new Dropbox PDFs into MP3 audio versions
When a PDF lands in a watched Dropbox folder, extract its text and generate an ElevenLabs voice narration.
On-demand PDF narration via webhook with emailed audio link
Accepts a PDF URL through a webhook, generates an ElevenLabs narration with the requested voice, stores the MP3, and emails the requester a download link.
Triage emailed contract redlines and route by risk
When a counterparty emails a redlined contract, extracts the attachment, diffs clauses against approved templates.
Batch-narrate a Google Drive PDF folder in multiple languages
On a schedule, finds PDFs in a Google Drive folder that lack audio, then generates ElevenLabs narrations in each configured language and files them into per-language subfolders…
Executed Contract Exhibit & Initials Completeness Gate
When a signed contract lands in a Dropbox intake folder, verify every required exhibit, schedule, and initialed page is present.
Draft a negotiation brief from contract clause deviations
An agent reviews a contract against approved templates, researches each deviation.
Run it inside a business
This workflow drops into a full company template. Import the org, and this is one of the playbooks its agents run.

Run this workflow in your colony.
14-day trial. No DevOps. No Sales call. Provisioned in under a minute.
