DOCUMENT OPS
Verified form extraction with S3 archive and Postgres index
Extracts fields from scanned forms in Dropbox, archives the original to S3 only after the data is verified into a Postgres index.
How it runs
The automated pipeline, trigger to output.
- TriggerNew scanned form in DropboxDropbox
- ActionDownload file from DropboxDropbox
- ActionExtract fields and confidence via Hugging FaceHugging Face
- LogicVerify required fields and confidence
- ActionInsert verified record into Postgres indexPostgres
- OutputArchive original to S3 and stamp archive keyAWS S3
What it does
This workflow couples extraction with durable archival: it pulls fields from a scanned form, indexes the structured record in Postgres, and only once that write succeeds does it copy the original document to an S3 archive bucket and back-link the archive path onto the record.
When to use it
Use it when scanned documents must be retained for compliance or audit and you need a guarantee that nothing is archived as processed unless its data actually made it into the system of record.
How it works
- 1A new scanned form in Dropbox triggers the run.
- 2The file is downloaded from Dropbox.
- 3A Hugging Face model extracts the structured fields with confidence scores.
- 4A logic step confirms the required fields are present and meet the confidence threshold.
- 5The verified record is inserted into the Postgres document index.
- 6On a successful insert, the original file is uploaded to the S3 archive and the record is updated with the resulting archive key.
Set it up
What you configure once, before turning it on.
- 1Connect DropboxFiles and folders.
- 2Connect Hugging FaceModels, datasets, spaces — the open-source hub.
- 3Connect PostgresAny Postgres URL — query, write, migrate.
- 4Connect AWS S3Buckets, objects, signed URLs.
- 5Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
- 6Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
- 7Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.
More Document Ops workflows
Narrate new Dropbox PDFs into MP3 audio versions
When a PDF lands in a watched Dropbox folder, extract its text and generate an ElevenLabs voice narration.
On-demand PDF narration via webhook with emailed audio link
Accepts a PDF URL through a webhook, generates an ElevenLabs narration with the requested voice, stores the MP3, and emails the requester a download link.
Triage emailed contract redlines and route by risk
When a counterparty emails a redlined contract, extracts the attachment, diffs clauses against approved templates.
Batch-narrate a Google Drive PDF folder in multiple languages
On a schedule, finds PDFs in a Google Drive folder that lack audio, then generates ElevenLabs narrations in each configured language and files them into per-language subfolders…
Executed Contract Exhibit & Initials Completeness Gate
When a signed contract lands in a Dropbox intake folder, verify every required exhibit, schedule, and initialed page is present.
Draft a negotiation brief from contract clause deviations
An agent reviews a contract against approved templates, researches each deviation.
Run it inside a business
This workflow drops into a full company template. Import the org, and this is one of the playbooks its agents run.

Run this workflow in your colony.
14-day trial. No DevOps. No Sales call. Provisioned in under a minute.
