DOCUMENT OPS
Daily Dropbox form batch extraction to BigQuery
On a daily schedule, this workflow processes every new scanned form that has accumulated in a Dropbox folder, extracts structured fields, and appends the validated rows to a BigQuery table for analytics.
How it runs
The automated pipeline, trigger to output.
- Trigger: Daily schedule fires the batch run
- Action: List and download new Dropbox forms since last run
- Action: Extract fields from each form via Hugging Face
- Logic: Validate records against expected schema
- Output: Append valid rows to BigQuery intake table
What it does
Once a day this workflow sweeps a Dropbox folder for forms received since the last run, extracts each one's fields, validates them against the expected schema, and loads the clean rows into a BigQuery table so the day's intake is queryable for reporting.
When to use it
Use it when you do not need per-document real-time processing but want a reliable nightly batch that turns a pile of scanned forms into analytics-ready rows in your warehouse.
How it works
- 1. A daily schedule trigger fires the run.
- 2. The workflow lists files added to the Dropbox folder since the previous run and downloads each one.
- 3. A Hugging Face document model extracts the fields from every form.
- 4. A logic step validates each record against the expected field set and types, separating valid rows from malformed ones.
- 5. Valid rows are appended to the BigQuery intake table in a single load.
- 6. The count of processed, valid, and skipped forms is logged as the run summary.
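The selection, validation, and summary steps above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the workflow's actual implementation: the schema fields (`form_id`, `applicant_name`, `amount`), the `modified` key on each file entry, and every function name here are hypothetical. The Dropbox listing, the Hugging Face extraction, and the BigQuery load are left out on purpose.

```python
from dataclasses import dataclass
from datetime import datetime, timezone

# Hypothetical expected schema: field name -> required Python type.
EXPECTED_SCHEMA = {"form_id": str, "applicant_name": str, "amount": float}


@dataclass
class RunSummary:
    processed: int
    valid: int
    skipped: int


def select_new(files, last_run):
    """Keep file entries modified after the previous run (step 2)."""
    return [f for f in files if f["modified"] > last_run]


def is_valid(record, schema=EXPECTED_SCHEMA):
    """True when the record has exactly the expected fields and types (step 4).

    The type check is strict: an integer amount such as 12 is rejected
    because it is not a float.
    """
    if set(record) != set(schema):
        return False
    return all(isinstance(record[name], typ) for name, typ in schema.items())


def summarize(records):
    """Split records into valid rows and a run summary (steps 4 and 6)."""
    valid = [r for r in records if is_valid(r)]
    summary = RunSummary(
        processed=len(records),
        valid=len(valid),
        skipped=len(records) - len(valid),
    )
    return valid, summary


if __name__ == "__main__":
    last_run = datetime(2024, 1, 1, tzinfo=timezone.utc)
    files = [
        {"name": "old.pdf", "modified": datetime(2023, 12, 31, tzinfo=timezone.utc)},
        {"name": "new.pdf", "modified": datetime(2024, 1, 2, tzinfo=timezone.utc)},
    ]
    print([f["name"] for f in select_new(files, last_run)])

    records = [
        {"form_id": "F1", "applicant_name": "Ada", "amount": 12.5},
        {"form_id": "F2", "applicant_name": "Bo"},  # missing field
        {"form_id": "F3", "applicant_name": "Cy", "amount": "n/a"},  # wrong type
    ]
    valid_rows, summary = summarize(records)
    print(summary)
```

Only `valid_rows` would go on to the BigQuery append, so a malformed form lowers the valid count in the summary without blocking the rest of the batch.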
Set it up
What you configure once, before turning it on.
- 1. Connect Dropbox: Files and folders.
- 2. Connect Hugging Face: Models, datasets, spaces (the open-source hub).
- 3. Connect BigQuery: Datasets, queries, schemas.
- 4. Set each agent's model: We leave models unset so you pick the tier, either fast and cheap or top-quality.
- 5. Tune it to your data: Edit the prompts, filters, and field mappings so it matches how your team works.
- 6. Test, then turn it on: Run once against a sample, confirm the output, then enable the trigger.
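As an illustration of what a field mapping might look like when tuning the workflow to your data, here is a small Python sketch. Everything in it is an assumption made for the example: the form labels, the column names, the converters, and the `map_fields` helper are hypothetical and do not describe the product's configuration format.

```python
# Hypothetical mapping from labels as they appear on the form
# to column names in the BigQuery intake table.
FIELD_MAP = {
    "Form No.": "form_id",
    "Applicant Name": "applicant_name",
    "Amount": "amount",
}

# Hypothetical per-column converters; unmapped columns stay as strings.
CONVERTERS = {
    "amount": lambda v: float(str(v).replace(",", "")),
}


def map_fields(extracted, field_map=FIELD_MAP, converters=CONVERTERS):
    """Rename extracted labels to table columns and convert their values."""
    row = {}
    for label, value in extracted.items():
        column = field_map.get(label.strip())
        if column is None:
            continue  # drop labels that have no mapping
        convert = converters.get(column, str)
        row[column] = convert(value)
    return row


if __name__ == "__main__":
    sample = {"Form No.": "F1", " Amount ": "1,200.50", "Notes": "handwritten"}
    print(map_fields(sample))
```

Running a mapping like this once against a sample form is a quick way to confirm the output before enabling the trigger.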
More Document Ops workflows
Narrate new Dropbox PDFs into MP3 audio versions
When a PDF lands in a watched Dropbox folder, extract its text and generate an ElevenLabs voice narration.
On-demand PDF narration via webhook with emailed audio link
Accepts a PDF URL through a webhook, generates an ElevenLabs narration with the requested voice, stores the MP3, and emails the requester a download link.
Triage emailed contract redlines and route by risk
When a counterparty emails a redlined contract, extracts the attachment, diffs clauses against approved templates.
Batch-narrate a Google Drive PDF folder in multiple languages
On a schedule, finds PDFs in a Google Drive folder that lack audio, then generates ElevenLabs narrations in each configured language and files them into per-language subfolders…
Executed Contract Exhibit & Initials Completeness Gate
When a signed contract lands in a Dropbox intake folder, verify every required exhibit, schedule, and initialed page is present.
Draft a negotiation brief from contract clause deviations
An agent reviews a contract against approved templates, researches each deviation.
Run it inside a business
This workflow drops into a full company template. Import the org, and this is one of the playbooks its agents run.

