DOCUMENT OPS
Nightly Dropbox PDF dedupe by content hash
Runs each night, hashes every PDF across the document folders, finds true duplicates (same content, different names), and moves the redundant copies to a quarantine folder…
How it runs
The automated pipeline, trigger to output.
- Trigger: Nightly schedule
- Action: List PDFs and compute content hashes (Dropbox)
- Logic: Group by hash, keep oldest as canonical (Postgres)
- Action: Move duplicate copies to quarantine folder (Dropbox)
- Output: Post dedupe summary to Slack (Slack)
What it does
Scans all PDFs under your Dropbox document tree on a schedule, computes a content hash for each, and detects files whose bytes are identical even when their names differ. It keeps the oldest copy as canonical and moves every duplicate into a `_duplicates/` quarantine folder, then posts a summary of what it reclaimed to Slack.
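To illustrate what "identical bytes, different names" means in practice, here is a minimal sketch of a content hash. The page does not say which hash function the workflow uses, so SHA-256 is an assumption, and `content_hash`, `CHUNK_SIZE`, and the sample file names are hypothetical.

```python
import hashlib
import io
from typing import BinaryIO

CHUNK_SIZE = 1024 * 1024  # assumed: read 1 MiB at a time so large PDFs are not loaded whole


def content_hash(stream: BinaryIO) -> str:
    """Return the SHA-256 hex digest of everything readable from the stream."""
    digest = hashlib.sha256()
    for chunk in iter(lambda: stream.read(CHUNK_SIZE), b""):
        digest.update(chunk)
    return digest.hexdigest()


# Hypothetical in-memory "files": two share the same bytes under different names.
files = {
    "invoice_2023.pdf": b"%PDF-1.7 sample body",
    "scan_0042.pdf": b"%PDF-1.7 sample body",
    "contract.pdf": b"%PDF-1.7 other body",
}
hashes = {name: content_hash(io.BytesIO(data)) for name, data in files.items()}
```

Because the hash depends only on the bytes, `invoice_2023.pdf` and `scan_0042.pdf` get the same digest, while `contract.pdf` gets a different one. File names and folders play no part in the comparison.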
When to use it
Use it when the same document gets re-scanned, re-emailed, or re-uploaded under different names and your folders are bloating with copies. Because duplicates are quarantined rather than deleted, the workflow is safe to run unattended: any file it moves can still be restored.
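One way to keep quarantined files easy to restore is to mirror each file's relative path under the quarantine folder. The sketch below assumes that layout; the page only names the `_duplicates/` folder, so the root location, `quarantine_path`, and the example paths are hypothetical.

```python
from pathlib import PurePosixPath

QUARANTINE_DIR = "_duplicates"  # folder name from the text; placing it under the root is assumed


def quarantine_path(root: str, file_path: str) -> str:
    """Map a file under root to the same relative path under root/_duplicates."""
    root_p = PurePosixPath(root)
    # relative_to raises ValueError if file_path is not inside root
    relative = PurePosixPath(file_path).relative_to(root_p)
    return str(root_p / QUARANTINE_DIR / relative)


destination = quarantine_path("/Documents", "/Documents/Invoices/scan_0042.pdf")
```

Keeping the original subfolder in the destination means two duplicates with the same file name in different folders do not collide, and restoring a file is a matter of moving it back to the mirrored location.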
How it works
1. A nightly schedule starts the run.
2. The flow lists every PDF in the watched folders and records each file's content hash in Postgres.
3. A grouping step finds hashes with more than one file and picks the earliest-created file as canonical.
4. Each non-canonical duplicate is moved to `_duplicates/` in Dropbox.
5. A Slack message summarizes how many duplicates were quarantined and the space reclaimed.
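The grouping and summary steps above can be sketched in plain Python. This is an illustration under stated assumptions, not the workflow's actual implementation: the real flow stores hashes in Postgres, and `PdfRecord`, its fields, `plan_dedupe`, `summarize`, and the message wording are all hypothetical.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class PdfRecord:
    """Hypothetical row shape: one record per PDF found during the scan."""
    path: str
    content_hash: str
    created_at: datetime
    size_bytes: int


def plan_dedupe(records):
    """Return (canonical_by_hash, duplicates) for hashes shared by 2+ files."""
    groups = defaultdict(list)
    for rec in records:
        groups[rec.content_hash].append(rec)

    canonical, duplicates = {}, []
    for digest, recs in groups.items():
        if len(recs) < 2:
            continue  # unique content, nothing to quarantine
        # Earliest-created file wins; path breaks ties deterministically (assumed rule).
        ordered = sorted(recs, key=lambda r: (r.created_at, r.path))
        canonical[digest] = ordered[0]
        duplicates.extend(ordered[1:])
    return canonical, duplicates


def summarize(duplicates):
    """Build a one-line summary; decimal megabytes are an assumed unit."""
    reclaimed = sum(r.size_bytes for r in duplicates)
    return (f"Quarantined {len(duplicates)} duplicate PDFs, "
            f"reclaimed {reclaimed / 1_000_000:.1f} MB")


records = [
    PdfRecord("/Documents/a.pdf", "h1", datetime(2023, 1, 5), 2_000_000),
    PdfRecord("/Documents/b.pdf", "h1", datetime(2024, 3, 1), 2_000_000),
    PdfRecord("/Documents/c.pdf", "h2", datetime(2024, 1, 1), 500_000),
    PdfRecord("/Documents/d.pdf", "h1", datetime(2023, 6, 1), 2_000_000),
]
canonical, duplicates = plan_dedupe(records)
message = summarize(duplicates)
```

In this sample, `a.pdf` is the oldest file with hash `h1`, so it stays in place and `d.pdf` and `b.pdf` are the candidates to move. `c.pdf` has a unique hash and is left alone.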
Set it up
What you configure once, before turning it on.
1. Connect Dropbox: Files and folders.
2. Connect Postgres: Any Postgres URL (query, write, migrate).
3. Connect Slack: Channels, DMs, threads, mentions.
4. Set each agent's model: We leave models unset so you pick the tier, either fast and cheap or top-quality.
5. Tune it to your data: Edit the prompts, filters, and field mappings so it matches how your team works.
6. Test, then turn it on: Run once against a sample, confirm the output, then enable the trigger.
More Document Ops workflows
Narrate new Dropbox PDFs into MP3 audio versions
When a PDF lands in a watched Dropbox folder, extract its text and generate an ElevenLabs voice narration.
On-demand PDF narration via webhook with emailed audio link
Accepts a PDF URL through a webhook, generates an ElevenLabs narration with the requested voice, stores the MP3, and emails the requester a download link.
Triage emailed contract redlines and route by risk
When a counterparty emails a redlined contract, extracts the attachment, diffs clauses against approved templates.
Batch-narrate a Google Drive PDF folder in multiple languages
On a schedule, finds PDFs in a Google Drive folder that lack audio, then generates ElevenLabs narrations in each configured language and files them into per-language subfolders…
Executed Contract Exhibit & Initials Completeness Gate
When a signed contract lands in a Dropbox intake folder, verify every required exhibit, schedule, and initialed page is present.
Draft a negotiation brief from contract clause deviations
An agent reviews a contract against approved templates, researches each deviation.
Run it inside a business
This workflow drops into a full company template. Import the org, and this is one of the playbooks its agents run.

