CRM

Nightly Cross-CRM Duplicate Batch Scan and Queue Refill

Runs every night to pull the full company lists from both CRMs, block-and-match them into candidate duplicate pairs, and top up the Coda review queue with any new pairs found.

CategoryCRM
Enginesim
Difficultyintermediate
Triggerschedule
Steps6
Setup~15 min

How it runs

The automated pipeline, trigger to output.

  • TriggerNightly schedule
  • ActionExport active companies from Salesforce and HubSpotSalesforce
  • ActionBlock by domain and score candidate pairsOpenAI
  • LogicDrop pairs already in the Coda queue
  • ActionAppend new candidate pairs to CodaCodaCoda
  • OutputSend overnight scan digest to SlackSlack

What it does

Event-driven detection misses records created before the workflow was switched on. This nightly batch job reconciles the entire company population in Salesforce and HubSpot, finds duplicate candidate pairs across both, and adds any pairs not already in the review queue.

When to use it

Use it for the initial backfill right after an acquisition and as an ongoing safety net. It is the catch-up companion to real-time detection: anything created or imported in bulk gets swept in within 24 hours.

How it works

  1. 1A nightly schedule triggers the run.
  2. 2The flow exports active company records from both Salesforce and HubSpot.
  3. 3An OpenAI-assisted matching step blocks records by normalized domain, then scores within-block pairs to produce candidate duplicates.
  4. 4A logic step deduplicates against pairs already present in the Coda queue so reviewers never see repeats.
  5. 5New candidate pairs are appended to the Coda merge-review table.
  6. 6A Slack digest reports how many records were scanned and how many new pairs were queued overnight.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect SalesforceAccounts, opportunities, cases.
  2. 2
    Connect HubSpotCRM, deals, marketing, support.
  3. 3
    Connect OpenAIModels, embeddings, files.
  4. 4
    Connect CodaDocs, packs, automations.
  5. 5
    Connect SlackChannels, DMs, threads, mentions.
  6. 6
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  7. 7
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  8. 8
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.