DATA OPS
Weekly cross-warehouse PII exposure digest
Samples new columns across both Snowflake and BigQuery each week and compiles a single ranked Notion report of unmasked-PII exposure by team and table, with a Slack summary.
How it runs
The automated pipeline, trigger to output.
- TriggerWeekly schedule
- ActionSample new Snowflake columnsSnowflake
- ActionSample new BigQuery columnsBigQuery
- LogicClassify and rank exposures by team and table
- ActionWrite ranked report to NotionNotion
- OutputPost top-exposures summary to SlackSlack
What it does
Once a week it sweeps new and changed columns in both Snowflake and BigQuery, samples and classifies them, and rolls every finding into one prioritized digest: which tables hold unmasked PII, which categories, and which owning team. It writes the full report to Notion and drops a short summary with the top exposures into Slack. This is a reporting workflow, not an enforcement one, so it never revokes access.
When to use it
Use it for governance and compliance reviews when you want a recurring, auditable picture of PII exposure across multiple warehouses without locking anything automatically.
How it works
- 1A weekly schedule triggers the sweep.
- 2Sample new columns from Snowflake.
- 3Sample new columns from BigQuery.
- 4Classify all samples and rank findings by sensitivity and row reach.
- 5Write the full ranked report to a Notion page.
- 6Post a top-exposures summary with the Notion link to Slack.
Set it up
What you configure once, before turning it on.
- 1Connect SnowflakeWarehouses, queries, shares.
- 2Connect BigQueryDatasets, queries, schemas.
- 3Connect NotionPages, databases, comments.
- 4Connect SlackChannels, DMs, threads, mentions.
- 5Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
- 6Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
- 7Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.
More Data Ops workflows
Snowflake column type-drift sentinel with Linear fix ticket
Snapshots the data types of every column in your tracked Snowflake schemas on a schedule, diffs against the last snapshot.
Daily BigQuery Scheduled-Query Cost Attribution to Owners
Each morning, totals the prior day's on-demand bytes-billed per scheduled query, maps each query to its owner from a label, and posts a per-owner cost leaderboard to Slack.
BigQuery dropped/renamed column sentinel with PagerDuty incident
Detects when a column is dropped or renamed in your governed BigQuery datasets and, because that breaks downstream queries hard, pages the on-call via PagerDuty and posts…
PR-time Snowflake schema contract check on dbt model changes
When a pull request changes a dbt model, it compares the model's declared output columns against the live Snowflake table it will replace and blocks the merge with a GitHub check…
Agent-triaged warehouse drift with impact analysis and runbook update
On a webhook from your warehouse audit log, an agent investigates the changed column, traces which downstream models and dashboards depend on it.
Cross-warehouse replication schema mismatch reconciler
Compares the column shape of mirrored tables between BigQuery and Snowflake and, when a replicated table has drifted out of sync between the two, opens an Asana task for the data…
Run it inside a business
This workflow drops into a full company template. Import the org, and this is one of the playbooks its agents run.

Run this workflow in your colony.
14-day trial. No DevOps. No Sales call. Provisioned in under a minute.
