DATA OPS

Snowflake Untagged-PII PagerDuty Escalation

Watches Snowflake for new columns missing a sensitivity tag, classifies them, and escalates high-confidence PII to PagerDuty as an incident while logging everything else…

CategoryData Ops
Enginesim
Difficultyadvanced
Triggerschedule
Steps7
Setup~25 min

How it runs

The automated pipeline, trigger to output.

  • TriggerEvery-3-hours schedule
  • ActionFind recent untagged columns in ACCOUNT_USAGESnowflakeSnowflake
  • ActionSample distinct values per columnSnowflakeSnowflake
  • ActionClassify PII type and confidenceOpenAI
  • LogicBranch by severity (high vs. lower confidence)
  • ActionOpen PagerDuty incident for high-confidence PIIPagerDutyPagerDuty
  • OutputAppend lower-confidence hits to Notion review queueNotionNotion

What it does

Detects newly added Snowflake columns that lack a sensitivity tag, classifies their likely PII content, and routes the result by severity: high-confidence sensitive columns open a PagerDuty incident, while lower-confidence ones are logged to a Notion review queue.

When to use it

Use it when unmasked PII in Snowflake is a compliance-grade risk that needs a paged on-call owner, not just a Slack ping that gets buried over a weekend.

How it works

  1. 1A schedule runs the check every few hours.
  2. 2Query ACCOUNT_USAGE.COLUMNS joined against tag references to find recently created columns with no sensitivity tag.
  3. 3Sample distinct values for each untagged column.
  4. 4Classify each for PII type and confidence.
  5. 5A severity branch splits high-confidence PII from the rest.
  6. 6High-confidence hits open a PagerDuty incident with the column path and evidence.
  7. 7Everything else appends a row to a Notion database for the next governance review.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect SnowflakeWarehouses, queries, shares.
  2. 2
    Connect OpenAIModels, embeddings, files.
  3. 3
    Connect PagerDutyIncidents, on-call, escalations.
  4. 4
    Connect NotionPages, databases, comments.
  5. 5
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  6. 6
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  7. 7
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.