DATA OPS

Weekly Postgres PII Drift Audit Report

Once a week, an agent crawls your Postgres analytics database for unclassified columns, samples and classifies them for PII.

CategoryData Ops
EngineSim + Paperclip
Difficultyadvanced
Triggerschedule
Steps6
Setup~25 min

How it runs

The automated pipeline, trigger to output.

  • TriggerWeekly schedule
  • ActionEnumerate all columns and diff vs. classification registryPostgreSQLPostgres
  • ActionSample and classify unclassified columnsOpenAI
  • LogicAssemble coverage stats and new findings
  • ActionWrite governance report to Notion pageNotionNotion
  • OutputPush PII-coverage gauge and metrics to DatadogDatadogDatadog

What it does

Runs a weekly full-coverage audit: an agent enumerates every column in the Postgres analytics schema, compares against a known classification registry, samples and classifies the unclassified ones, and produces a governance report in Notion plus a PII-coverage metric in Datadog to track drift over time.

When to use it

Use it for a recurring compliance artifact — a defensible weekly record showing what fraction of warehouse columns are classified and which new ones slipped through.

How it works

  1. 1A weekly schedule starts the audit.
  2. 2The agent lists all columns from information_schema and diffs them against the existing classification registry.
  3. 3For each unclassified column it samples distinct values and classifies the PII type and confidence.
  4. 4It assembles a structured report: total columns, classified percentage, and the new sensitive findings.
  5. 5The report is written to a Notion governance page.
  6. 6A coverage gauge and new-PII count are pushed to Datadog for trend dashboards and alerts.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect PostgresAny Postgres URL — query, write, migrate.
  2. 2
    Connect OpenAIModels, embeddings, files.
  3. 3
    Connect NotionPages, databases, comments.
  4. 4
    Connect DatadogMetrics, traces, log search.
  5. 5
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  6. 6
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  7. 7
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.