DATA OPS

Continuous prod schema baseline with PagerDuty alert on unexpected drift

Stores a known-good production schema baseline and re-checks production against it on a tight schedule.

CategoryData Ops
Enginesim
Difficultyadvanced
Triggerschedule
Steps5
Setup~25 min

How it runs

The automated pipeline, trigger to output.

  • TriggerFrequent schedule
  • ActionRead live production schemaPostgreSQLPostgres
  • ActionLoad stored schema baselinePostgreSQLPostgres
  • LogicDiff vs baseline; classify expected vs out-of-band
  • OutputPage on-call via PagerDuty on unexpected driftPagerDutyPagerDuty

What it does

This workflow treats your production schema as state that should only change through deploys. It keeps a stored baseline snapshot and, on a frequent schedule, compares live production against it. Any difference that appears outside a recognized deploy window is treated as an out-of-band change and pages on-call through PagerDuty.

When to use it

Use it to catch unauthorized or accidental schema edits made directly against production — a contractor running an `ALTER TABLE` in psql, an emergency hotfix nobody captured as a migration. These are exactly the changes that break the next real migration.

How it works

  1. 1A frequent schedule triggers the check.
  2. 2Read the live production schema catalog from Postgres.
  3. 3Load the stored baseline snapshot for comparison.
  4. 4Diff live against baseline.
  5. 5Branch: if a deploy window is active or the diff is empty, refresh the baseline and exit; otherwise treat it as unexpected drift.
  6. 6Page on-call via PagerDuty with the offending objects and the exact DDL change.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect PostgresAny Postgres URL — query, write, migrate.
  2. 2
    Connect PagerDutyIncidents, on-call, escalations.
  3. 3
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  4. 4
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  5. 5
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.