DATA OPS

Escalate critical BigQuery SLA breaches to PagerDuty

Fires when a freshness-breach event arrives by webhook, and for tier-1 tables that are past a hard deadline it opens or updates a PagerDuty incident with the table, lateness…

Category: Data Ops
Engine: sim
Difficulty: intermediate
Trigger: webhook
Steps: 5
Setup: ~15 min

How it runs

The automated pipeline, trigger to output.

  • Trigger: Webhook receives freshness-breach event (HTTP webhook)
  • Action: Resolve downstream view blast radius (Google BigQuery)
  • Logic: Check table tier and hard-deadline threshold
  • Action: Open or update deduped PagerDuty incident (PagerDuty)
  • Output: Link incident in on-call Slack channel (Slack)
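
The blast-radius step can be pictured as a graph walk from the breached table to every view that reads from it, directly or through other views. The sketch below is a minimal illustration, not the template's actual query: it assumes a hypothetical `VIEW_SOURCES` map from each view to the objects it selects from, and all table and view names are made up. How that map would be derived from BigQuery is not stated in this template.

```python
from collections import deque

# Hypothetical dependency map: view -> objects it selects from directly.
# In practice this might come from view definitions or a lineage tool.
VIEW_SOURCES = {
    "analytics.orders_daily": {"raw.orders"},
    "analytics.revenue_summary": {"analytics.orders_daily", "raw.refunds"},
    "analytics.customer_ltv": {"analytics.revenue_summary"},
    "analytics.inventory": {"raw.stock"},
}


def downstream_views(table, view_sources):
    """Return every view that depends on `table`, directly or transitively."""
    # Invert the map: source -> views that read from it.
    readers = {}
    for view, sources in view_sources.items():
        for source in sources:
            readers.setdefault(source, set()).add(view)

    # Breadth-first walk; `seen` also protects against cyclic definitions.
    seen = set()
    queue = deque([table])
    while queue:
        current = queue.popleft()
        for view in readers.get(current, ()):
            if view not in seen:
                seen.add(view)
                queue.append(view)
    return sorted(seen)
```

Calling `downstream_views("raw.orders", VIEW_SOURCES)` returns the three views that sit downstream of `raw.orders`, which is the list an incident would be enriched with.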

What it does

It receives breach events from the watchdog and decides which ones deserve a page. Low-tier tables are logged and ignored; tier-1 tables that exceed a hard SLA threshold open a PagerDuty incident enriched with the affected downstream views and the number of minutes late. A per-table dedup key ensures that repeated breach pings update the existing incident instead of paging again.
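
The routing rule described above can be sketched in a few lines. This is an illustration under stated assumptions: the `BreachEvent` field names, the `bq-freshness:` key prefix, and the choice of a strict `>` comparison are all hypothetical, since the watchdog's payload and the exact threshold semantics are not specified here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BreachEvent:
    # All field names are hypothetical; the watchdog's real payload may differ.
    table: str
    tier: int
    minutes_late: int
    hard_deadline_minutes: int  # assumed: lateness beyond which a page is warranted


def should_page(event: BreachEvent) -> bool:
    """Page only for tier-1 tables that are past their hard deadline."""
    return event.tier == 1 and event.minutes_late > event.hard_deadline_minutes


def dedup_key(event: BreachEvent) -> str:
    """Same table -> same key, so repeated pings map to one incident."""
    return f"bq-freshness:{event.table}"


def route(event: BreachEvent) -> str:
    """Return 'page' for qualifying breaches, 'log' for everything else."""
    return "page" if should_page(event) else "log"
```

Because the key depends only on the table name, a second breach ping for the same table yields the same key, which is what lets the incident step update rather than re-page.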

When to use it

Use it when a stale critical dataset is an actual on-call event, not just a Slack note. It keeps the pager quiet for non-critical lateness while guaranteeing that tier-1 data outages wake someone.

How it works

  1. A webhook trigger receives a freshness-breach event from the watchdog.
  2. A BigQuery action resolves the downstream view blast radius for the breached table.
  3. A logic step checks the table's tier and whether it crossed the hard deadline.
  4. For qualifying breaches, a PagerDuty action opens or updates an incident using a per-table dedup key.
  5. A Slack message links the incident in the on-call channel as the final notification.
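
For steps 4 and 5, one plausible shape of the outgoing data is sketched below. It assumes the incident is raised through the PagerDuty Events API v2, where a repeated `trigger` event with the same `dedup_key` is attached to the existing open alert instead of creating a new one; the template's PagerDuty connector may work differently. The function names, the `bq-freshness:` key prefix, the summary wording, and the `incident_url` argument are hypothetical.

```python
import json


def build_pagerduty_event(routing_key, table, minutes_late, downstream_views):
    """Build an Events API v2 'trigger' body for a breached table (not sent here)."""
    return {
        "routing_key": routing_key,  # integration key of the PagerDuty service
        "event_action": "trigger",
        # Per-table key: repeat triggers update the open alert instead of re-paging.
        "dedup_key": f"bq-freshness:{table}",
        "payload": {
            "summary": f"Tier-1 table {table} is {minutes_late} min late",
            "source": table,
            "severity": "critical",
            "custom_details": {
                "minutes_late": minutes_late,
                "downstream_views": downstream_views,
            },
        },
    }


def build_slack_text(table, minutes_late, incident_url):
    """Format the on-call channel message using Slack's <url|label> link syntax."""
    return f"<{incident_url}|PagerDuty incident> for `{table}` ({minutes_late} min late)"


if __name__ == "__main__":
    event = build_pagerduty_event(
        "YOUR_ROUTING_KEY", "proj.ds.orders", 95, ["analytics.orders_daily"]
    )
    print(json.dumps(event, indent=2))
```

The sketch only builds the request body; sending it would mean posting the JSON to the Events API v2 enqueue endpoint with a real routing key.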

Set it up

What you configure once, before turning it on.

  1. Connect HTTP webhook: Trigger any URL on agent actions.
  2. Connect BigQuery: Datasets, queries, schemas.
  3. Connect PagerDuty: Incidents, on-call, escalations.
  4. Connect Slack: Channels, DMs, threads, mentions.
  5. Set each agent's model: We leave models unset so you pick the tier, either fast and cheap or top-quality.
  6. Tune it to your data: Edit the prompts, filters, and field mappings so it matches how your team works.
  7. Test, then turn it on: Run once against a sample, confirm the output, then enable the trigger.
