BigQuery Source Freshness Breach to PagerDuty On-Call Escalation

Watches BigQuery source-load freshness against tiered SLAs and, when a critical source breaches, opens a PagerDuty incident routed to the data on-call while logging a warning…

Category: Data Ops
Engine: sim
Difficulty: intermediate
Trigger: schedule
Steps: 5
Setup: ~15 min

How it runs

The automated pipeline, trigger to output.

  • Trigger: Hourly schedule
  • Action: Read latest load time per source from BigQuery
  • Logic: Classify lateness as none / soft / hard by tier
  • Action: Open PagerDuty incident for hard critical breaches
  • Output: Post freshness summary + soft warnings to Slack
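The read step in the pipeline above could be backed by a query along these lines. This is a minimal sketch, not the workflow's actual query: it assumes the tracked sources live in one dataset and that the `last_modified_time` column of BigQuery's `INFORMATION_SCHEMA.PARTITIONS` view is an acceptable proxy for "latest load time" (it reflects any modification, not only loads). The function name `build_freshness_query` and the `@tracked_tables` query parameter are hypothetical.

```python
def build_freshness_query(project: str, dataset: str) -> str:
    """Return SQL yielding one row per tracked table with its newest partition change.

    Assumption: `@tracked_tables` is an ARRAY<STRING> query parameter supplied
    by whatever client runs the query.
    """
    return (
        "SELECT table_name, MAX(last_modified_time) AS last_loaded_at\n"
        f"FROM `{project}.{dataset}.INFORMATION_SCHEMA.PARTITIONS`\n"
        "WHERE table_name IN UNNEST(@tracked_tables)\n"
        "GROUP BY table_name"
    )


if __name__ == "__main__":
    # Hypothetical project and dataset names, for illustration only.
    print(build_freshness_query("my-project", "raw_sources"))
```

Passing the table list as a parameter rather than interpolating it keeps the tracked-source list out of the SQL text, so it can change without editing the query.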

What it does

This workflow tracks freshness for BigQuery source tables and applies tiered SLAs: soft breaches get a logged warning, while hard breaches on business-critical sources page the on-call engineer through PagerDuty. It turns silent late loads into an alert with the right urgency instead of a dashboard nobody is watching at 2am.
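One way to picture the tiered SLA logic described above is the sketch below. The tier names, the threshold values, and the helpers `classify` and `should_page` are illustrative assumptions, not the workflow's actual configuration; substitute your own tiers and SLAs.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone


@dataclass(frozen=True)
class Sla:
    soft: timedelta  # lateness beyond this is a warning
    hard: timedelta  # lateness beyond this is a hard breach


# Hypothetical tiers and thresholds, for illustration only.
SLA_BY_TIER = {
    "critical": Sla(soft=timedelta(hours=1), hard=timedelta(hours=2)),
    "standard": Sla(soft=timedelta(hours=6), hard=timedelta(hours=24)),
}


def classify(last_loaded_at: datetime, tier: str, now: datetime) -> str:
    """Return 'none', 'soft', or 'hard' for a source's data age against its tier SLA."""
    age = now - last_loaded_at
    sla = SLA_BY_TIER[tier]
    if age > sla.hard:
        return "hard"
    if age > sla.soft:
        return "soft"
    return "none"


def should_page(level: str, tier: str) -> bool:
    """Only hard breaches on critical sources page the on-call."""
    return level == "hard" and tier == "critical"


if __name__ == "__main__":
    now = datetime.now(timezone.utc)
    level = classify(now - timedelta(hours=3), "critical", now)
    print(level, should_page(level, "critical"))
```

Keeping the paging decision in its own function makes the rule explicit: a hard breach on a non-critical source is still reported, but it does not wake anyone.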

When to use it

Use it when some data sources are mission-critical (revenue, billing, fraud) and a late load there warrants waking someone, while others only need a heads-up. Ideal for teams already running PagerDuty for on-call.

How it works

  1. A schedule runs hourly.
  2. Query BigQuery INFORMATION_SCHEMA for the latest partition/load time per tracked source.
  3. A logic step classifies each lateness as none, soft, or hard against the source's tier and SLA.
  4. For hard breaches on critical sources, open a PagerDuty incident with the source, lateness, and downstream impact.
  5. Post a consolidated freshness summary to Slack covering soft warnings and resolved sources.
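The last two steps above can be sketched as payload builders. This assumes the incident is opened through the PagerDuty Events API v2 (a `trigger` event posted to `https://events.pagerduty.com/v2/enqueue`) and that the summary goes to Slack as a simple `text` message; the workflow's own connectors may work differently. The function names, the `dedup_key` scheme, and the sample source names are hypothetical.

```python
import json


def build_pagerduty_event(routing_key: str, source: str,
                          lateness_minutes: int, downstream: list) -> dict:
    """Build an Events API v2 trigger event for one hard breach on a critical source."""
    return {
        "routing_key": routing_key,
        "event_action": "trigger",
        # Assumption: one open incident per source, so repeat runs deduplicate.
        "dedup_key": f"freshness-{source}",
        "payload": {
            "summary": f"Freshness breach: {source} is {lateness_minutes} min late",
            "source": source,
            "severity": "critical",
            "custom_details": {
                "lateness_minutes": lateness_minutes,
                "downstream_impact": downstream,
            },
        },
    }


def build_slack_summary(soft_warnings: dict, resolved: list) -> dict:
    """Build a single Slack message covering soft warnings and resolved sources."""
    lines = ["Freshness summary"]
    for source, minutes in sorted(soft_warnings.items()):
        lines.append(f"- warning: {source} is {minutes} min late")
    for source in sorted(resolved):
        lines.append(f"- resolved: {source}")
    return {"text": "\n".join(lines)}


if __name__ == "__main__":
    event = build_pagerduty_event("YOUR_ROUTING_KEY", "billing.invoices",
                                  135, ["revenue_dashboard"])
    print(json.dumps(event, indent=2))
    print(build_slack_summary({"crm.contacts": 400}, ["fraud.scores"])["text"])
```

Using a stable `dedup_key` per source means an hourly schedule that sees the same breach twice updates the existing alert rather than opening a second one.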

Set it up

What you configure once, before turning it on.

  1. Connect BigQuery: datasets, queries, schemas.
  2. Connect PagerDuty: incidents, on-call, escalations.
  3. Connect Slack: channels, DMs, threads, mentions.
  4. Set each agent's model: we leave models unset so you can pick the tier (fast + cheap, or top-quality).
  5. Tune it to your data: edit the prompts, filters, and field mappings so it matches how your team works.
  6. Test, then turn it on: run once against a sample, confirm the output, then enable the trigger.
