DATA OPS

BigQuery Scheduled-Query Cost Regression Sentinel

Every morning, checks each BigQuery scheduled query's bytes-billed against its 14-day baseline and posts a ranked Slack digest of the queries whose cost regressed.

CategoryData Ops
Enginesim
Difficultyintermediate
Triggerschedule
Steps5
Setup~15 min

How it runs

The automated pipeline, trigger to output.

  • TriggerDaily 7am schedule
  • ActionQuery JOBS for per-query bytes billed + 14d baselineGoogle BigQueryBigQuery
  • LogicCompute regression %, keep queries over threshold
  • ActionAttribute each flagged query to its last editorGoogle BigQueryBigQuery
  • OutputPost ranked cost-regression digest to SlackSlack

What it does

Runs a daily sweep over `INFORMATION_SCHEMA.JOBS` to compute yesterday's bytes-billed for every scheduled query, compares each against its trailing 14-day median, and flags the ones that jumped past a regression threshold. The result is one tidy Slack digest ranked by dollars of waste, with the responsible owner @-mentioned, so cost creep gets caught the same day it starts instead of at month-end invoice.

When to use it

Use this when your team relies on dozens of scheduled queries and your BigQuery bill drifts up quietly. It's the early-warning layer that turns a surprise invoice into a Monday-morning fix.

How it works

  1. 1A 7am schedule fires the sentinel.
  2. 2BigQuery query rolls up per-query bytes-billed for yesterday plus the 14-day median baseline.
  3. 3A logic step computes the percent regression and keeps only queries above the threshold (e.g. +40% and >100 GB).
  4. 4An owner-attribution step joins each flagged query to its last-modifier from the transfer-config metadata.
  5. 5Slack posts the ranked digest with estimated cost delta and owner mentions.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect BigQueryDatasets, queries, schemas.
  2. 2
    Connect SlackChannels, DMs, threads, mentions.
  3. 3
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  4. 4
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  5. 5
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.