DATA OPS

BigQuery Slot/Cost Telemetry to Axiom with PagerDuty Escalation

Streams BigQuery job cost and slot-ms telemetry into Axiom on a tight interval and pages on-call via PagerDuty when hourly spend breaches a rolling budget guardrail.

CategoryData Ops
Enginesim
Difficultyadvanced
Triggerschedule
Steps5
Setup~25 min

How it runs

The automated pipeline, trigger to output.

  • TriggerEvery 5 minutes interval
  • ActionFetch BigQuery jobs since last checkpointGoogle BigQueryBigQuery
  • ActionForward cost + slot telemetry to AxiomAxiom
  • LogicProject rolling hourly spend vs budget guardrail
  • OutputTrigger PagerDuty incident on breachPagerDutyPagerDuty

What it does

Ships fine-grained BigQuery job telemetry — bytes billed, slot-ms, query labels, and reservation — into Axiom every few minutes so you get live dashboards and retention beyond INFORMATION_SCHEMA's window. On top of that stream it enforces a rolling hourly budget: if projected spend blows through the guardrail, it pages the data on-call through PagerDuty before the damage compounds across a full day.

When to use it

Use this when you run reservations or flat-rate slots and a single runaway query or backfill can torch your budget within hours. Daily digests are too slow; this is the real-time tripwire.

How it works

  1. 1An interval trigger fires every 5 minutes.
  2. 2BigQuery returns jobs completed since the last checkpoint with cost and slot fields.
  3. 3The events are forwarded into Axiom for dashboards and long-term retention.
  4. 4A logic step projects the rolling hourly spend and compares it to the budget guardrail.
  5. 5If breached, PagerDuty triggers an incident routed to the data on-call.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect BigQueryDatasets, queries, schemas.
  2. 2
    Connect AxiomLog streams, queries, dashboards.
  3. 3
    Connect PagerDutyIncidents, on-call, escalations.
  4. 4
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  5. 5
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  6. 6
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.