DATA OPS

Agentic Freshness Root-Cause Investigator

When a BigQuery model goes stale, an agent walks the lineage upstream to find the first broken node, classifies the likely cause.

CategoryData Ops
Enginepaperclip
Difficultyadvanced
Triggerevent
Steps6
Setup~25 min

How it runs

The automated pipeline, trigger to output.

  • TriggerFreshness-breach event
  • ActionPull load times + lineage graphGoogle BigQueryBigQuery
  • LogicWalk upstream to first broken node
  • ActionInspect root node errors + schemaGoogle BigQueryBigQuery
  • ActionFile Linear ticket at root causeLinearLinear
  • OutputPost root-cause summary to SlackSlack

What it does

When a downstream model breaches freshness, a naive alert blames that model. This agent instead traverses the dbt lineage graph upward, checking each ancestor's load time, until it finds the earliest node that actually stopped, then reasons about the cause (failed source ingest, upstream model error, or schema change) and files a Linear ticket against the real root with its evidence chain.

When to use it

Use it in warehouses with deep dependency chains where the visibly-stale table is rarely the actual culprit, and analysts waste time chasing symptoms. The agent saves the manual lineage walk.

How it works

  1. 1A freshness-breach event triggers the investigation.
  2. 2The agent queries BigQuery load times and pulls the dbt lineage graph for the stale model.
  3. 3It walks ancestors upstream to locate the earliest broken node.
  4. 4It inspects that node's recent BigQuery job errors and schema-change history to classify the cause.
  5. 5It files a Linear ticket against the root node's owner with the lineage path and evidence.
  6. 6It posts a plain-language root-cause summary to Slack for the responder.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect BigQueryDatasets, queries, schemas.
  2. 2
    Connect LinearIssues, projects, cycles, triage.
  3. 3
    Connect SlackChannels, DMs, threads, mentions.
  4. 4
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  5. 5
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  6. 6
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.