DATA OPS

BigQuery Backfill Missing Partitions After Query Failure

Detects which date partitions a failed scheduled query skipped, runs a bounded backfill for exactly that window, and reports the recovered row counts to Discord.

CategoryData Ops
Enginesim
Difficultyadvanced
Triggerevent
Steps5
Setup~25 min

How it runs

The automated pipeline, trigger to output.

  • TriggerScheduled query fails on partitioned loadGoogle BigQueryBigQuery
  • ActionCompute missing partition date windowGoogle BigQueryBigQuery
  • LogicWindow within auto-backfill cap?
  • ActionSchedule backfill for missing datesGoogle BigQueryBigQuery
  • OutputPost recovered row counts to DiscordDiscordDiscord

What it does

When a scheduled query that loads a partitioned table fails, the destination ends up with a gap. This workflow computes the missing date range by comparing expected run dates against partitions actually present, then issues a backfill scoped to only the gap — never a full-table reload — and confirms the recovered partitions.

When to use it

Use it for daily or hourly ingestion queries writing into date-partitioned tables, where a single missed run leaves a hole that downstream dashboards read as zero. It restores continuity without the cost and risk of reprocessing history you already have.

How it works

  1. 1A BigQuery run-failure event identifies the affected scheduled query and destination table.
  2. 2An action queries the destination's existing partitions and the transfer config's run history to find the missing date window.
  3. 3A logic guard caps the window (for example, refuses to backfill more than 14 days automatically) and routes anything larger to manual review.
  4. 4Within the cap, the workflow schedules a BigQuery backfill run for precisely the missing dates.
  5. 5After completion it reads recovered row counts per partition and posts a Discord summary table so on-call can confirm the gap is closed.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect BigQueryDatasets, queries, schemas.
  2. 2
    Connect DiscordCommunity channels + voice + bots.
  3. 3
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  4. 4
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  5. 5
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.