DATA OPS

Reverse-ETL Failure Webhook: Triage Severity and Page On-Call

Receives a sync-failure webhook from your reverse-ETL tool, classifies the failure by reject rate.

CategoryData Ops
Enginesim
Difficultyintermediate
Triggerwebhook
Steps5
Setup~15 min

How it runs

The automated pipeline, trigger to output.

  • TriggerWebhook fires on reverse-ETL run failureHTTP webhook
  • LogicCompute reject rate and classify severity
  • ActionRecord reject-rate metric to Datadog for trend trackingDatadogDatadog
  • LogicBranch: route by severity threshold
  • OutputOpen PagerDuty incident for high-severity failuresPagerDutyPagerDuty

What it does

This workflow listens for the failure callback your reverse-ETL platform fires when a sync run ends with rejected rows. It computes the reject rate against total rows, decides whether the failure is routine noise or a real incident, and routes accordingly so on-call only gets paged for genuine breakage.

When to use it

Use this when every minor sync hiccup currently pages someone at 3am. It adds a severity gate so a handful of bad emails stays quiet, but a connector outage or schema break escalates immediately.

How it works

  1. 1An incoming webhook fires when the reverse-ETL run completes with failures.
  2. 2The flow parses the payload for total rows, rejected rows, and the failure category.
  3. 3A logic step computes reject rate and compares it to your severity thresholds.
  4. 4Below threshold, it records the event to a Datadog metric for trend tracking and stops.
  5. 5At or above threshold, it opens a PagerDuty incident with the connector name, reject count, and a sample error.
  6. 6The incident links back to the failing run so the responder lands on the right place.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect HTTP webhookTrigger any URL on agent actions.
  2. 2
    Connect DatadogMetrics, traces, log search.
  3. 3
    Connect PagerDutyIncidents, on-call, escalations.
  4. 4
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  5. 5
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  6. 6
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.