ENGINEERING

Escalate Repeat-Offender Flaky Tests to PagerDuty

When a flaky test is filed for the third time in a rolling 30-day window, escalates it as a PagerDuty incident to the owning team and bumps the Linear issue priority to urgent.

CategoryEngineering
Enginesim
Difficultyadvanced
Triggerwebhook
Steps5
Setup~25 min

How it runs

The automated pipeline, trigger to output.

  • TriggerWebhook: flaky-test issue created or reopenedHTTP webhook
  • ActionCount prior filings for the test in last 30 daysLinearLinear
  • LogicProceed only on third strike within window
  • ActionSet Linear issue to urgent and assign team leadLinearLinear
  • OutputTrigger PagerDuty incident with flake historyPagerDutyPagerDuty

What it does

Applies pressure to the worst chronic flakes. A test that keeps getting re-quarantined despite prior fixes is a reliability liability. This counts repeat filings per test and, on the third strike within 30 days, pages the owning team and escalates the tracking issue to urgent — making the flake impossible to keep deferring.

When to use it

Use it when a handful of tests flake repeatedly, get patched superficially, and return. It enforces accountability by turning chronic flakiness into a paged incident.

How it works

  1. 1A webhook fires each time a flaky-test issue is created or reopened.
  2. 2An action counts prior quarantine filings for the same test in the last 30 days from the issue store.
  3. 3A logic step proceeds only when the count reaches the third strike.
  4. 4An action sets the Linear issue priority to urgent and assigns the owning team lead.
  5. 5An output triggers a PagerDuty incident routed to the team's service with the flake history and links.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect HTTP webhookTrigger any URL on agent actions.
  2. 2
    Connect LinearIssues, projects, cycles, triage.
  3. 3
    Connect PagerDutyIncidents, on-call, escalations.
  4. 4
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  5. 5
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  6. 6
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.