Stale Quarantine Escalation to PagerDuty

Scans open flaky-test quarantine issues on a schedule and escalates ones aging past an SLA to PagerDuty so quarantined tests don't rot indefinitely.

Category: Engineering
Engine: sim
Difficulty: intermediate
Trigger: schedule
Steps: 6
Setup: ~15 min

How it runs

The automated pipeline, trigger to output.

  • Trigger: Weekly schedule
  • Action: Query open flaky-test issues in Linear
  • Logic: Bucket issues by age vs SLA
  • Action: Nudge assignee on warning-stage issues (Linear)
  • Action: Open PagerDuty incident for SLA breaches (PagerDuty)
  • Output: Post quarantine backlog digest to Slack
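
As a rough illustration of the logic step, the bucketing could look something like the sketch below. The `QuarantineIssue` shape and the `bucket_issues` helper are hypothetical names invented for this example, not part of the workflow itself, and the thresholds are assumptions borrowed from the page's own example (warn at 14 days, escalate at 30).

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone

# Assumed thresholds: warn at 14 days, escalate at 30. Tune to your own SLA.
WARN_AFTER = timedelta(days=14)
ESCALATE_AFTER = timedelta(days=30)


@dataclass
class QuarantineIssue:
    # Hypothetical shape; a real Linear issue carries many more fields.
    identifier: str
    title: str
    assignee: str
    created_at: datetime


def bucket_issues(issues, now):
    """Split issues into ok / warning / breach buckets by age since creation."""
    buckets = {"ok": [], "warning": [], "breach": []}
    for issue in issues:
        age = now - issue.created_at
        if age >= ESCALATE_AFTER:
            buckets["breach"].append(issue)
        elif age >= WARN_AFTER:
            buckets["warning"].append(issue)
        else:
            buckets["ok"].append(issue)
    return buckets
```

Passing `now` in explicitly, rather than reading the clock inside the function, keeps the bucketing easy to test against a fixed sample before the trigger is enabled.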

What it does

Quarantining a flaky test is meant to be temporary, but skipped tests have a way of staying skipped forever. This workflow audits open quarantine issues on a schedule and escalates any that have aged past your SLA: it nudges the assignee at the warning stage and, once an issue breaches the SLA, pages the on-call lead via PagerDuty.

When to use it

Use it once you have a quarantine backlog and want a forcing function so coverage doesn't silently erode. It enforces a "fix or formally remove within N days" policy.

How it works

  1. A weekly schedule starts the audit.
  2. The flow queries Linear for all open issues labeled `flaky-test` with their creation dates.
  3. A logic step buckets issues by age against the SLA (e.g. warn at 14 days, escalate at 30).
  4. Warning-stage issues get a nudge comment to the assignee in Linear.
  5. SLA-breaching issues trigger a PagerDuty incident routed to the test-platform on-call.
  6. It posts a digest of the quarantine backlog and escalations to Slack.
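
To make the last two steps concrete, here is a minimal sketch of what an escalation event and a digest message might look like. It assumes the PagerDuty side uses the Events API v2 `trigger` event format; the page does not say which PagerDuty API the connector actually calls. The function names, the `source` string, the severity choice, and the digest wording are all hypothetical.

```python
def build_pagerduty_event(routing_key, issue_id, title, age_days):
    """Build an Events API v2 'trigger' event body for one SLA-breaching issue."""
    return {
        "routing_key": routing_key,  # integration key of the on-call service
        "event_action": "trigger",
        # Assumption: a stable dedup_key so a later weekly run for the same
        # issue is grouped with the still-open alert instead of duplicating it.
        "dedup_key": f"stale-quarantine-{issue_id}",
        "payload": {
            "summary": f"Quarantined test past SLA ({age_days} days): {issue_id} {title}",
            "source": "stale-quarantine-audit",  # hypothetical source name
            "severity": "warning",
        },
    }


def format_digest(warning_ids, breach_ids):
    """Render a short plain-text backlog digest suitable for a Slack message."""
    lines = [
        f"Quarantine backlog: {len(warning_ids)} warning, {len(breach_ids)} past SLA"
    ]
    if warning_ids:
        lines.append("Nudged: " + ", ".join(warning_ids))
    if breach_ids:
        lines.append("Paged: " + ", ".join(breach_ids))
    return "\n".join(lines)
```

Both helpers only build data and send nothing, so they can be checked against a sample issue before any real incident is opened.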

Set it up

What you configure once, before turning it on.

  1. Connect Linear: Issues, projects, cycles, triage.
  2. Connect PagerDuty: Incidents, on-call, escalations.
  3. Connect Slack: Channels, DMs, threads, mentions.
  4. Set each agent's model: We leave models unset so you pick the tier, fast and cheap or top-quality.
  5. Tune it to your data: Edit the prompts, filters, and field mappings so it matches how your team works.
  6. Test, then turn it on: Run once against a sample, confirm the output, then enable the trigger.
