Triage flaky release blockers and page the owning team

When a release-blocking pipeline fails, an agent decides whether the failure is a known flaky test or a real regression, quarantines confirmed flakes, and pages the owning team for real regressions.

Category: DevOps
Engine: paperclip
Difficulty: advanced
Trigger: event
Steps: 6
Setup: ~25 min

How it runs

The automated pipeline, trigger to output.

  • Trigger: Release pipeline fails
  • Action: Agent classifies each failure as flaky or regression
  • Logic: Route flakes and regressions to separate paths
  • Action: Quarantine flakes and open GitHub tracking issue (GitHub)
  • Action: Trigger PagerDuty incident for real regressions (PagerDuty)
  • Output: Post triage decision to Slack (Slack)

What it does

When a release pipeline goes red, this workflow distinguishes a flaky test from a genuine regression. Confirmed flakes get quarantined so the release can proceed; real failures page the owning on-call so a human investigates.

When to use it

Use it on release branches where a red pipeline is expensive and you need fast, defensible triage. It avoids paging humans for known-flaky noise while making sure actual regressions still wake someone up.

How it works

  1. A failed release pipeline fires the trigger with the failing test list and logs.
  2. An agent reviews each failure against historical flake data and the diff to classify it as flaky or a real regression.
  3. A logic step routes flakes and regressions down separate paths.
  4. For flakes, the agent commits a quarantine entry in GitHub and opens a tracking issue.
  5. For real failures, it triggers a PagerDuty incident assigned to the owning team.
  6. It posts the triage decision and reasoning to the release channel.
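
Steps 2 and 3 can be sketched in Python. This is a minimal illustration only: in the workflow the classification is done by a model-driven agent, and the deterministic heuristic here (historical flake rate plus whether the diff touches code the test exercises) is an assumed stand-in. The names `Failure`, `classify`, `route`, and the `flake_threshold` value are hypothetical and not part of the product.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Failure:
    test_id: str          # hypothetical identifier for the failing test
    source_paths: tuple   # files the test exercises (assumed to be known)


def classify(failure, flake_rates, changed_paths, flake_threshold=0.05):
    """Return "flaky" or "regression" for one failure.

    Assumed heuristic: call a failure flaky only if the test has a history
    of intermittent failures AND the diff does not touch code it exercises.
    Anything else is treated as a regression, so unknown tests page a human.
    """
    touched = any(path in changed_paths for path in failure.source_paths)
    history = flake_rates.get(failure.test_id, 0.0)
    if history >= flake_threshold and not touched:
        return "flaky"
    return "regression"


def route(failures, flake_rates, changed_paths):
    """Split failures into the two paths described in step 3."""
    routed = {"flaky": [], "regression": []}
    for failure in failures:
        label = classify(failure, flake_rates, changed_paths)
        routed[label].append(failure.test_id)
    return routed


# Example input: one test with flake history, one touched by the diff.
failures = [
    Failure("test_checkout_timeout", ("checkout/client.py",)),
    Failure("test_tax_rounding", ("billing/tax.py",)),
]
result = route(
    failures,
    flake_rates={"test_checkout_timeout": 0.12},
    changed_paths={"billing/tax.py"},
)
print(result)
# {'flaky': ['test_checkout_timeout'], 'regression': ['test_tax_rounding']}
```

Defaulting to "regression" whenever the evidence is incomplete is a deliberate choice in this sketch: a missed page costs more than an unnecessary one on a release branch.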

Set it up

What you configure once, before turning it on.

  1. Connect GitHub. Repos, issues, pull requests, actions.
  2. Connect PagerDuty. Incidents, on-call, escalations.
  3. Connect Slack. Channels, DMs, threads, mentions.
  4. Set each agent's model. We leave models unset so you pick the tier: fast and cheap, or top quality.
  5. Tune it to your data. Edit the prompts, filters, and field mappings so it matches how your team works.
  6. Test, then turn it on. Run once against a sample, confirm the output, then enable the trigger.
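
One way to picture the "run once against a sample" step is a dry run that builds the list of actions the workflow would take without calling any integration. The sketch below assumes a routing result shaped as a dictionary of test IDs. The function `run_triage` and the action names are hypothetical and do not correspond to real GitHub, PagerDuty, or Slack API calls.

```python
def run_triage(routed, actions, dry_run=True):
    """Turn a routing result into actions; in dry-run mode only plan them.

    `routed` maps "flaky" / "regression" to lists of test IDs.
    `actions` maps action names to callables (hypothetical integrations).
    Returns the ordered plan so it can be inspected before going live.
    """
    plan = []
    for test_id in routed.get("flaky", []):
        plan.append(("quarantine_and_open_issue", test_id))
    for test_id in routed.get("regression", []):
        plan.append(("page_owning_team", test_id))
    # The summary count is computed before the summary itself is appended.
    plan.append(("post_to_slack", f"{len(plan)} triage decision(s)"))

    if not dry_run:
        for name, argument in plan:
            actions[name](argument)
    return plan


# Sample payload: confirm the planned output before enabling the trigger.
sample = {"flaky": ["test_checkout_timeout"], "regression": ["test_tax_rounding"]}
for step in run_triage(sample, actions={}, dry_run=True):
    print(step)
```

With `dry_run=True` the `actions` mapping is never consulted, so the sample can be checked before any integration is connected.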
