DEVOPS

Re-run quarantined tests and auto-unquarantine the ones that stabilize

When a quarantined test issue is labeled ready-recheck in GitHub, runs the test in isolation many times.

CategoryDevOps
Enginesim
Difficultyadvanced
Triggerevent
Steps5
Setup~25 min

How it runs

The automated pipeline, trigger to output.

  • TriggerIssue labeled ready-recheck in GitHubGitHubGitHub
  • ActionRun targeted test N times via shellShell
  • LogicBranch: stable above threshold vs still flaky
  • ActionOpen PR removing skip and close issueGitHubGitHub
  • OutputReturn PR URL or recheck verdict

What it does

This workflow closes the quarantine loop by safely returning healthy tests to the suite. When an engineer labels a quarantine issue ready-recheck, it runs the named test in isolation a configurable number of times via a shell runner. If the test passes on every iteration above the confidence threshold, it opens a GitHub PR removing the skip annotation and closes the tracking issue; if it flakes again, it comments with the failure rate and keeps the quarantine in place.

When to use it

Use it after a suspected fix lands and you want objective proof a test is stable before un-skipping it, rather than trusting a single green run.

How it works

  1. 1A GitHub issue labeled ready-recheck triggers the workflow.
  2. 2A shell step runs the targeted test N times in isolation and records the pass count.
  3. 3A branch checks whether the pass rate clears the stability threshold.
  4. 4If stable, it opens a GitHub PR removing the quarantine annotation and closes the issue.
  5. 5If still flaky, it comments the measured failure rate and leaves the quarantine.
  6. 6The PR URL or the recheck verdict is returned as output.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect GitHubRepos, issues, pull requests, actions.
  2. 2
    Connect ShellRun sandboxed commands inside the workspace.
  3. 3
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  4. 4
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  5. 5
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.