ENGINEERING

Flaky-Test Recovery and Auto-Unquarantine

Periodically re-runs quarantined specs in isolation; when a test passes consistently across several runs it reverses the quarantine, closes the tracking issue.

CategoryEngineering
Enginesim
Difficultyintermediate
Triggerschedule
Steps6
Setup~15 min

How it runs

The automated pipeline, trigger to output.

  • TriggerSchedule every few days
  • ActionList open quarantine issues and their specsGitLabGitLab
  • ActionRe-run each spec in isolation N timesGitLabGitLab
  • LogicRequire clean sweep across all runs
  • ActionOpen un-quarantine MR and close issueGitLabGitLab
  • OutputNotify original reporter in SlackSlack

What it does

Keeps quarantine from becoming a graveyard. On a schedule it re-executes each quarantined spec several times in isolation, and when a test passes cleanly across all attempts it removes the skip annotation, closes the GitLab tracking issue, and tells the person who filed it that the test is healthy again.

When to use it

Use it when quarantined tests accumulate and lose coverage value because nobody revisits them. This proves stability before reinstating a test and gives credit back to the original reporter.

How it works

  1. 1A schedule trigger fires every few days.
  2. 2The flow lists open `flaky::quarantine` issues and the specs they reference.
  3. 3A GitLab pipeline triggers each spec in isolation, repeated N times (default 10).
  4. 4A logic step checks for a clean sweep across all runs.
  5. 5For consistent passers it opens an MR removing the quarantine annotation and closes the tracking issue.
  6. 6A Slack DM notifies the original reporter that their test was unquarantined.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect GitLabRepos, MRs, pipelines, registry.
  2. 2
    Connect SlackChannels, DMs, threads, mentions.
  3. 3
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  4. 4
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  5. 5
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.