DEVOPS

Auto-quarantine flaky tests from CI failures and file a tracking issue

Watches GitHub Actions test runs for tests that pass on retry but failed on first attempt, marks them as quarantined in the suite.

CategoryDevOps
Enginesim
Difficultyintermediate
Triggerevent
Steps6
Setup~15 min

How it runs

The automated pipeline, trigger to output.

  • TriggerGitHub Actions workflow run completesGitHubGitHub
  • LogicDetect tests that failed first attempt but passed on retry
  • LogicSkip tests that already have an open quarantine issue
  • ActionCommit quarantine manifest update to isolate the testGitHubGitHub
  • ActionOpen GitHub tracking issue with failure historyGitHubGitHub
  • OutputPost quarantine summary to SlackSlack

What it does

It catches tests that fail intermittently in CI, isolates them so they stop blocking merges, and opens a GitHub issue that records the offending test, its failure rate, and the runs where it flaked.

When to use it

Use it when a green build keeps getting blocked by one or two unreliable tests and your team wants those tests pulled out of the gating path automatically instead of someone manually adding `@skip` and forgetting to track it.

How it works

  1. 1A completed GitHub Actions workflow run fires the trigger with its test report attached.
  2. 2The flow parses the JUnit results and compares first-attempt outcomes against retry outcomes to find tests that failed then passed.
  3. 3A logic step checks whether each flaky test already has an open quarantine issue to avoid duplicates.
  4. 4For new offenders it commits a change to the quarantine manifest file in the repo, moving the test out of the required suite.
  5. 5It opens a GitHub issue labeled `flaky-test` with the failure history and links the commit.
  6. 6It posts a short summary to the team channel so engineers know what was isolated.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect GitHubRepos, issues, pull requests, actions.
  2. 2
    Connect SlackChannels, DMs, threads, mentions.
  3. 3
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  4. 4
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  5. 5
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.