ENGINEERING

Flaky test detector: quarantine reruns and file owner-assigned Linear tickets

On every GitHub Actions CI failure, reruns the failed tests in isolation to confirm flakiness.

CategoryEngineering
Enginesim
Difficultyintermediate
Triggerwebhook
Steps6
Setup~15 min

How it runs

The automated pipeline, trigger to output.

  • TriggerGitHub Actions CI run failsGitHubGitHub
  • ActionRerun failed tests in isolationGitHubGitHub
  • LogicPass on retry = flaky, else real failure
  • ActionCommit quarantine skip annotation as a PRGitHubGitHub
  • ActionResolve test file CODEOWNERGitHubGitHub
  • OutputFile owner-assigned Linear ticketLinearLinear

What it does

Catches tests that fail intermittently rather than for a real regression. When CI goes red, it isolates the failing tests, reruns them, and if they pass on retry it marks them flaky, adds a skip annotation, and opens a Linear ticket routed to whoever owns that file.

When to use it

Use when flaky tests are blocking merges and developers are blindly re-running pipelines. It stops the rerun roulette by quarantining the offender and assigning a real owner, so the main branch stays green without hiding the debt.

How it works

  1. 1A GitHub Actions workflow_run failure fires the webhook with the failed job and test names.
  2. 2The flow reruns just the failed tests in isolation via a GitHub dispatch.
  3. 3A branch checks the rerun result: pass-on-retry means flaky; fail again means a real break and the flow exits.
  4. 4For flaky tests, it commits a quarantine annotation (skip + tag) on a new branch and opens a PR.
  5. 5It resolves the test file's CODEOWNER from GitHub.
  6. 6It files a Linear ticket assigned to that owner with logs, rerun history, and the quarantine PR link.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect GitHubRepos, issues, pull requests, actions.
  2. 2
    Connect LinearIssues, projects, cycles, triage.
  3. 3
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  4. 4
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  5. 5
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.