
Detect Flaky Tests from CI Reruns and Open Linear Tickets

Watches GitHub Actions test results and flags any test that passed on rerun after failing on the same commit.

Category: DevOps
Engine: sim
Difficulty: intermediate
Trigger: event
Steps: 6
Setup: ~15 min

How it runs

The automated pipeline, trigger to output.

  • Trigger: GitHub workflow_run completed (GitHub)
  • Action: Fetch job annotations and rerun history (GitHub)
  • Logic: Failed then passed on same SHA?
  • Action: Search Linear for existing flaky ticket (Linear)
  • Action: Create labeled Linear issue if new (Linear)
  • Output: Post ticket link as GitHub commit status (GitHub)

What it does

This workflow catches intermittent test failures the moment CI proves them flaky: a test fails, then passes on a rerun against the identical commit SHA. Each newly confirmed flaky test gets a deduplicated Linear ticket, so it stops silently eroding trust in the pipeline.
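
As a rough illustration of the "failed, then passed on the same SHA" rule, here is a minimal Python sketch. The `AttemptResult` record and the `find_flaky_tests` helper are hypothetical names invented for this example; the workflow's actual logic step is configured through prompts and filters and may differ.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class AttemptResult:
    """Hypothetical record: one test outcome from one run attempt."""
    sha: str        # commit SHA the run executed against
    attempt: int    # run attempt number (1 = first run, 2+ = reruns)
    test_name: str  # fully-qualified test name
    passed: bool


def find_flaky_tests(results):
    """Return sorted (sha, test_name) pairs that failed and later passed on one SHA."""
    by_key = defaultdict(list)
    for result in results:
        by_key[(result.sha, result.test_name)].append(result)

    flaky = []
    for key, attempts in by_key.items():
        seen_failure = False
        # Walk the attempts in order so only fail-then-pass counts as flaky.
        for result in sorted(attempts, key=lambda r: r.attempt):
            if not result.passed:
                seen_failure = True
            elif seen_failure:
                flaky.append(key)
                break
    return sorted(flaky)
```

Under this rule, a test that fails on every attempt is treated as a real failure rather than a flaky one, and a test that only ever passes is ignored.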

When to use it

Run this when your CI has rerun-on-failure enabled and red builds turn green without a code change. It converts the noise of "just hit rerun" into accountable, tracked work instead of letting flaky tests pile up unowned.

How it works

  1. A GitHub workflow_run event with the completed action fires when a test workflow run finishes.
  2. The flow fetches the run's job annotations and rerun history for that commit SHA.
  3. A logic step compares attempts: if the same test failed and then passed on the same SHA, it is flagged flaky; otherwise the run is ignored.
  4. It searches Linear for an open ticket matching the test's fully-qualified name to avoid duplicates.
  5. If none exists, it creates a Linear issue with the `flaky-test` label, the failure rate, and a link to the failing run.
  6. The new ticket URL is posted back as a GitHub commit status for visibility.
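
The deduplication, ticket, and status steps can be sketched in Python as plain data builders. Everything here is an assumption made for illustration: the helper names, the "Flaky test: <name>" title convention, the `flaky-tests/linear` context string, and the choice of the `success` state are not taken from the workflow itself. The issue dictionary is a simplified stand-in, not the shape of Linear's API. The commit status fields (`state`, `target_url`, `description`, `context`) follow GitHub's commit status API.

```python
def ticket_title(test_name):
    """Hypothetical title convention, used as the deduplication key."""
    return f"Flaky test: {test_name}"


def needs_new_ticket(test_name, open_ticket_titles):
    """True when no open ticket already carries this test's exact title."""
    return ticket_title(test_name) not in set(open_ticket_titles)


def build_issue(test_name, failures, attempts, run_url):
    """Assemble the issue fields: label, failure rate, and failing-run link."""
    rate = failures / attempts
    return {
        "title": ticket_title(test_name),
        "labels": ["flaky-test"],
        "description": (
            f"Failure rate: {rate:.0%} ({failures}/{attempts} attempts)\n"
            f"Failing run: {run_url}"
        ),
    }


def build_commit_status(ticket_url, test_name):
    """Assemble a commit status payload that links back to the ticket."""
    return {
        "state": "success",  # assumption: the rerun passed, so do not block the commit
        "target_url": ticket_url,
        # Truncated defensively to keep the description short.
        "description": f"Flaky test ticketed: {test_name}"[:140],
        "context": "flaky-tests/linear",  # hypothetical context name
    }
```

Matching on the exact title rather than a substring keeps `pkg.test_a` from being mistaken for `pkg.test_ab` when checking for an existing ticket.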

Set it up

What you configure once, before turning it on.

  1. Connect GitHub. Repos, issues, pull requests, actions.
  2. Connect Linear. Issues, projects, cycles, triage.
  3. Set each agent's model. We leave models unset so you pick the tier: fast + cheap, or top-quality.
  4. Tune it to your data. Edit the prompts, filters, and field mappings so it matches how your team works.
  5. Test, then turn it on. Run once against a sample, confirm the output, then enable the trigger.
