ENGINEERING

Auto-Open a Skip PR for Repeat-Offender Flaky Tests

When a test crosses the flake-count limit it opens a draft PR annotating the test as skipped or quarantined, then files a ClickUp follow-up to unskip after a real fix.

CategoryEngineering
Enginesim
Difficultyadvanced
Triggerevent
Steps5
Setup~25 min

How it runs

The automated pipeline, trigger to output.

  • TriggerGitHub check run completedGitHubGitHub
  • LogicCompute rolling flake count past repeat-offender limit
  • ActionBranch and apply quarantine annotationGitHubGitHub
  • ActionOpen draft skip PR referencing failing runsGitHubGitHub
  • OutputFile ClickUp follow-up to unskip after fixClickUpClickUp

What it does

This workflow stops a chronically flaky test from blocking everyone's builds. Once a test's flake count crosses the limit, it opens a draft GitHub pull request that adds a skip or quarantine annotation to that test, then files a ClickUp follow-up task so the skip is temporary and someone is on the hook to fix and re-enable it.

When to use it

Use it when a single flaky test is repeatedly red-X-ing PRs and the team needs immediate relief without losing track of the debt. It suits repos with a quarantine annotation convention and a culture of fixing rather than permanently skipping.

How it works

  1. 1A GitHub check-run event fires when a CI run finishes.
  2. 2The flow records the result and computes the rolling flake count for each test that failed then passed.
  3. 3A branch fires only when a test exceeds the repeat-offender limit and is not already quarantined.
  4. 4It creates a branch, applies the skip or quarantine annotation, and opens a draft PR referencing the failing runs.
  5. 5It files a ClickUp follow-up task to fix the root cause and remove the skip, assigned to the test's owner.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect GitHubRepos, issues, pull requests, actions.
  2. 2
    Connect ClickUpDocs + tasks + chats in one workspace.
  3. 3
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  4. 4
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  5. 5
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.