ENGINEERING

Auto-Quarantine Flaky Test via PR with Slack Review

When a test is confirmed flaky three times, an agent opens a GitHub pull request that marks the test as skipped with a tracking comment.

CategoryEngineering
Enginepaperclip
Difficultyadvanced
Triggerwebhook
Steps6
Setup~25 min

How it runs

The automated pipeline, trigger to output.

  • TriggerFlaky-count threshold webhookHTTP webhook
  • ActionLocate test definition in repoGitHubGitHub
  • ActionOpen quarantine PR with skip annotationGitHubGitHub
  • ActionLink tracking ClickUp item in PR bodyClickUpClickUp
  • LogicAwait Slack approve or reject
  • OutputLabel PR ready or close branchGitHubGitHub

What it does

It automatically takes a repeatedly flaky test out of the blocking path. After a test trips the flaky signature a configured number of times, an agent branches the repo, annotates the test with a skip marker and a tracking reference, opens a pull request, and asks engineers in Slack to review the quarantine before merge.

When to use it

Use it when flaky tests keep blocking otherwise-green PRs and you want a fast, auditable way to sideline them without a human writing the skip by hand. Humans still approve, so nothing leaves the suite silently.

How it works

  1. 1A webhook fires when a flaky test crosses the configured failure count.
  2. 2The agent locates the test definition in the GitHub repo.
  3. 3It creates a branch and edits the test to add a skip annotation plus a link to the tracking ClickUp item.
  4. 4It opens a pull request describing the flakiness evidence.
  5. 5It posts the PR to Slack with approve and reject actions.
  6. 6On approval the PR is labeled ready to merge; rejection closes the branch.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect GitHubRepos, issues, pull requests, actions.
  2. 2
    Connect SlackChannels, DMs, threads, mentions.
  3. 3
    Connect ClickUpDocs + tasks + chats in one workspace.
  4. 4
    Connect HTTP webhookTrigger any URL on agent actions.
  5. 5
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  6. 6
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  7. 7
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.