ENGINEERING

Auto-Skip a Confirmed Flaky Test via PR and Notify the Author

When a test is confirmed flaky, opens a GitHub pull request that adds a skip annotation referencing the tracking issue.

CategoryEngineering
Enginesim
Difficultyadvanced
Triggerwebhook
Steps5
Setup~25 min

How it runs

The automated pipeline, trigger to output.

  • TriggerWebhook: test confirmed flakyHTTP webhook
  • ActionResolve last author and file path via GitHub git blameGitHubGitHub
  • LogicBuild skip annotation linked to tracking issue
  • ActionOpen GitHub PR adding the skip markerGitHubGitHub
  • OutputSlack DM the author to review the auto-skipSlack

What it does

Closes the loop between detection and remediation. Once a test is confirmed flaky, this bot writes the actual code change to skip it — opening a real PR that adds a skip/xfail marker linked to the tracking issue — and pings the engineer who last touched the test so the unblock is reviewed, not rubber-stamped.

When to use it

Use it when flaky tests are blocking merges and you want to unblock the pipeline fast without permanently disabling coverage silently. Every skip is a reviewable PR tied to an issue.

How it works

  1. 1A webhook fires when a flaky test is confirmed (from an upstream detector or a manual label).
  2. 2An action uses git blame via the GitHub API to find the test's last author and file location.
  3. 3A logic step builds the skip annotation referencing the tracking issue number.
  4. 4An action opens a GitHub PR on a new branch adding the skip marker.
  5. 5A final output sends a Slack DM to the author with the PR link asking them to review the temporary skip.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect GitHubRepos, issues, pull requests, actions.
  2. 2
    Connect SlackChannels, DMs, threads, mentions.
  3. 3
    Connect HTTP webhookTrigger any URL on agent actions.
  4. 4
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  5. 5
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  6. 6
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.