ENGINEERING

Weekly Quarantined-Test Review Board

Once a week, compiles every test currently carrying the flaky-quarantine label into a Notion review board with age, flake history, and owner, and de-quarantines tests that have…

CategoryEngineering
Enginesim
Difficultyintermediate
Triggerschedule
Steps6
Setup~15 min

How it runs

The automated pipeline, trigger to output.

  • TriggerWeekly schedule
  • ActionList issues labeled flaky-quarantineGitHubGitHub
  • ActionEnrich with Datadog stability + ageDatadogDatadog
  • LogicFlag tests stable long enough to re-enable
  • ActionDe-quarantine: close issue, drop labelGitHubGitHub
  • OutputPublish review board to Notion + SlackNotionNotion

What it does

Produces a weekly review board of all quarantined tests so they don't rot in skip-lists forever. It pulls every open issue labeled `flaky-quarantine`, enriches each with age and recent stability from Datadog, writes a Notion board, and automatically de-quarantines tests that have passed consistently since being parked.

When to use it

Use it to close the loop on quarantine — the hard part isn't parking flaky tests, it's getting them fixed or safely re-enabled. This gives the team a standing artifact and prevents permanent test debt.

How it works

  1. 1A weekly schedule trigger starts the review.
  2. 2It lists all open GitHub issues with the `flaky-quarantine` label.
  3. 3For each, it pulls recent run stability from Datadog and the issue age.
  4. 4A branch flags tests stable for N days as ready to re-enable.
  5. 5For those, it removes the skip-list entry, closes the issue, and drops the label.
  6. 6It writes the full board (still-quarantined, re-enabled, stale) to a Notion page and pings the team in Slack.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect GitHubRepos, issues, pull requests, actions.
  2. 2
    Connect DatadogMetrics, traces, log search.
  3. 3
    Connect NotionPages, databases, comments.
  4. 4
    Connect SlackChannels, DMs, threads, mentions.
  5. 5
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  6. 6
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  7. 7
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.