
Weekly Flaky-Test Quarantine Digest with Auto-Filed Cleanup Tasks

Aggregates the week's flaky-test signals, posts a ranked digest to Slack, and files a ClickUp cleanup task for each newly quarantined test.

Category: Engineering
Engine: sim
Difficulty: intermediate
Trigger: schedule
Steps: 5
Setup: ~15 min

How it runs

The automated pipeline, trigger to output.

  • Trigger: Weekly scheduled digest run
  • Action: Query flaky occurrences from metrics store (PostgreSQL)
  • Logic: Rank tests and split new vs existing quarantine
  • Action: Post ranked flakiness digest to Slack
  • Output: File ClickUp tasks for newly quarantined tests
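
As a rough illustration of the query step, the sketch below computes the trailing seven-day window and the prior week's window (used for the trend comparison) and pairs them with a parameterized query. The table name `flaky_occurrences` and its columns are hypothetical, and the `%(name)s` placeholders assume a driver that uses the pyformat parameter style, such as psycopg; adapt both to your own metrics store.

```python
from datetime import datetime, timedelta, timezone

# Hypothetical table and column names; adjust to your metrics store schema.
FLAKY_QUERY = """
SELECT test_name, pipeline
FROM flaky_occurrences
WHERE occurred_at >= %(start)s AND occurred_at < %(end)s
"""


def weekly_windows(run_at):
    """Return the (start, end) pair for the trailing week and for the prior week."""
    week = timedelta(days=7)
    current = (run_at - week, run_at)
    prior = (run_at - 2 * week, run_at - week)
    return current, prior


if __name__ == "__main__":
    current, prior = weekly_windows(datetime.now(timezone.utc))
    # These dicts would be passed as query parameters alongside FLAKY_QUERY.
    print({"start": current[0], "end": current[1]})
    print({"start": prior[0], "end": prior[1]})
```

Using half-open windows (start inclusive, end exclusive) keeps an occurrence that lands exactly on the boundary from being counted in both weeks.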

What it does

This workflow gives engineering a single weekly view of CI instability. It rolls up all flaky-test signals recorded over the past week from your metrics store, ranks tests by flake frequency, posts a digest to Slack, and files a ClickUp cleanup task for every test that newly entered quarantine that week.
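
One way to picture the ranking is the minimal sketch below. It assumes each flaky signal arrives as a (test name, pipeline) pair; the `FlakyOccurrence` and `RankedTest` types and the tie-breaking order are illustrative assumptions, not the workflow's actual schema.

```python
from collections import defaultdict
from typing import NamedTuple, Tuple


class FlakyOccurrence(NamedTuple):
    """One flaky signal row; the field names are hypothetical."""
    test_name: str
    pipeline: str


class RankedTest(NamedTuple):
    test_name: str
    flake_count: int
    pipelines: Tuple[str, ...]


def rank_flaky_tests(occurrences):
    """Rank tests by flake count, then by number of affected pipelines."""
    counts = defaultdict(int)
    pipelines = defaultdict(set)
    for occ in occurrences:
        counts[occ.test_name] += 1
        pipelines[occ.test_name].add(occ.pipeline)
    ranked = [
        RankedTest(name, counts[name], tuple(sorted(pipelines[name])))
        for name in counts
    ]
    # Highest count first; ties go to the wider pipeline spread, then to name order.
    ranked.sort(key=lambda t: (-t.flake_count, -len(t.pipelines), t.test_name))
    return ranked


if __name__ == "__main__":
    sample = [
        FlakyOccurrence("test_login", "ci"),
        FlakyOccurrence("test_login", "nightly"),
        FlakyOccurrence("test_upload", "ci"),
    ]
    for row in rank_flaky_tests(sample):
        print(row)
```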

When to use it

Use it when per-run alerts are too noisy and you want one calm, recurring report that surfaces the top offenders and turns the new ones into tracked work. Ideal for a weekly engineering health review or sprint planning input.

How it works

  1. A scheduled trigger runs once a week.
  2. The flow queries the Postgres metrics store for flaky occurrences in the trailing seven days and ranks tests by frequency and affected pipelines.
  3. A branch separates tests already in quarantine from those newly crossing the threshold this week.
  4. It posts a ranked Slack digest with totals, top offenders, and trend versus the prior week.
  5. For each newly quarantined test it files a ClickUp cleanup task so nothing new slips through untracked.
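
Steps 3 to 5 above can be sketched as three small functions. This is only an illustration under stated assumptions: the ranked input is a list of (test name, flake count) pairs sorted by count, the threshold value is yours to choose, and the digest wording and task fields (`name`, `description`) are hypothetical placeholders for whatever your Slack and ClickUp steps expect.

```python
def split_by_quarantine(ranked, already_quarantined, threshold):
    """Return (existing, new): tests already quarantined vs newly over the threshold."""
    existing, new = [], []
    for test_name, flake_count in ranked:
        if test_name in already_quarantined:
            existing.append((test_name, flake_count))
        elif flake_count >= threshold:
            new.append((test_name, flake_count))
    return existing, new


def build_digest(ranked, new, prior_week_total, top_n=5):
    """Format the digest text: totals, trend versus the prior week, top offenders."""
    total = sum(count for _, count in ranked)
    delta = total - prior_week_total
    lines = [
        f"Flaky-test digest: {total} flaky occurrences ({delta:+d} vs prior week)",
        f"Newly quarantined: {len(new)}",
        "Top offenders:",
    ]
    for position, (test_name, count) in enumerate(ranked[:top_n], start=1):
        lines.append(f"{position}. {test_name}: {count}")
    return "\n".join(lines)


def build_cleanup_tasks(new):
    """Build one task payload per newly quarantined test (field names are assumed)."""
    return [
        {
            "name": f"Fix flaky test: {test_name}",
            "description": f"Flaked {count} times in the trailing seven days.",
        }
        for test_name, count in new
    ]


if __name__ == "__main__":
    ranked = [("test_upload", 4), ("test_login", 3), ("test_search", 1)]
    existing, new = split_by_quarantine(ranked, {"test_login"}, threshold=3)
    print(build_digest(ranked, new, prior_week_total=6))
    print(build_cleanup_tasks(new))
```

Tests below the threshold that are not yet quarantined appear in the digest totals but produce no task, which matches the intent of filing work only for new entries.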

Set it up

What you configure once, before turning it on.

  1. Connect Postgres: any Postgres URL (query, write, migrate).
  2. Connect Slack: channels, DMs, threads, mentions.
  3. Connect ClickUp: docs, tasks, and chats in one workspace.
  4. Set each agent's model: models are left unset so you pick the tier, fast and cheap or top-quality.
  5. Tune it to your data: edit the prompts, filters, and field mappings so it matches how your team works.
  6. Test, then turn it on: run once against a sample, confirm the output, then enable the trigger.
