DEVOPS

Alert when a test's flake rate crosses a threshold

Runs on a schedule, computes each test's recent flake rate from Datadog CI Visibility metrics.

CategoryDevOps
Enginesim
Difficultyintermediate
Triggerschedule
Steps5
Setup~15 min

How it runs

The automated pipeline, trigger to output.

  • TriggerScheduled daily check
  • ActionQuery Datadog CI flake ratesDatadogDatadog
  • LogicFilter tests over threshold, dedupe tracked
  • ActionCreate Linear ticket per offenderLinearLinear
  • OutputPost daily flake summary to SlackSlack

What it does

This template watches your test suite's reliability over time. On a schedule it queries Datadog CI Visibility for per-test flake rates over a rolling window, finds tests whose flakiness has crossed a threshold you set, and escalates them so they get owned and fixed instead of silently rotting.

When to use it

Use it when you already ship CI test results to Datadog and want a standing watchdog that catches degrading tests early, rather than waiting for them to block a release.

How it works

  1. 1A schedule trigger fires the check (e.g. every morning).
  2. 2The flow queries Datadog CI Visibility for flake rate per test across the rolling window.
  3. 3A logic step filters to tests above the threshold and dedupes ones already tracked.
  4. 4For each new offender, Linear creates a ticket with the test name, flake rate, and recent failing run links.
  5. 5A Slack message posts the day's new and worsening flakes to the team channel.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect DatadogMetrics, traces, log search.
  2. 2
    Connect LinearIssues, projects, cycles, triage.
  3. 3
    Connect SlackChannels, DMs, threads, mentions.
  4. 4
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  5. 5
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  6. 6
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.