ENGINEERING

Open a Flake Issue When Datadog Flake Rate Crosses Threshold

Runs on a schedule, queries Datadog CI Visibility for each test's flake rate over the last 7 days.

CategoryEngineering
Enginesim
Difficultyintermediate
Triggerschedule
Steps6
Setup~15 min

How it runs

The automated pipeline, trigger to output.

  • TriggerDaily schedule fires
  • ActionQuery Datadog CI Visibility for 7-day flake rate by testDatadogDatadog
  • LogicKeep tests above flake-rate threshold
  • LogicDedupe against tests with an open issue
  • ActionUpsert Linear issue with flake rate and dashboard linkLinearLinear
  • OutputPost newly flagged tests digest to SlackSlack

What it does

Instead of reacting to a single failure, this watches the trend. It pulls per-test flake rates from Datadog CI Visibility and escalates only tests that are statistically flaky over a rolling window — cutting noise from one-off failures while catching chronic offenders.

When to use it

Use it when you already ship test results to Datadog and want a data-driven quarantine policy rather than knee-jerk reactions to individual red builds.

How it works

  1. 1A scheduled trigger fires every morning.
  2. 2An action queries the Datadog CI Visibility API for flake rate by test over the last 7 days.
  3. 3A logic step filters to tests above your configured flake-rate threshold (e.g. >5%).
  4. 4A second logic step deduplicates against tests that already have an open issue.
  5. 5An action upserts a Linear issue per qualifying test, with the current flake rate, run count, and a Datadog dashboard link.
  6. 6A final output posts a digest of newly flagged tests to the team's Slack channel.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect DatadogMetrics, traces, log search.
  2. 2
    Connect LinearIssues, projects, cycles, triage.
  3. 3
    Connect SlackChannels, DMs, threads, mentions.
  4. 4
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  5. 5
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  6. 6
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.