ENGINEERING

Budget-Recovery Freeze Lift and Deploy Gate

On each pull request, this workflow checks whether the service's Datadog error budget has recovered above the safe threshold.

CategoryEngineering
Enginesim
Difficultyadvanced
Triggerevent
Steps5
Setup~25 min

How it runs

The automated pipeline, trigger to output.

  • TriggerPull request opened or updatedGitHubGitHub
  • ActionRead remaining budget and burnDatadogDatadog
  • LogicCompare to recovery threshold
  • ActionSet commit status pass or blockGitHubGitHub
  • OutputNotify PR author in SlackSlack

What it does

This closes the loop after a feature freeze. Rather than a human deciding when shipping is safe again, it gates pull requests on live budget health: it reads the current remaining error budget from Datadog and reports a GitHub commit status that either lifts the freeze (budget recovered) or holds it (still below threshold).

When to use it

Use it once a freeze has been declared and you want merges blocked until the budget genuinely recovers, with the decision driven by data and enforced in the PR itself. It complements the freeze-trigger workflow that opens the freeze in the first place.

How it works

  1. 1A GitHub pull-request event triggers the check.
  2. 2It reads the service's remaining error budget and recent burn rate from Datadog.
  3. 3A logic step compares remaining budget against the recovery threshold and decides pass or block.
  4. 4It posts a GitHub commit status: success with a "freeze lifted" note when recovered, or failure with current budget and the threshold when not.
  5. 5It notifies the PR author in Slack so they know why the gate passed or held.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect GitHubRepos, issues, pull requests, actions.
  2. 2
    Connect DatadogMetrics, traces, log search.
  3. 3
    Connect SlackChannels, DMs, threads, mentions.
  4. 4
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  5. 5
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  6. 6
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.