ENGINEERING

Daily error-budget burn digest across services to Confluence and Slack

Each morning it pulls remaining SLO budget and burn rate for every tracked service from Datadog, ranks them by risk, writes a dated report to Confluence.

CategoryEngineering
Enginesim
Difficultybeginner
Triggerschedule
Steps5
Setup~5 min

How it runs

The automated pipeline, trigger to output.

  • TriggerSchedule fires each morning
  • ActionQuery Datadog budget + burn per serviceDatadogDatadog
  • LogicRank services by risk, flag below floor
  • ActionPublish dated digest page to ConfluenceConfluenceConfluence
  • OutputPost top at-risk services to SlackSlack

What it does

This workflow produces a single daily snapshot of error-budget health across the whole fleet. It reads each service's remaining budget and current burn rate from Datadog, sorts the services from most to least at risk, publishes a dated page to Confluence for the record, and drops a short summary of the riskiest services into Slack so the team starts the day knowing where the budget is tight.

When to use it

Use this when you run many services with SLOs and want a recurring reliability standup artifact instead of asking everyone to open dashboards. It is reporting, not gating, so it never blocks a deploy.

How it works

  1. 1A schedule trigger fires once each morning.
  2. 2The workflow queries Datadog for remaining budget and burn rate per tracked service.
  3. 3A logic step ranks services by risk and flags any below the warning floor.
  4. 4It writes a dated digest page to Confluence with the full table.
  5. 5It posts the top at-risk services and a link to the full page in Slack.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect DatadogMetrics, traces, log search.
  2. 2
    Connect ConfluenceSpaces, pages, blueprints.
  3. 3
    Connect SlackChannels, DMs, threads, mentions.
  4. 4
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  5. 5
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  6. 6
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.