
Weekly Error-Budget Digest to Confluence

Each Monday, this workflow rolls up burn rate and remaining budget across all tracked Datadog SLOs, ranks services by exhaustion risk, and publishes the digest to a Confluence page.

Category: Engineering
Engine: sim
Difficulty: intermediate
Trigger: schedule
Steps: 5
Setup: ~15 min

How it runs

The automated pipeline, trigger to output.

  • Trigger: Weekly Monday schedule
  • Action: Fetch all SLOs and burn rates (Datadog)
  • Logic: Rank services by exhaustion risk
  • Action: Publish digest page (Confluence)
  • Output: Link digest in Slack

What it does

It produces the weekly reliability report a manager would otherwise assemble by hand. It gathers every tracked SLO's remaining budget and burn rate, computes a projected exhaustion date per service, sorts the fleet from most-at-risk to healthiest, and publishes a clean Confluence page the team can review and link to.
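
The projected exhaustion date mentioned above can be estimated in a few lines. This is a minimal sketch, not the workflow's actual logic: it assumes burn rate is expressed as a multiple of the sustainable rate (1.0 uses up exactly the whole budget over one SLO window) and that the trailing burn rate holds steady. The function name, the 30-day default window, and the 10-year cutoff are illustrative assumptions.

```python
from datetime import date, timedelta
from typing import Optional

# Assumption: projections further out than this are reported as "no date".
MAX_HORIZON_DAYS = 3650


def projected_exhaustion(
    remaining_fraction: float,  # share of the error budget left, 0.0 to 1.0
    burn_rate: float,           # 1.0 = budget lasts exactly one window
    today: date,
    window_days: int = 30,      # assumed SLO window length
) -> Optional[date]:
    """Estimate when the budget runs out if the current burn rate continues."""
    if remaining_fraction <= 0:
        return today  # budget is already spent
    if burn_rate <= 0:
        return None   # nothing is being consumed, so no exhaustion date
    # At burn rate b the full budget lasts window_days / b days,
    # so the remaining share lasts remaining_fraction times that.
    days_left = remaining_fraction * window_days / burn_rate
    if days_left > MAX_HORIZON_DAYS:
        return None   # too far out to be a meaningful date
    return today + timedelta(days=int(days_left))
```

For example, a service with half its budget left and a burn rate of 2.0 on a 30-day window has about 7.5 days of budget remaining.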

When to use it

Use it when you run reliability reviews across many services and want a single source-of-truth page each week instead of scattered dashboards. It is reporting, not paging, so it summarizes the whole portfolio rather than reacting to one alert.

How it works

  1. A weekly Monday schedule starts the run.
  2. It fetches the list of SLOs and each one's remaining budget and trailing burn rate from Datadog.
  3. A logic step computes projected exhaustion per service and ranks them by risk, tagging any projected to exhaust this week.
  4. It renders a digest with a ranked table, week-over-week deltas, and the at-risk callouts.
  5. It publishes or updates the dated Confluence page and links it in Slack.
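
The ranking and rendering steps above could look roughly like the sketch below. It is an illustration under stated assumptions rather than the workflow's implementation: the `SloSnapshot` fields, the 30-day window, the 7-day "at risk" threshold, and the plain-text table layout are all hypothetical, and the data would come from the Datadog fetch step rather than being hard-coded.

```python
import math
from dataclasses import dataclass
from typing import List

WINDOW_DAYS = 30       # assumed SLO window length
AT_RISK_DAYS = 7       # assumed meaning of "projected to exhaust this week"


@dataclass
class SloSnapshot:
    service: str
    remaining_pct: float       # error budget remaining now, 0 to 100
    prev_remaining_pct: float  # error budget remaining one week ago
    burn_rate: float           # trailing burn rate, 1.0 = exactly on budget


def days_to_exhaustion(s: SloSnapshot) -> float:
    """Days until the budget is spent if the trailing burn rate continues."""
    if s.remaining_pct <= 0:
        return 0.0
    if s.burn_rate <= 0:
        return math.inf
    return (s.remaining_pct / 100) * WINDOW_DAYS / s.burn_rate


def rank(snapshots: List[SloSnapshot]) -> List[SloSnapshot]:
    """Most-at-risk first: fewest projected days of budget left."""
    return sorted(snapshots, key=days_to_exhaustion)


def render(snapshots: List[SloSnapshot]) -> str:
    """Render a ranked plain-text table with week-over-week deltas."""
    lines = [f"{'Service':<12}{'Remaining':>10}{'WoW':>8}{'Days left':>11}  Flag"]
    for s in rank(snapshots):
        days = days_to_exhaustion(s)
        delta = s.remaining_pct - s.prev_remaining_pct
        days_txt = "n/a" if math.isinf(days) else f"{days:.1f}"
        flag = "AT RISK" if days < AT_RISK_DAYS else ""
        row = f"{s.service:<12}{s.remaining_pct:>9.1f}%{delta:>+8.1f}{days_txt:>11}  {flag}"
        lines.append(row.rstrip())
    return "\n".join(lines)


if __name__ == "__main__":
    # Made-up sample data for illustration only.
    fleet = [
        SloSnapshot("search", 90.0, 92.0, 0.3),
        SloSnapshot("checkout", 20.0, 55.0, 4.0),
        SloSnapshot("auth", 60.0, 70.0, 1.2),
    ]
    print(render(fleet))
```

Sorting on projected days left, rather than on remaining budget alone, puts a service that is burning quickly ahead of one that has less budget but is stable.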

Set it up

What you configure once, before turning it on.

  1. Connect Datadog: Metrics, traces, log search.
  2. Connect Confluence: Spaces, pages, blueprints.
  3. Connect Slack: Channels, DMs, threads, mentions.
  4. Set each agent's model: We leave models unset so you pick the tier: fast and cheap, or top-quality.
  5. Tune it to your data: Edit the prompts, filters, and field mappings so it matches how your team works.
  6. Test, then turn it on: Run once against a sample, confirm the output, then enable the trigger.
