DEVOPS

Hourly Datadog error-budget burn digest to Slack

Every hour, pulls each service's SLO status and burn rate from Datadog and posts a single ranked Slack digest showing remaining budget and which services are burning fast enough…

CategoryDevOps

Enginesim

Difficultybeginner

Triggerschedule

Steps5

Setup~5 min

How it runs

The automated pipeline, trigger to output.

TriggerHourly schedule
ActionFetch Datadog SLO status + burn ratesDatadog
LogicCompute remaining budget + time-to-exhaustion
LogicRank services worst-first
OutputPost ranked digest to SlackSlack

What it does

Gives the team one tidy hourly readout of error-budget health across all your tracked SLOs. It reads Datadog SLO status and short-window burn rates, computes time-to-exhaustion per service, and posts a ranked Slack message so on-call sees trouble before it becomes an incident.

When to use it

Use it when you manage many SLOs in Datadog and want proactive visibility instead of waiting for a monitor to page. Good for platform teams running a shared #reliability channel.

How it works

1A schedule fires once per hour.
2The flow fetches SLO status objects and current burn rates from Datadog for the configured SLO IDs.
3A logic step computes remaining budget and projected exhaustion time, then flags any service burning faster than the window allows.
4It sorts services worst-first and formats a compact digest with budget percentages and burn multipliers.
5The digest posts to the chosen Slack channel; if nothing is at risk it can post a brief all-clear line.

Set it up

What you configure once, before turning it on.

1
Connect DatadogMetrics, traces, log search.
2
Connect SlackChannels, DMs, threads, mentions.
3
Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
4
Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
5
Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

More DevOps workflows

Block costly Hugging Face Space hardware upgrades in PR review

When a pull request changes a Space's hardware config, it estimates the new monthly cost and posts a GitHub PR comment that flags upgrades crossing a budget ceiling.

Auto-spin a Zoom war-room when PagerDuty hits SEV-1

When a PagerDuty incident escalates to a critical severity, this workflow creates a dedicated Zoom meeting and posts the bridge link to the incident's Slack channel so responders…

Page on-call when a Hugging Face Space build is stuck or errored

Polls Hugging Face Space runtime status on a schedule and opens a PagerDuty incident when a Space sits in a build or error state past a deadline, with a Slack heads-up.

Slack-approved pause for idle Hugging Face Spaces

On a daily scan it finds idle paid Spaces and posts an interactive Slack approval; on approve it pauses the Space and logs the decision to a GitHub issue audit trail.

Hugging Face Spaces idle-runtime sweep with auto-pause

On a schedule, scans all Hugging Face Spaces for ones running idle past a threshold, pauses them to stop billing, and posts a Slack summary with the estimated monthly savings.

Open a Zoom war-room from a Datadog multi-alert storm

When a Datadog monitor crosses a critical threshold, this workflow dedupes against active incidents, and only for a genuinely new outage it creates a Zoom bridge.

Browse all DevOps →

Run it inside a business

This workflow drops into a full company template. Import the org, and this is one of the playbooks its agents run.

Finance

Research & Trading Desk

Governance-first research, execution, and risk — every trade on the audit trail.

Operations

Internal Operations

Runbooks, on-call, vendor management — disciplined and audited.

Software

Agent Hive runs Agent Hive

The team that built Agent Hive, exactly as it runs today.

Browse all business templates →Solutions by industry →

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.

Join the Waitlist Browse all workflows →