DEVOPS

Turn a Datadog cost-anomaly monitor into a routed Slack triage thread

When a Datadog cloud-cost anomaly monitor fires, it pulls the spiking service's tags to identify the owning team, posts an alert to that team's Slack channel.

CategoryDevOps
Enginesim
Difficultybeginner
Triggerwebhook
Steps4
Setup~5 min

How it runs

The automated pipeline, trigger to output.

  • TriggerDatadog cost-anomaly monitor firesDatadogDatadog
  • ActionFetch resource tags and spend timeseriesDatadogDatadog
  • LogicResolve owning team to Slack channel
  • OutputPost routed triage thread to SlackSlack

What it does

Listens for Datadog cloud-cost anomaly monitor alerts, reads the service and team tags on the offending resource, and routes a triage message to the right team's Slack channel instead of a generic firehose. The Slack post opens a thread so the owning team can investigate in place.

When to use it

Use it when you already track cloud spend metrics in Datadog and want anomaly alerts delivered to the team that actually owns the resource, with enough context to act immediately rather than a bare "cost is high" ping.

How it works

  1. 1A Datadog cost-anomaly monitor fires and posts to the workflow webhook.
  2. 2An action enriches the alert by querying Datadog for the resource's `team` and `service` tags and the spend timeseries.
  3. 3A logic step maps the `team` tag to its Slack channel; unmapped or untagged resources fall through to a default FinOps channel.
  4. 4An output step posts a Slack message to the resolved channel with the spend graph link, the anomaly window, and the top cost-driving dimensions.
  5. 5The message starts a thread the team can reply in to claim and resolve the spike.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect DatadogMetrics, traces, log search.
  2. 2
    Connect SlackChannels, DMs, threads, mentions.
  3. 3
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  4. 4
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  5. 5
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.