Datadog Custom Metrics Budget Watchdog

Hourly agent that tracks Datadog custom metric counts against your contracted budget and alerts Slack with the top tag dimensions driving the overage before billing surprises hit.

Category: AI Agents
Engine: sim
Difficulty: intermediate
Trigger: schedule
Steps: 5
Setup: ~15 min

How it runs

The automated pipeline, trigger to output.

  • Trigger: Hourly schedule fires
  • Action: Pull Datadog custom metric counts and tag cardinality (Datadog)
  • Logic: Compare against budget and burn rate
  • Logic: Identify top metric and tag combinations driving overage
  • Output: Send Slack alert with offending dimensions and suggested fix (Slack)

What it does

Datadog bills on custom metrics, and a single high-cardinality tag can quietly multiply your billable timeseries overnight. This agent polls your custom metric usage hourly and compares it against the budget you set. When usage crosses a threshold, it surfaces exactly which metric names and tag keys are responsible so you can act before the monthly true-up.
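
To see why one tag can matter so much, here is a rough worst-case estimate. It assumes the number of timeseries for a metric is bounded by the product of the distinct values of each tag key. The tag names and counts are made up, and `max_timeseries` is a hypothetical helper, not part of any Datadog client. Real counts are usually below this bound because not every tag combination actually occurs.

```python
from math import prod


def max_timeseries(tag_cardinalities: dict[str, int]) -> int:
    """Upper bound on timeseries for one metric.

    Each distinct combination of tag values is its own timeseries, so the
    worst case is the product of the distinct-value counts per tag key.
    """
    return prod(tag_cardinalities.values())


# Hypothetical metric tagged by env, service, and region.
before = max_timeseries({"env": 3, "service": 20, "region": 4})

# The same metric after someone adds a customer_id tag with 500 values.
after = max_timeseries(
    {"env": 3, "service": 20, "region": 4, "customer_id": 500}
)

print(before, after)  # 240 120000
```

In this illustration, a single added tag key takes the metric from at most 240 timeseries to at most 120,000.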

When to use it

Use it if you've ever been surprised by a Datadog custom metrics overage bill. Ideal for teams on a committed custom-metrics tier who want an early-warning trip wire rather than a postmortem.

How it works

  1. An hourly schedule triggers the check.
  2. The agent pulls active custom metric counts and per-metric tag cardinality from the Datadog API.
  3. A logic step compares the current billable timeseries total against your configured budget and burn rate.
  4. If usage is on track to exceed budget, it identifies the top metric-and-tag combinations contributing to the spike.
  5. It sends a Slack alert listing the offending dimensions, current versus budgeted count, and a suggested tag to drop or aggregate.

Set it up

What you configure once, before turning it on.

  1. Connect Datadog: metrics, traces, log search.
  2. Connect Slack: channels, DMs, threads, mentions.
  3. Set each agent's model: we leave models unset so you pick the tier, either fast and cheap or top-quality.
  4. Tune it to your data: edit the prompts, filters, and field mappings so it matches how your team works.
  5. Test, then turn it on: run once against a sample, confirm the output, then enable the trigger.
