
Cost-Anomaly PagerDuty Trigger with Suspect Deploy

Reacts to a Datadog cost-anomaly alert and identifies the single most likely deploy that caused it.

Category: AI Agents
Engine: paperclip
Difficulty: intermediate
Trigger: webhook
Steps: 5
Setup: ~15 min

How it runs

The automated pipeline, trigger to output.

  • Trigger: Datadog cost-anomaly webhook (Datadog)
  • Action: Extract affected service and anomaly window
  • Action: Query GitLab deploys and MRs in window (GitLab)
  • Logic: Score and select most likely suspect deploy
  • Output: Open PagerDuty incident with suspect and rollback (PagerDuty)
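
The extraction step in the pipeline above could be sketched roughly as follows. Datadog webhook bodies are defined by the template you configure, so the field names here (`service`, `anomaly_start_ts`, `anomaly_end_ts`) are hypothetical, and the `lookback` padding is an assumption added for illustration, not something the workflow is known to do.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone


@dataclass(frozen=True)
class AnomalyWindow:
    service: str
    start: datetime
    end: datetime


def parse_alert(payload: dict, lookback: timedelta = timedelta(hours=2)) -> AnomalyWindow:
    """Extract the affected service and anomaly window from a webhook body.

    The payload keys are hypothetical; map them to whatever your
    Datadog webhook template actually sends.
    """
    service = payload["service"]
    start = datetime.fromtimestamp(payload["anomaly_start_ts"], tz=timezone.utc)
    end = datetime.fromtimestamp(payload["anomaly_end_ts"], tz=timezone.utc)
    if end < start:
        raise ValueError("anomaly window ends before it starts")
    # Assumption: widen the start, since a deploy may land some time
    # before the extra spend becomes visible.
    return AnomalyWindow(service=service, start=start - lookback, end=end)


sample = {
    "service": "checkout-api",
    "anomaly_start_ts": 1700000000,
    "anomaly_end_ts": 1700003600,
}
window = parse_alert(sample)
```

The resulting `window` is what the next step would use to bound the GitLab query.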

What it does

Turns a noisy cost-anomaly alert into an actionable incident. When Datadog flags abnormal spend on a service, the agent immediately looks at what shipped to that service, picks the single most suspicious deploy, and opens a PagerDuty incident that already names the suspect change and links the rollback path — so the on-call engineer starts with a hypothesis, not a blank page.

When to use it

Use this when cost anomalies need an owner fast and you don't want them buried in a Slack channel overnight. Best for teams that already route reliability incidents through PagerDuty and want runaway spend treated with the same urgency.

How it works

  1. A Datadog cost-anomaly monitor fires a webhook into the workflow.
  2. The agent reads the alert to extract the affected service and the anomaly window.
  3. It queries GitLab for deploys and MRs touching that service inside the window.
  4. It scores each candidate and selects the single most likely trigger.
  5. It opens a PagerDuty incident titled with the suspect deploy, attaches the diff and rollback link, and sets urgency from the spike size.
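
Steps 4 and 5 leave the scoring rule and the urgency mapping open. The sketch below shows one plausible shape for both, assuming a simple weighted heuristic. The `Deploy` fields, the weights, and the `high_threshold` value are all illustrative assumptions; in the workflow itself the agent's prompt decides how candidates are ranked.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone
from typing import Optional, Sequence


@dataclass(frozen=True)
class Deploy:
    sha: str
    deployed_at: datetime
    lines_changed: int
    touches_infra: bool  # e.g. replica counts, instance sizes, autoscaling


def score(deploy: Deploy, window_start: datetime) -> float:
    """Hypothetical heuristic: closer, larger, infra-touching deploys rank higher."""
    gap_hours = abs((window_start - deploy.deployed_at).total_seconds()) / 3600
    proximity = 1.0 / (1.0 + gap_hours)          # 1.0 at the window start, decays with distance
    size = min(deploy.lines_changed / 500, 1.0)  # capped so huge diffs do not dominate
    infra = 1.0 if deploy.touches_infra else 0.0
    return 0.5 * proximity + 0.2 * size + 0.3 * infra


def pick_suspect(deploys: Sequence[Deploy], window_start: datetime) -> Optional[Deploy]:
    """Return the single highest-scoring deploy, or None if nothing shipped."""
    if not deploys:
        return None
    return max(deploys, key=lambda d: score(d, window_start))


def urgency_for(spike_ratio: float, high_threshold: float = 2.0) -> str:
    """Map spike size (observed spend / expected spend) to an urgency level."""
    return "high" if spike_ratio >= high_threshold else "low"


start = datetime(2024, 1, 1, 12, 0, tzinfo=timezone.utc)
candidates = [
    Deploy("aaa1111", start - timedelta(hours=3), 50, False),
    Deploy("bbb2222", start - timedelta(minutes=30), 200, True),
]
suspect = pick_suspect(candidates, start)
```

Returning `None` when no deploy falls in the window keeps the "no suspect" case explicit, so the incident can say so rather than naming an arbitrary change.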

Set it up

What you configure once, before turning it on.

  1. Connect Datadog: metrics, traces, log search.
  2. Connect GitLab: repos, MRs, pipelines, registry.
  3. Connect PagerDuty: incidents, on-call, escalations.
  4. Set each agent's model: we leave models unset so you pick the tier, either fast + cheap or top-quality.
  5. Tune it to your data: edit the prompts, filters, and field mappings so it matches how your team works.
  6. Test, then turn it on: run once against a sample, confirm the output, then enable the trigger.
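
For the sample run in the last step, one way to confirm the output without paging anyone is to build the incident body and inspect it before enabling the trigger. The sketch below assembles a body in the shape of the PagerDuty Events API v2; whether the PagerDuty connector in this workflow uses that API is an assumption, and the routing key, URLs, and `build_event` helper are placeholders.

```python
import json


def build_event(
    routing_key: str,
    service: str,
    suspect_sha: str,
    diff_url: str,
    rollback_url: str,
    severity: str = "warning",
) -> dict:
    """Build (but do not send) an Events API v2 style trigger event."""
    return {
        "routing_key": routing_key,
        "event_action": "trigger",
        "payload": {
            # The summary becomes the incident title, so it names the suspect.
            "summary": f"Cost anomaly on {service}: suspect deploy {suspect_sha}",
            "source": service,
            "severity": severity,
        },
        "links": [
            {"href": diff_url, "text": "Suspect diff"},
            {"href": rollback_url, "text": "Rollback"},
        ],
    }


# Placeholder values for a dry run; nothing is sent anywhere.
event = build_event(
    routing_key="YOUR_ROUTING_KEY",
    service="checkout-api",
    suspect_sha="abc1234",
    diff_url="https://gitlab.example.com/group/checkout-api/-/commit/abc1234",
    rollback_url="https://gitlab.example.com/group/checkout-api/-/pipelines",
)
preview = json.dumps(event, indent=2)
```

Reviewing `preview` against a known past anomaly is a cheap check that the title, diff link, and rollback link are populated the way your on-call engineers expect.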
