CHATBOTS

Datadog "Which Deploy Broke This?" Correlator

An engineer asks the bot which recent deploy likely caused a Datadog regression, and it correlates the metric's degradation timestamp against GitHub deploy events to name…

CategoryChatbots
Enginepaperclip
Difficultyadvanced
Triggerchat
Steps5
Setup~25 min

How it runs

The automated pipeline, trigger to output.

  • TriggerEngineer asks which deploy caused a regressionSlack
  • ActionQuery metric time series to find inflection pointDatadogDatadog
  • ActionFetch deploy events and release tags in windowGitHubGitHub
  • LogicScore and rank deploys by proximity and service match
  • OutputReply with ranked culprits and linksSlack

What it does

Answers the single most common on-call question: which change broke the graph. The bot takes a Datadog monitor or metric and the time things went wrong, then mathematically aligns the degradation window with GitHub deploy markers to point at the deploy that best explains the regression.

When to use it

Use it when a metric clearly degraded but the cause is not obvious and multiple teams shipped recently. Best for services with frequent deploys where eyeballing the timeline is error-prone.

How it works

The operator messages the bot with a metric or monitor and an approximate "it got bad around" time. The workflow queries Datadog for the metric time series to pinpoint the actual inflection point. It then fetches GitHub deploy events and release tags spanning that window. A logic step scores each deploy by temporal proximity and whether it touched the alerting service, and filters out unrelated changes. The bot replies in Slack with a ranked culprit list, the confidence for each, and direct links to the suspect deploys and the metric graph.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect SlackChannels, DMs, threads, mentions.
  2. 2
    Connect DatadogMetrics, traces, log search.
  3. 3
    Connect GitHubRepos, issues, pull requests, actions.
  4. 4
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  5. 5
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  6. 6
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.