AI AGENTS

Sentry Error Cluster to Failing-Test GitLab MR

When a Sentry issue crosses an event threshold, an agent reproduces the crash, writes a failing test that captures it, and opens a draft GitLab merge request with the repro.

CategoryAI Agents
Enginepaperclip
Difficultyadvanced
Triggerevent
Steps6
Setup~25 min

How it runs

The automated pipeline, trigger to output.

  • TriggerSentry issue crosses event-count thresholdSentrySentry
  • ActionFetch stack trace, breadcrumbs, and framesSentrySentry
  • ActionClone repo and reproduce in shell sandboxShell
  • LogicContinue only if error signature matches
  • ActionWrite failing test and commit to branchShell
  • OutputOpen draft GitLab MR with repro + failing testGitLabGitLab

What it does

Watches Sentry for newly escalating error clusters and turns them into actionable engineering work. The agent reads the stack trace, reproduces the failure locally in a sandboxed shell, authors a failing test that pins the bug, and opens a draft GitLab merge request so a human can pick up a confirmed, test-backed defect instead of a raw alert.

When to use it

Use it when your team drowns in Sentry alerts and wants only reproducible, test-backed bugs to reach the MR queue. Ideal for backend services with a fast local test harness.

How it works

  1. 1A Sentry issue alert fires when an error cluster passes its event-count threshold.
  2. 2The agent pulls the full stack trace, breadcrumbs, and offending frame from the Sentry issue.
  3. 3It clones the repo and runs a shell sandbox to reproduce the exception from the captured inputs.
  4. 4Logic gate: only continue if the repro actually raises the same error signature.
  5. 5The agent writes a failing test asserting the buggy behavior and commits it to a branch.
  6. 6It opens a draft GitLab MR linking the Sentry issue, with the failing test and repro notes in the description.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect SentryErrors, performance, releases.
  2. 2
    Connect GitLabRepos, MRs, pipelines, registry.
  3. 3
    Connect ShellRun sandboxed commands inside the workspace.
  4. 4
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  5. 5
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  6. 6
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.