AI AGENTS

Sentry Repro Attempt Logger with Confidence Scoring

On each new Sentry issue, an agent attempts reproduction, scores its confidence, logs every attempt to Postgres for analytics.

CategoryAI Agents
Enginesim
Difficultyintermediate
Triggerevent
Steps6
Setup~15 min

How it runs

The automated pipeline, trigger to output.

  • TriggerSentry new-issue alert firesSentrySentry
  • ActionFetch trace and run timed shell reproShell
  • ActionCompute repro confidence score
  • ActionLog attempt record to PostgresPostgreSQLPostgres
  • LogicOpen MR only above confidence threshold
  • OutputWrite failing test and open GitLab MRGitLabGitLab

What it does

Instruments the auto-reproduction pipeline itself. For every new Sentry issue, the agent runs a repro attempt, assigns a confidence score, and writes a structured record (issue, attempt result, score, duration) to Postgres so you can measure repro success rate over time. It opens a GitLab MR with a failing test only when the confidence score clears your configured threshold.

When to use it

Use it when you are rolling out auto-reproduction and need data on how often it works before trusting it to file MRs automatically, plus an audit trail of every attempt.

How it works

  1. 1A Sentry alert fires on a newly created issue.
  2. 2The agent fetches the trace and runs a shell repro attempt, timing it.
  3. 3It computes a confidence score from the repro outcome and signal match.
  4. 4The agent logs the full attempt record to a Postgres analytics table.
  5. 5Logic gate: open an MR only if the confidence score clears the threshold.
  6. 6On a high-confidence repro it writes the failing test and opens a GitLab MR.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect SentryErrors, performance, releases.
  2. 2
    Connect ShellRun sandboxed commands inside the workspace.
  3. 3
    Connect PostgresAny Postgres URL — query, write, migrate.
  4. 4
    Connect GitLabRepos, MRs, pipelines, registry.
  5. 5
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  6. 6
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  7. 7
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.