AI AGENTS

Shell-Gated Security Patch Agent with PagerDuty Escalation

On a security advisory webhook, the agent applies the minimal patched version, runs tests in a sandboxed shell, opens a GitLab MR if green.

CategoryAI Agents
Enginepaperclip
Difficultyadvanced
Triggerwebhook
Steps5
Setup~25 min

How it runs

The automated pipeline, trigger to output.

  • TriggerSecurity advisory webhook receivedHTTP webhook
  • ActionApply minimal patched version in sandboxed shell and run testsShell
  • LogicBranch on test pass or fail
  • ActionOpen GitLab MR tagged as security fixGitLabGitLab
  • OutputEscalate failed patch to on-call via PagerDutyPagerDutyPagerDuty

What it does

This agent reacts to inbound security advisories. It computes the smallest version bump that clears the vulnerability, validates it in a sandboxed shell, and either ships a merge request or escalates to on-call when the patch cannot be applied cleanly.

When to use it

Use it when CVE response time matters and you need a decision in minutes, not days. It auto-resolves the easy patches and surfaces only the genuinely blocked ones to a human.

How it works

  1. 1A security advisory webhook delivers the affected package and fixed version range.
  2. 2The agent selects the minimal patched version and applies it in a sandboxed shell, then runs the test suite.
  3. 3A logic gate branches on the result.
  4. 4If tests pass, it opens a GitLab MR tagged as a security fix and returns the link.
  5. 5If tests fail, it triggers a PagerDuty incident with the package, advisory, and failing test output so on-call can intervene.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect HTTP webhookTrigger any URL on agent actions.
  2. 2
    Connect ShellRun sandboxed commands inside the workspace.
  3. 3
    Connect GitLabRepos, MRs, pipelines, registry.
  4. 4
    Connect PagerDutyIncidents, on-call, escalations.
  5. 5
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  6. 6
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  7. 7
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.