DEVOPS
Agent root-causes a cache regression and files a fix ticket
When a sustained cache hit-rate drop is confirmed, an agent investigates the offending GitLab config change, drafts a root-cause analysis with a concrete fix.
How it runs
The automated pipeline, trigger to output.
- TriggerConfirmed cache regression webhookHTTP webhook
- ActionLoad regression context and cache metricsPostgres
- ActionFetch suspect commit and CI configGitLab
- LogicAgent reasons root cause and drafts fix
- ActionFile RCA ticket assigned to owning teamLinear
- OutputPost RCA summary to SlackSlack
What it does
On a confirmed cache regression, an agent pulls the suspect config change, the cache stats before and after, and the affected jobs, then reasons about the root cause: an over-broad cache key, a missing fallback key, a reordered install step, or a base-image bump. It drafts a root-cause analysis with a proposed fix, files a Linear ticket assigned to the owning team, and posts a summary to Slack.
When to use it
Use it when you want more than an alert. The deterministic detectors say what broke; this agent explains why and turns it into actionable, owned work so the fix actually gets scheduled rather than rediscovered each sprint.
How it works
- 1A webhook fires when a confirmed regression event is emitted.
- 2The agent reads the regression context and cache metrics from Postgres.
- 3It fetches the suspect commit and surrounding CI config from GitLab.
- 4It reasons through likely causes and drafts an RCA with a concrete fix.
- 5It files a Linear ticket with the analysis and assigns the owning team.
- 6It posts a Slack summary linking the ticket and the offending change.
Set it up
What you configure once, before turning it on.
- 1Connect GitLabRepos, MRs, pipelines, registry.
- 2Connect PostgresAny Postgres URL — query, write, migrate.
- 3Connect LinearIssues, projects, cycles, triage.
- 4Connect SlackChannels, DMs, threads, mentions.
- 5Connect HTTP webhookTrigger any URL on agent actions.
- 6Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
- 7Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
- 8Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.
More DevOps workflows
Hugging Face Spaces idle-runtime sweep with auto-pause
On a schedule, scans all Hugging Face Spaces for ones running idle past a threshold, pauses them to stop billing, and posts a Slack summary with the estimated monthly savings.
Slack-approved pause for idle Hugging Face Spaces
On a daily scan it finds idle paid Spaces and posts an interactive Slack approval; on approve it pauses the Space and logs the decision to a GitHub issue audit trail.
Generate a weekly de-flake report and assign Linear cleanup tickets
On a weekly schedule, aggregates the current quarantine manifest and recent flake history, builds a prioritized report.
Block costly Hugging Face Space hardware upgrades in PR review
When a pull request changes a Space's hardware config, it estimates the new monthly cost and posts a GitHub PR comment that flags upgrades crossing a budget ceiling.
Auto-release tests from quarantine once they prove stable
Triggered by a webhook from a nightly stability runner, checks whether quarantined tests have passed enough consecutive runs, removes the stable ones from quarantine in GitHub.
Quarantine a test on demand from a PR comment command
Triggered when an engineer comments a quarantine command on a pull request, validates the test name, commits the quarantine change to that PR branch, opens a tracking issue.
Run it inside a business
This workflow drops into a full company template. Import the org, and this is one of the playbooks its agents run.

Run this workflow in your colony.
14-day trial. No DevOps. No Sales call. Provisioned in under a minute.
