DEVOPS

Stuck-Pipeline Lead-Time Guard with PagerDuty Escalation

Hourly, scans GitLab for pipelines stuck or pending far beyond normal, and when a production deploy's lead time blows past the SLA it triggers a PagerDuty incident so on-call can…

CategoryDevOps
Enginesim
Difficultyadvanced
Triggerschedule
Steps5
Setup~25 min

How it runs

The automated pipeline, trigger to output.

  • TriggerHourly schedule
  • ActionFetch running and pending production pipelinesGitLabGitLab
  • LogicCompute elapsed time and lead-time impact
  • LogicKeep pipelines past the SLA threshold
  • OutputTrigger PagerDuty incident per breachPagerDutyPagerDuty

What it does

This protects lead time at the source by catching deploys that are stalling in the pipeline. Instead of measuring after the fact, it detects in-flight production pipelines that have exceeded their expected duration and escalates them before they wreck the team's DORA numbers.

When to use it

Use it when slow or hung CI/CD is your dominant lead-time risk and you want on-call to intervene on a blocked production deploy rather than discovering it in a weekly report.

How it works

  1. 1An hourly schedule starts the scan.
  2. 2An action queries the GitLab API for running and pending pipelines targeting production environments.
  3. 3A compute step calculates each pipeline's elapsed time from creation and projects its lead-time impact.
  4. 4A logic branch keeps only pipelines that have exceeded the configured SLA duration.
  5. 5For each breaching pipeline, the output step triggers a PagerDuty incident with the project, pipeline link, elapsed time, and owning team, routing it to the platform on-call escalation policy.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect GitLabRepos, MRs, pipelines, registry.
  2. 2
    Connect PagerDutyIncidents, on-call, escalations.
  3. 3
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  4. 4
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  5. 5
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.