Edge Box Recovery Auto-Resolve and Thread Close

When a previously silent edge box resumes sending heartbeats, this workflow resolves its PagerDuty incident, posts an all-clear to the Discord triage thread, and closes the incident record.

Category: IT Ops
Engine: sim
Difficulty: intermediate
Trigger: webhook
Steps: 5
Setup: ~15 min

How it runs

The automated pipeline, trigger to output.

  • Trigger: Heartbeat arrives from a down device (HTTP webhook)
  • Logic: Confirm open incident and compute downtime
  • Action: Resolve matching PagerDuty incident (PagerDuty)
  • Output: Post all-clear and archive Discord thread (Discord)
  • Action: Close incident row in Postgres (PostgreSQL)
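
The PagerDuty and Discord stages above come down to a few HTTP requests. The sketch below only builds request descriptions and sends nothing. It assumes the page was opened through the PagerDuty Events API v2 with the device ID as its dedup key, and that the triage thread is reachable through the Discord REST API with a bot token; the page does not state either, and the function names are hypothetical.

```python
# Assumption: incidents were triggered via PagerDuty Events API v2, so a
# "resolve" event with the same dedup_key closes the matching alert.
PAGERDUTY_EVENTS_URL = "https://events.pagerduty.com/v2/enqueue"
# Assumption: the bot talks to Discord REST API v10.
DISCORD_API_BASE = "https://discord.com/api/v10"


def pagerduty_resolve_request(routing_key: str, dedup_key: str) -> dict:
    """Describe the request that resolves an alert (hypothetical helper)."""
    return {
        "method": "POST",
        "url": PAGERDUTY_EVENTS_URL,
        "json": {
            "routing_key": routing_key,
            "event_action": "resolve",
            "dedup_key": dedup_key,
        },
    }


def discord_all_clear_requests(thread_id: str, message: str) -> list[dict]:
    """Describe the two requests for the thread: post first, then archive."""
    base = f"{DISCORD_API_BASE}/channels/{thread_id}"
    return [
        # A thread is a channel, so the all-clear is a normal message post.
        {"method": "POST", "url": f"{base}/messages", "json": {"content": message}},
        # Archiving is a channel update with the thread's archived flag set.
        {"method": "PATCH", "url": base, "json": {"archived": True}},
    ]
```

Posting before archiving matters: a message sent to an already archived thread would normally reopen it.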

What it does

Closes the loop on an outage. The moment a device that was flagged down sends a fresh heartbeat, the workflow computes total downtime, resolves any open PagerDuty incident, posts a recovery summary to the Discord triage thread, and archives it so channels don't fill with stale threads.
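
As a rough illustration of the downtime figure in the recovery summary, here is a minimal sketch. The function names, the message wording, and the choice to clamp negative durations to zero are assumptions, not the workflow's actual logic.

```python
from datetime import datetime, timedelta, timezone


def compute_downtime(first_missed_at: datetime, recovered_at: datetime) -> timedelta:
    """Total downtime from the first missed heartbeat to recovery."""
    if first_missed_at.tzinfo is None or recovered_at.tzinfo is None:
        raise ValueError("timestamps must be timezone-aware")
    # Clamp to zero so clock skew between device and server cannot
    # produce a negative downtime (an assumption of this sketch).
    return max(recovered_at - first_missed_at, timedelta(0))


def format_downtime(downtime: timedelta) -> str:
    """Render a duration as hours, minutes, and seconds."""
    total_seconds = int(downtime.total_seconds())
    hours, remainder = divmod(total_seconds, 3600)
    minutes, seconds = divmod(remainder, 60)
    return f"{hours}h {minutes:02d}m {seconds:02d}s"


def recovery_summary(device_id: str, first_missed_at: datetime, recovered_at: datetime) -> str:
    """Build the all-clear text posted to the triage thread."""
    downtime = compute_downtime(first_missed_at, recovered_at)
    return (
        f"All clear: {device_id} is sending heartbeats again. "
        f"Total downtime: {format_downtime(downtime)}."
    )


first_missed = datetime(2024, 1, 1, 10, 0, 0, tzinfo=timezone.utc)
recovered = datetime(2024, 1, 1, 12, 5, 30, tzinfo=timezone.utc)
print(recovery_summary("edge-17", first_missed, recovered))
# prints: All clear: edge-17 is sending heartbeats again. Total downtime: 2h 05m 30s.
```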

When to use it

Use it alongside the silence and escalation workflows so recoveries are handled automatically. On-call no longer has to manually close a thread or acknowledge an already-resolved page, and each recovery leaves a clean downtime record for SLA reporting.

How it works

  1. A heartbeat webhook arrives from a device currently marked down.
  2. A logic step confirms the device had an open incident and computes total downtime from first-missed to now.
  3. An action resolves the matching PagerDuty incident.
  4. The workflow posts an all-clear with the downtime figure to the device's Discord triage thread and archives it.
  5. An action closes the incident row in Postgres, recording the resolution time for reporting.
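
The five steps can be sketched as a single handler. This is an illustration under stated assumptions, not the engine's implementation: an in-memory dict stands in for the Postgres incident table, and the PagerDuty and Discord actions are passed in as plain callables. The `Incident` fields and all function names are hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Callable, Optional


@dataclass
class Incident:
    # Hypothetical shape of an incident row; the real schema is not given.
    device_id: str
    first_missed_at: datetime
    thread_id: str
    resolved_at: Optional[datetime] = None


def handle_heartbeat(
    device_id: str,
    now: datetime,
    open_incidents: dict[str, Incident],
    resolve_page: Callable[[str], None],
    post_and_archive: Callable[[str, str], None],
) -> Optional[Incident]:
    """Run steps 2 to 5 for one heartbeat (step 1 is the webhook itself)."""
    # Step 2: confirm the device has an open incident, then compute downtime.
    incident = open_incidents.get(device_id)
    if incident is None:
        return None  # heartbeat from a healthy device, nothing to close
    downtime = now - incident.first_missed_at

    # Step 3: resolve the matching PagerDuty incident.
    resolve_page(device_id)

    # Step 4: post the all-clear to the triage thread and archive it.
    post_and_archive(incident.thread_id, f"All clear for {device_id}. Downtime: {downtime}.")

    # Step 5: close the incident record with its resolution time.
    incident.resolved_at = now
    del open_incidents[device_id]
    return incident
```

Returning early when no incident is open keeps routine heartbeats from healthy devices from touching PagerDuty or Discord.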

Set it up

What you configure once, before turning it on.

  1. Connect HTTP webhook: Trigger any URL on agent actions.
  2. Connect Postgres: Any Postgres URL (query, write, migrate).
  3. Connect PagerDuty: Incidents, on-call, escalations.
  4. Connect Discord: Community channels + voice + bots.
  5. Set each agent's model: We leave models unset so you pick the tier: fast and cheap, or top quality.
  6. Tune it to your data: Edit the prompts, filters, and field mappings so it matches how your team works.
  7. Test, then turn it on: Run once against a sample, confirm the output, then enable the trigger.
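
For the field-mapping and sample-run steps, a small parser like the one below can help check a sample payload before enabling the trigger. It is only a sketch: the payload keys `device_id` and `sent_at`, the ISO 8601 timestamp format, and the `FIELD_MAP` name are assumptions, since the page does not specify the heartbeat format.

```python
import json
from datetime import datetime

# Hypothetical mapping from the workflow's field names to the keys your
# devices actually send; edit the right-hand side to match your payloads.
FIELD_MAP = {"device_id": "device_id", "sent_at": "sent_at"}


def parse_heartbeat(raw_body: str, field_map: dict = FIELD_MAP) -> dict:
    """Normalize a raw webhook body into device_id and sent_at."""
    payload = json.loads(raw_body)
    missing = [src for src in field_map.values() if src not in payload]
    if missing:
        raise ValueError(f"heartbeat payload missing fields: {missing}")
    return {
        "device_id": str(payload[field_map["device_id"]]),
        # Assumes an ISO 8601 timestamp with an explicit UTC offset.
        "sent_at": datetime.fromisoformat(payload[field_map["sent_at"]]),
    }


sample = '{"device_id": "edge-17", "sent_at": "2024-01-01T12:05:30+00:00"}'
print(parse_heartbeat(sample)["device_id"])
# prints: edge-17
```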
