DEVOPS

Correlate flaky failures with Honeycomb traces and start a Slack triage thread

When a flaky test recurs, pulls the matching CI trace from Honeycomb, has an agent reason about the likely root cause (timing, resource contention, network), and opens a Slack…

CategoryDevOps
EngineSim + Paperclip
Difficultyadvanced
Triggerwebhook
Steps5
Setup~25 min

How it runs

The automated pipeline, trigger to output.

  • TriggerFlake-detected webhook receivedHTTP webhook
  • ActionFetch CI run trace spans from HoneycombHoneycomb
  • LogicAgent classifies likely root cause from spans
  • LogicBranch: quarantine vs fix-first recommendation
  • OutputOpen Slack triage thread with recommendationSlack

What it does

This workflow links an intermittent test failure to telemetry. On a flake event it fetches the related CI execution trace from Honeycomb, then an agent analyzes span timings and errors to hypothesize whether the failure is a race condition, a slow dependency, resource contention, or genuine product bug. It starts a Slack triage thread summarizing the evidence and a suggested quarantine-or-fix call.

When to use it

Use it when flakes are hard to diagnose and your team wastes triage time guessing. Pairing the failure with trace data turns a vague flake into an actionable hypothesis.

How it works

  1. 1A flake-detected webhook fires with the test identifier and run timestamp.
  2. 2The workflow queries Honeycomb for the trace spans covering that test run.
  3. 3An agent reviews span latencies, retries, and errors to classify the likely cause.
  4. 4A branch decides whether the evidence points to test infrastructure (recommend quarantine) or product code (recommend fix-first).
  5. 5It opens a Slack thread with the trace summary, classification, and recommendation, mentioning the owning team.
  6. 6The Slack thread link is returned as output.

Set it up

What you configure once, before turning it on.

  1. 1
    Connect HTTP webhookTrigger any URL on agent actions.
  2. 2
    Connect HoneycombDistributed traces and queries.
  3. 3
    Connect SlackChannels, DMs, threads, mentions.
  4. 4
    Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
  5. 5
    Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
  6. 6
    Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.

Run this workflow in your colony.

14-day trial. No DevOps. No Sales call. Provisioned in under a minute.