AI AGENTS
Replicate-to-HuggingFace Fallback Evaluation Agent
When a pinned Replicate model is deprecated with no successor, the agent searches HuggingFace for equivalent models, evaluates the top candidates on your golden prompts.
How it runs
The automated pipeline, trigger to output.
- TriggerManual or scheduled start on a no-successor deprecation
- ActionRead deprecated Replicate model task metadataReplicate
- ActionSearch HuggingFace for comparable modelsHugging Face
- LogicShortlist candidates by license, downloads, and task match
- ActionEvaluate candidates on golden prompts for quality and costHugging Face
- OutputOpen Linear ticket with ranked recommendationLinear
What it does
Sometimes a deprecated Replicate version has no drop-in successor. This agent handles that harder case: it searches HuggingFace for models matching the deprecated one's task and architecture, shortlists candidates, evaluates each against your golden prompt set, and writes a ranked replacement recommendation into a Linear ticket.
When to use it
Use it when a model you depend on is being sunset entirely and you need to choose a new vendor or open-weights model, not just bump a version hash.
How it works
- 1A schedule or manual trigger starts the evaluation when a no-successor deprecation is detected.
- 2The agent reads the deprecated model's task tags and searches HuggingFace for comparable models.
- 3A logic step shortlists candidates by downloads, license, and task match.
- 4The agent runs golden prompts against each shortlisted candidate and scores quality, latency, and cost.
- 5It ranks the candidates and drafts a recommendation with the scoring table.
- 6It opens a Linear ticket with the recommendation and attaches sample outputs for review.
Set it up
What you configure once, before turning it on.
- 1Connect Hugging FaceModels, datasets, spaces — the open-source hub.
- 2Connect LinearIssues, projects, cycles, triage.
- 3Connect ReplicateImage, video, and model inference.
- 4Set each agent's modelWe leave models unset so you pick the tier — fast + cheap, or top-quality.
- 5Tune it to your dataEdit the prompts, filters, and field mappings so it matches how your team works.
- 6Test, then turn it onRun once against a sample, confirm the output, then enable the trigger.
More AI Agents workflows
Custom Metrics Cardinality Spike Pager
A webhook from a Datadog monitor fires when custom-metric cardinality jumps; an agent pinpoints the offending metric and tag, estimates the added cost.
Sentry-to-Confluence Runbook Updater
When a Sentry issue is resolved, the agent finds the matching Confluence runbook page and proposes an inline update with the verified fix.
Stale Doc-PR Chaser for Runbook Gaps
On a daily schedule the agent finds runbook doc PRs that were opened from resolved incidents but never reviewed, summarizes what each one fixes.
Resolved Incident to Public Troubleshooting Doc
For customer-facing errors resolved in Sentry, the agent drafts a sanitized troubleshooting entry and opens a PR to your ReadMe documentation.
On-Call Runbook Gap Closer: Resolved Sentry Issues to Doc PRs
An agent reads each newly resolved Sentry issue, compares the actual fix against your existing runbook, and opens a GitHub PR adding the missing remediation steps.
Weekly On-Call Doc-Gap Digest
Each week the agent reviews every Sentry issue resolved in the last 7 days, ranks the ones whose runbook coverage is missing or thin.
Run it inside a business
This workflow drops into a full company template. Import the org, and this is one of the playbooks its agents run.

Run this workflow in your colony.
14-day trial. No DevOps. No Sales call. Provisioned in under a minute.
