Weekly Dataset Radar for a Research Vertical

Every Monday, scans Hugging Face for datasets newly published in your research vertical and clusters them by theme.

Category: Market Research
Engine: sim
Difficulty: beginner
Trigger: schedule
Steps: 5
Setup: ~5 min

How it runs

The automated pipeline, trigger to output.

Trigger: Weekly schedule (Monday AM)
Action: Query Hugging Face for new datasets in vertical (Hugging Face)
Logic: Filter by size, license, recency
Action: Cluster into themes and rank (OpenAI)
Output: Post ranked digest to Slack (Slack)

What it does

Runs a scheduled search across Hugging Face for datasets created or updated in the last seven days that match your vertical's keywords (e.g. "clinical NLP", "battery materials", "fraud detection"). It pulls metadata (task type, size, license, downloads), clusters the results into a handful of themes, ranks each by relevance and traction, and delivers a single skimmable digest to Slack.
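The page does not say how "relevance and traction" are computed, so the following is only a minimal sketch of one plausible scoring scheme. It assumes each dataset is a plain dict with hypothetical `id`, `description`, `tags`, and `downloads` fields, treats relevance as the share of vertical keywords that appear in the metadata, and treats traction as log-scaled downloads. The 0.7 / 0.3 weights are illustrative, not the workflow's actual values.

```python
import math


def relevance(dataset: dict, keywords: list[str]) -> float:
    """Share of vertical keywords found in the dataset id, description, or tags."""
    parts = [dataset.get("id", ""), dataset.get("description", "")]
    parts += dataset.get("tags", [])
    text = " ".join(parts).lower()
    hits = sum(1 for kw in keywords if kw.lower() in text)
    return hits / len(keywords) if keywords else 0.0


def traction(dataset: dict) -> float:
    """Log-scaled downloads, so one very popular dataset does not swamp the rest."""
    return math.log10(1 + dataset.get("downloads", 0))


def rank(datasets: list[dict], keywords: list[str],
         w_relevance: float = 0.7, w_traction: float = 0.3) -> list[dict]:
    """Sort datasets by a weighted blend of relevance and normalized traction."""
    # Normalize traction to [0, 1]; fall back to 1.0 to avoid dividing by zero.
    max_traction = max((traction(d) for d in datasets), default=0.0) or 1.0

    def score(d: dict) -> float:
        return (w_relevance * relevance(d, keywords)
                + w_traction * traction(d) / max_traction)

    return sorted(datasets, key=score, reverse=True)
```

With these weights a dataset that matches the keywords outranks an off-topic one even when the off-topic dataset has far more downloads; shifting weight toward traction would favor popularity instead.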

When to use it

For research, ML, or competitive-intel teams who need to stay current on the open-data landscape but don't have time to browse the Hub manually. One standing report beats ten ad-hoc searches.

How it works

1. A weekly cron fires Monday morning.
2. Hugging Face is queried for datasets matching each vertical keyword, filtered to the last 7 days.
3. A filter drops items below a minimum size or with non-permissive licenses.
4. An LLM clusters the survivors into named themes and writes a one-line take on each.
5. The ranked, clustered digest is posted to the team's Slack research channel.
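The filtering in steps 2 and 3 can be sketched roughly as follows. This is an illustration under stated assumptions, not the workflow's actual implementation: the field names (`last_modified`, `num_rows`, `license`), the 1,000-row minimum, and the set of licenses counted as permissive are all hypothetical and would be replaced by whatever your team configures.

```python
from datetime import datetime, timedelta, timezone

# Assumption: which license identifiers count as "permissive" is a team decision.
PERMISSIVE_LICENSES = {"mit", "apache-2.0", "cc-by-4.0", "cc0-1.0"}


def keep(dataset: dict, now: datetime, max_age_days: int = 7,
         min_rows: int = 1000, licenses: set = PERMISSIVE_LICENSES) -> bool:
    """Return True if the dataset is recent, large enough, and permissively licensed."""
    recent = now - dataset["last_modified"] <= timedelta(days=max_age_days)
    big_enough = dataset.get("num_rows", 0) >= min_rows
    # A missing license is treated as non-permissive.
    permissive = (dataset.get("license") or "").lower() in licenses
    return recent and big_enough and permissive


def filter_datasets(datasets: list[dict], now: datetime = None, **kwargs) -> list[dict]:
    """Apply the recency, size, and license checks to every dataset."""
    now = now or datetime.now(timezone.utc)
    return [d for d in datasets if keep(d, now, **kwargs)]
```

Treating an unknown license as non-permissive is the conservative choice here; a team that prefers to review unlicensed datasets by hand could route them to a separate list instead of dropping them.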

Set it up

What you configure once, before turning it on.

1. Connect Hugging Face. Models, datasets, spaces: the open-source hub.
2. Connect OpenAI. Models, embeddings, files.
3. Connect Slack. Channels, DMs, threads, mentions.
4. Set each agent's model. We leave models unset so you pick the tier: fast and cheap, or top quality.
5. Tune it to your data. Edit the prompts, filters, and field mappings so it matches how your team works.
6. Test, then turn it on. Run once against a sample, confirm the output, then enable the trigger.
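For the "test, then turn it on" step, a dry run might look something like the sketch below. It builds the digest text from already-clustered themes and returns the message that would be posted, without calling Slack. `RadarConfig`, `build_digest`, `dry_run`, the `#research` channel, and the dict fields are all hypothetical names chosen for illustration; the product's own configuration screens are not shown on this page.

```python
from dataclasses import dataclass


@dataclass
class RadarConfig:
    keywords: list[str]                 # vertical keywords, e.g. ["clinical NLP"]
    slack_channel: str = "#research"    # hypothetical channel name
    max_per_theme: int = 3              # cap on datasets listed under each theme


def build_digest(themes: dict[str, list[dict]], config: RadarConfig) -> str:
    """Format clustered datasets as Slack-style text, one section per theme."""
    lines = [f"*Weekly Dataset Radar* ({', '.join(config.keywords)})"]
    for theme, items in themes.items():
        lines.append("")
        lines.append(f"*{theme}*")
        for d in items[: config.max_per_theme]:
            url = f"https://huggingface.co/datasets/{d['id']}"
            lines.append(
                f"• <{url}|{d['id']}> "
                f"({d.get('license', 'unknown')}, {d.get('downloads', 0)} downloads)"
            )
    return "\n".join(lines)


def dry_run(themes: dict[str, list[dict]], config: RadarConfig) -> dict:
    """Return the message that would be posted, without sending anything."""
    return {"channel": config.slack_channel, "text": build_digest(themes, config)}
```

Inspecting the returned dict against a small hand-made sample is a cheap way to confirm the theme names, item caps, and links look right before the weekly trigger is enabled.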
