Agent Hive mark

The Ultimate Open-Source AI Stack

368 is not a marketing number. It is the count of actively maintained open-source repositories we cataloged in 2026 that ship production-usable AI capabilities, each with at least one release in the past 90 days and at least 500 GitHub stars. The combined star count across all 368 repos exceeds 3.1 million, which gives you a rough sense of the engineering attention going into the ecosystem. That is 3.1 million developers collectively deciding something is worth bookmarking.

The problem with lists like this is that they usually present every repo as a candidate for everyone. They are not. A solo developer building a personal chatbot needs exactly one repo from the "self-hosted chat UI" row. An ML team running multi-model inference at scale needs something from the "model serving" row. A startup replacing Notion and Slack needs the "growth and operations" row. Handing everyone the full 368-repo list is the same mistake as telling someone who wants a sandwich to read the entire menu.

This post does not rank repos against each other. It organizes the field by the decision you are actually making: what do you want to build or replace, and which cluster of repos is the right starting point. Every cluster post in this series goes deep on a specific row. This pillar gives you the map.

The five clusters are: AI building blocks, creator stack, engineering, growth and operations, and model infrastructure. Most teams need exactly one or two of these clusters. If you find yourself thinking "we need all five," you are probably overfitting the analysis. Prioritize the cluster that unblocks your most pressing constraint, ship, then revisit.

One thing worth saying directly: the open-source options in 2026 are not "good enough for hobbyists but not production-ready." The repos in the AI building blocks cluster are used in production by companies with millions of users. Qdrant is used by Tripadvisor. Weaviate is used by Shopify. pgvector is embedded in Supabase's free tier. The quality bar has shifted.

Decision matrix

AI building blocks

The largest cluster in the catalog and the one that saw the most new entrants in 2025-2026. If you are writing code that calls a language model, you will draw from this row. The repos here cover five sub-functions: agent memory, vector databases, model serving, LLM orchestration, and observability. Each sub-function has three to eight credible open-source options, and the depth posts in this cluster cover each sub-function as a separate article with head-to-head comparisons.

The star counts in this cluster are striking. Ollama alone holds 172,526 stars, which places it in the top 100 most-starred repos on GitHub regardless of category. Open WebUI at 139,066 is not far behind. These are not niche tools; they are the infrastructure layer of the 2026 AI application stack.

The cluster posts for this row cover agent memory, vector databases, and model serving in depth. The Mem0 vs Letta vs Zep comparison is the first depth piece in this cluster.

Creator stack

Image generation, video synthesis, and audio tools have had a more dramatic open-source moment than any other category in 2025-2026. The commercially licensed alternatives from Midjourney or ElevenLabs charge per generation; the open-source equivalents run on a single GPU and cost only electricity. The quality gap between open-source and commercial image generation closed faster than anyone expected: FLUX.1 [schnell] produces images that are indistinguishable from Midjourney v6 at equivalent resolution, and it is Apache 2.0 licensed.

The creator stack cluster is split between teams that need a polished UI (Stable Diffusion WebUI, ComfyUI) and teams that need a pipeline-programmable backend they can call from code (diffusers, FLUX API, Kokoro TTS). Both groups have good options.

The creator stack cluster post benchmarks these repos against each other and covers ControlNet, LoRA fine-tuning, and voice cloning in depth.

Engineering cluster

The fastest-growing sub-category in the catalog: AI tools that sit inside the software development loop. Code review, autocomplete, test generation, repo search, and PR summarization all have credible open-source implementations. The commercial tools (GitHub Copilot, Cursor) charge $10-20/month per developer; the open-source alternatives let you bring your own model and pay only for API calls.

The architectural distinction in this cluster is between IDE extensions (Continue, Cline) and terminal-first tools (Aider). IDE extensions integrate with your existing editor workflow. Terminal-first tools operate on the full repo from outside the editor, which makes them better for large refactors and for running on CI.

The engineering cluster post covers Aider vs Cline vs Continue in a head-to-head comparison with real benchmark numbers from SWE-bench and similar evaluations.

Growth and operations

The most underrated row in the matrix. Small businesses and ops-heavy startups pay $4,000+ per month across Notion, Intercom, Zapier, and similar SaaS tools. This cluster maps replacements: self-hosted wikis, customer support bots, workflow automation, and CRM alternatives. The open-source options here are not as polished as the commercial leaders, but they are within two to three years of parity, and the data-ownership argument has become more compelling as SaaS privacy terms have gotten less favorable.

The key insight for this cluster: the replacement projects that work best are not "open-source X" (where X is a specific SaaS tool). The ones that work are projects that do the underlying job well on their own terms, like n8n for automation, which has a workflow model that is genuinely different from and in some ways better than Zapier rather than merely a free imitation of it.

The growth and operations cluster post covers replacing Notion, Intercom, and Zapier step by step with setup guides for each.

Model infrastructure

Fine-tuning, quantization, evaluation, and serving at scale are their own discipline. If you are not just calling a hosted API but actually owning the model lifecycle, this row is for you. The repos here let you take an open-weight base model (Llama 3.3, Qwen 2.5, Mistral Nemo) and specialize it for a specific task, then serve it at production throughput.

The entry point to this cluster for most teams is Ollama plus llama.cpp for local development, then vLLM for production serving. Fine-tuning enters the picture when the base model is not performing well enough on your domain, which typically means you have at least a few thousand high-quality labeled examples.

The teams who ship fastest pick one tool per layer, accept its limitations, and revisit the decision when the limitations become the bottleneck.

— The reading rule

How to use the catalog

The downloadable catalog below contains all 368 repos with six columns: repo path, display name, category, sub-category, star count (as of the catalog generation date), and license. The most useful filters for typical readers are:

Category = ai-building-blocks, license = Apache-2.0 or MIT: production-usable repos with permissive licenses, no copyleft restrictions for commercial use.
Category = engineering, stars > 10000: tools that have meaningful community support and are unlikely to be abandoned.
Sub-category = vector-database: the 12 repos covered in the vector databases depth post, with their current star counts and license data.

The catalog is a snapshot. Stars change, maintainers change, projects get acquired or abandoned. The data was accurate at time of generation; for live star counts, check the GitHub pages linked in each cluster post.

Our take

The open-source AI stack in 2026 is genuinely production-ready across all five clusters above, and the quality gap with commercial alternatives has closed faster than most observers expected. The catalog is not "pick everything from column A"; it is "pick one row, pick the top repo in that row, and ignore the rest until you have a shipping product." The teams we see getting stuck are the ones who spend three weeks evaluating seven vector databases before writing a single query. Pick Qdrant or pgvector, ship something, and optimize later. The teams who ship fastest pick one tool per layer, accept its limitations, and revisit the decision when the limitations become the bottleneck. That discipline is harder than it sounds when you are looking at a list of 368 options, but it is the only approach that reliably results in a working system.

The Ultimate Open-Source AI Stack: 368 Repos Categorized for 2026

The only platform to run an AI-native company.