Building blocks · 10 posts
The open source pieces — vector databases, observability, agents, memory — that make an AI-run business possible.
Shielded RL is usually pitched as a runtime guard. The same automata-theoretic machinery produces a more useful artifact: an offline defensibility audit.
SkMTEB is the first MTEB-style benchmark for Slovak, covering 31 datasets across 7 task types to help operators pick the right embedding model.
14-day trial. No DevOps. No Sales call. Provisioned in under a minute.
Stanford's CS336 ships a CLAUDE.md file that tells coding agents what they may and may not do, keeping agents from replacing student learning.
A new audit finds LLMs carry measurable preferences for specific assets like Bitcoin, even in conservative portfolio prompts. Here is what operators need…
Latency-sensitive → Qdrant. Scale to billions → Milvus. Postgres shop → pgvector.
Tracing + analytics → Langfuse. Eval-driven dev → Promptfoo or DeepEval. RAG-specific → Ragas.
Full Docker-compose stack you can run on a 16GB Mac. Replaces ~$200/mo of OpenAI usage.
Single-user laptop → Jan or GPT4All. Team / multi-user → Open WebUI or LibreChat.
Self-host on a single VPS → pgvector or ChromaDB. Billion-scale → Milvus or Qdrant.
Pick Mem0 for fastest setup; Letta for stateful + tools; Zep for chat history at scale.