RAG & Retrieval
40 entriesTools and patterns for retrieval-augmented generation, vector search, embeddings, and knowledge retrieval pipelines.
Agent Harness Pattern
trialArchitectural pattern where all non-model code surrounding an LLM (planning, tools, sub-agents, context management) is p...
Agent Memory as Infrastructure
assessTreats AI agent memory as first-class infrastructure with lifecycle hooks, layered storage, async writes, and active mai...
Agent Skills Specification
adoptAn open standard for packaging reusable procedural knowledge as markdown files that AI coding agents can discover, load,...
AGENTS.md
trialAn open cross-platform specification for a repository-root Markdown file that provides AI coding agents with project con...
Agno
assessOpen-source Python framework, stateless FastAPI runtime (AgentOS), and control-plane UI for building and operating multi...
AnythingLLM
assessA self-hosted AI chat application with workspace-isolated RAG, a zero-config desktop app, and multi-provider LLM support...
BMAD Method
assessA framework for structuring AI-assisted development using six specialized agent personas and versioned documentation art...
ChromaDB
trialOpen-source AI-native vector database designed for prototyping and RAG applications, with a 2025 Rust-core rewrite addin...
Claude Northstar
assessMIT-licensed CLAUDE.md harness that bootstraps any git repo for autonomous goal-oriented agent operation, replacing sequ...
Cloudflare Workers AI
assessServerless GPU inference platform running 50+ open-weight models on Cloudflare's global network, with pay-per-token pric...
Cognee
assessOpen-source Apache-2.0 knowledge engine for AI agent memory that combines vector search and graph databases to ingest 30...
DeepEval
trialOpen-source Apache-2.0 LLM evaluation framework by Confident AI with 50+ metrics spanning RAG, agents, multi-turn conver...
Dify
assessAn open-source platform for building LLM-powered applications via a visual drag-and-drop workflow builder with built-in...
Flowise
assessA lightweight visual builder for constructing AI chatbots and RAG pipelines using drag-and-drop LangChain components, de...
GitNexus
assessOpen-source code intelligence engine that indexes repositories into a precomputed knowledge graph and exposes 16 MCP too...
Haystack (deepset)
trialOpen-source Python AI orchestration framework by deepset for building production-ready LLM applications, RAG pipelines,...
Hippo Memory
assessZero-dependency TypeScript CLI and npm package implementing biologically-inspired AI agent memory with exponential decay...
InfiniFlow
assessShanghai-based AI infrastructure company behind RAGFlow (78.5k+ star open-source RAG engine) and Infinity, an AI-native...
Langflow
assessA visual IDE for building AI agents and RAG applications with native LangGraph integration for stateful multi-agent work...
Langfuse
trialOpen-source LLM engineering platform (MIT-licensed, 21k+ GitHub stars) covering observability traces, evaluation, prompt...
LlamaIndex
trialOpen-source MIT-licensed data framework for building RAG and document agent applications on top of LLMs, with 38k+ GitHu...
MemPalace
assessLocal-first open-source AI memory system using a hierarchical palace metaphor (Wings/Rooms/Halls) over ChromaDB vector s...
Milvus
trialApache-2.0 distributed vector database for billion-scale similarity search, built for cloud-native Kubernetes deployment...
Open WebUI
trialA self-hosted, provider-agnostic web interface for LLMs with built-in RAG, MCP support, RBAC, and Ollama integration for...
OpenSpec
assessA CLI framework that adds a specification layer to AI-assisted development with structured proposals, specs, and task br...
OpenViking
assessA context database by ByteDance that organizes AI agent memory via a virtual filesystem with tiered content loading to r...
Perplexity AI
assessAI-native answer engine and agentic browser company pivoting from AI-assisted search to autonomous multi-step task execu...
RAG Pipeline
assessRetrieval-Augmented Generation pattern that grounds LLM responses in retrieved documents to reduce hallucination and ena...
RAGAS
trialOpen-source Apache-2.0 evaluation framework for RAG pipelines and LLM applications by ExplodingGradients (YC W24), provi...
RAGFlow
assessOpen-source Apache-2.0 RAG engine by InfiniFlow specializing in deep document understanding (OCR, layout analysis, table...
Retrieval-Augmented Generation (RAG)
adoptAn LLM inference pattern that injects relevant documents retrieved from an external corpus into the model's context at q...
Spec-Driven Development
trialDevelopment pattern where structured specification documents are written before code and serve as the primary input for...
Supabase
trialOpen-source Firebase alternative providing managed PostgreSQL, authentication, storage, and serverless Edge Functions as...
Thunderbolt
assessOpen-source, self-hosted enterprise AI client by MZLA Technologies (Mozilla) offering multi-platform native apps, multi-...
TruLens
assessOpen-source MIT-licensed LLM evaluation and tracing framework by TruEra, now maintained by Snowflake, combining OpenTele...
VectorDBBench
assessOpen-source benchmarking tool for vector databases, covering 30+ databases with CLI and visual interface; maintained by...
Volcano Engine
assessByteDance's enterprise cloud platform offering AI services, the Doubao LLM family, VikingDB vector database, and agent t...
Weaviate
assessVector database supporting hybrid vector-keyword search, automatic vectorization, and multi-tenancy for AI-native applic...
Weaviate Engram
assessMemory and context layer for AI agents built on Weaviate, organizing persistent semantic memory by topic across sessions...
Zilliz Cloud
trialFully managed vector database service built on Milvus, operated by Zilliz with enterprise-grade SLA (99.95%), SOC 2 Type...
Related Reviews
The AI Engineering Stack We Built Internally — On the Platform We Ship
Ayush Thakur, Scott Roe-Meschke, Rajesh Bhatia · Apr 22, 2026
Thunderbolt: Mozilla's Open-Source Self-Hosted Enterprise AI Client
MZLA Technologies (Mozilla) · Apr 22, 2026
Zilliz Ecosystem Review: Milvus, Zilliz Cloud, and the Vector Database Toolchain
Tech Radar Analyst · Apr 22, 2026
Lovable: AI-Powered App Builder — Product Review
Unknown (vendor site) · Apr 21, 2026
Built for Humans, Consumed by Agents: The Next Decade of Sports Digital Platforms
Mark Shannon · Apr 20, 2026