Skip to content

Agent Swarm

★ New
assess
AI / ML open-source MIT freemium

At a Glance

Open-source TypeScript/Bun multi-agent orchestration framework by desplega.ai with lead/worker Docker isolation, session-based compounding memory via OpenAI embeddings, and integrations for Slack, GitHub, GitLab, and email.

Type
open-source
Pricing
freemium
License
MIT
Adoption fit
small, medium
Top alternatives

What It Does

Agent Swarm implements a lead/worker coordination pattern for AI coding agents. A lead agent (Claude Code instance) receives tasks from Slack, GitHub, GitLab, email, or CLI, decomposes them, and delegates subtasks to worker agents running in isolated Docker containers with full development environments. An MCP API server backed by SQLite tracks task lifecycle, inter-agent communication, and coordination state.

The framework differentiates itself with a “compounding memory” system: after each session, a lightweight model extracts a summary of mistakes, patterns, and codebase knowledge, stores it in SQLite, and indexes it via OpenAI text-embedding-3-small embeddings. Future sessions retrieve relevant past summaries as context. Agents also maintain four self-editing identity files (SOUL.md, IDENTITY.md, TOOLS.md, CLAUDE.md) that persist across container restarts.

Key Features

  • Lead/worker hierarchy: one lead agent decomposes and delegates; workers run in Docker containers with Node.js, Python, and Git pre-installed
  • SQLite-backed MCP API server with OpenAPI 3.1 spec and Scalar UI documentation
  • Session-based memory extraction using lightweight LLM models with OpenAI text-embedding-3-small retrieval
  • Four persistent identity files per agent (SOUL.md, IDENTITY.md, TOOLS.md, CLAUDE.md) that self-update across sessions
  • Nine pre-built agent templates: lead, coder, researcher, reviewer, tester, FDE, content-writer, content-reviewer, content-strategist
  • Integrations: Slack (Socket Mode, thread-based progress), GitHub App (issues, @mentions, PRs, CI), GitLab (MRs, @mentions, pipelines), AgentMail, Sentry
  • DAG-based workflow engine with checkpoint durability, version history, and human-in-the-loop approval nodes
  • MCP server management with scope cascading (agent → swarm → global)
  • x402 USDC micropayment support for gated API access
  • Real-time dashboard at app.agent-swarm.dev with context window usage tracking
  • Skill system for reusable procedural knowledge
  • Service discovery between workers; HTTP service exposure from containers
  • Scheduled/cron-based task execution
  • Cloud offering: Agent Swarm Cloud (€9/month + €29/month per worker)

Use Cases

  • Use case 1: Parallel code review and testing — assign a reviewer agent and tester agent to work concurrently on a PR while a coder agent iterates on fixes
  • Use case 2: Slack-driven development workflows — post a feature request to Slack; the lead agent decomposes it, assigns subtasks to workers, and reports progress back in-thread
  • Use case 3: GitHub issue assignment — install the GitHub App, @mention the agent in an issue, and the swarm picks it up autonomously
  • Use case 4: Content production pipelines — coordinate content-writer, content-reviewer, and content-strategist agents on blog or documentation production
  • Use case 5: Multi-step research tasks — use a researcher agent to gather context, feed it to a coder agent, and validate with a tester agent

Adoption Level Analysis

Small teams (<20 engineers): Fits with caveats. The Docker Compose deployment is accessible and the Slack/GitHub integrations reduce setup friction. However, OpenAI embedding costs (per session summary), Claude Code OAuth token requirements, and operational overhead of managing Docker worker containers add non-trivial complexity. The €29/month per worker cloud pricing is accessible at small scale.

Medium orgs (20–200 engineers): Partially fits. The workflow engine, DAG support, and human-in-the-loop approval nodes make it viable for structured engineering workflows. However, the framework is at v1.67.2 with 355 GitHub stars — modest adoption signals relative to alternatives like Vibe Kanban (23k+ stars) or Claude Flow (21k+ stars). No published production deployments or post-mortems were found. Teams should evaluate carefully before committing.

Enterprise (200+ engineers): Does not fit currently. No enterprise security documentation, no SOC 2 certification, no documented access control beyond MCP scope cascading, and no evidence of production deployment at scale. The x402 autonomous payment feature is a potential risk without documented spending guardrails.

Alternatives

AlternativeKey DifferencePrefer when…
Vibe KanbanLocal Kanban UI, git worktrees, 23k+ stars, no persistent memory overheadYou want minimal ops, visual oversight, and proven community traction
Claude Flow (Ruflo)21k+ stars, 314 MCP tools, 16 agent roles, shared memoryYou need a richer tool ecosystem and broader community support
OpenHandsModel-agnostic, cloud + self-host, 70k+ stars, ICLR 2025 paperYou want model flexibility beyond Claude and production-grade validation
Composio Agent OrchestratorOpen-source dual-layer orchestrator, structured workflowsYou need structured agentic workflow primitives without the full Docker swarm overhead
MulticaKanban task assignment, WebSocket streaming, pgvectorYou want a Kanban-style interface without Docker container isolation

Evidence & Sources

No independent benchmarks, production case studies, or post-mortems found.

Notes & Caveats

  • OpenAI API dependency: The compounding memory system requires OpenAI text-embedding-3-small API access. Self-hosted deployments incur ongoing embedding costs that scale with session count. This creates a vendor dependency even though the framework itself is MIT-licensed.
  • Claude Code lock-in: The framework is explicitly designed around Claude Code as the primary agent runtime. Workers could in principle use other agents (Gemini CLI is mentioned in positioning), but the architecture and identity files are Claude-centric.
  • x402 payment risk: The autonomous USDC micropayment feature for x402-gated APIs has no documented spending limits, authorization flows, or audit logging in the reviewed documentation. Deploying this in production without custom guardrails creates unbounded spending risk.
  • Star count vs. alternatives: At 355 stars (April 2026), Agent Swarm is roughly two orders of magnitude behind Vibe Kanban (23k+) and Claude Flow (21k+). This may reflect early-stage status or limited adoption; it is not evidence of quality.
  • Company primary business: desplega.ai’s core product is AI-powered E2E testing (not agent orchestration). Agent Swarm may be a strategic side project rather than the company’s primary engineering focus. Prioritization risk is worth monitoring.
  • No migration path documented: No documentation found for migrating from Agent Swarm to alternatives or exporting accumulated agent memory in a portable format. The SQLite database and identity files are the primary lock-in mechanisms.
  • Dashboard is a separate hosted service: The real-time monitoring dashboard runs at app.agent-swarm.dev, meaning operational visibility depends on a vendor-hosted service even for self-hosted deployments.

Related