npx clawboo, and is built for the developer who wants to coordinate several coding/agentic runtimes as a single fleet instead of babysitting each one in its own terminal.
The hard part of multi-agent work is not getting one agent to do one thing; every runtime already does that. The hard part is heterogeneous coordination: getting agents that don’t share a protocol, a memory, or a notion of “done” to work as a team, durably, without one runaway agent burning your budget or shipping broken work. Clawboo exists for that problem. This page explains what Clawboo is, the three design wedges that distinguish it, who it’s for, and why it’s worth running over wiring the runtimes together yourself.
What it is, and what it isn’t
Clawboo is a coordination and governance plane, not a new agent runtime. It does not reimplement an agent loop; it wraps each runtime behind oneRuntimeAdapter interface, supervises it, and relays its work. The five runtimes stay first-class; Clawboo owns the shared plane (the registry, the board, peer chat, shared memory, the tools broker, verification, governance, the event log, the worktree system-of-record) and each runtime keeps its private plane (its own channels, cron/heartbeat, native memory, built-in tools, and session resume). The split is resolved from each runtime’s declared capabilities, not hardcoded per id, so a runtime gets exactly the integration depth its architecture allows.
This is the deliberate opposite of a “brain-only, replace-everything” architecture. Clawboo never resells inference, never serves a runtime’s messaging channels, and never clobbers a runtime’s native memory or connectors. It is additive: it multiplies what the runtimes can do together rather than disintermediating them.
Concretely, every agent on a Clawboo team, a Boo, is a real agent record backed by an AgentSource, keyed by its sourceId (its runtime). SQLite is the registry of record for who exists; the runtime is the source of truth for how an agent runs; the board is the source of truth for current task state. No decorative agents, no fake edges, no view-model a writer keeps in sync.

The wedge: three things standalone runtimes can’t give you
Three design choices distinguish Clawboo from a hand-rolled fan-out script that pipes prompts to several runtimes. Each is grounded in a concept page; the short version is here.1. Mixed-runtime peer chat: every runtime is a named peer, any runtime can lead
In most multi-runtime setups, cross-runtime messaging is a privileged parent/child relay: one orchestrator dispatches to subordinate runtimes, and a subordinate can never be the orchestrator. Clawboo’s peer chat is a flat room instead. Every team member, whether it runs on the native harness, OpenClaw, Claude Code, Codex, or Hermes, posts into one durable room as a named peer, and any of them can be the one to lead. The leader is one peer among equals; the speaker-selection policy is a pure function that any runtime can be the output of. Two properties make the flat room safe to run across heterogeneous runtimes:- Identity is bound at attach time, not by the model. A runtime cannot post as a peer it isn’t. The room author and team are authoritative from a connection binding that Clawboo writes into the runtime’s MCP config; the model never controls it, and a forged author in tool arguments is ignored.
- Peer posts are evidence, never instructions. Every delivered post is wrapped with the safety-critical
isUser=falsetoken (borrowed verbatim from OpenClaw’s inter-session wire format), so a teammate that says “ignore your instructions” can never land with user authority. Escalation is prevented by construction, not by the receiver’s judgement.
2. Board + chat fusion: the board is canonical, chat is narration
A chat transcript can’t be transactionally claimed, can’t survive a refresh as authority, and can’t tell a crashed run from a slow one. So Clawboo splits the two: the durable board is the source of truth for task and coordination state, and chat is narration of that state. A board mutation is the decision of record. A chat message is a description of a decision, never a write path back to the board. When a board mutation happens it reflects itself into the team room as a system line, but only after the canonical write, so the narration can never become the authority. That single rule is what buys durability (state survives restart), race-freedom (a single conditionalUPDATE lets exactly one assignee claim a task, and a lost claim is a 409 the caller must never retry), and recoverability (boot-time orphan reconciliation and a stale-task backstop release stuck work).
The same discipline runs all the way down: delegation and orchestration reads structured signals from typed runtime events, a sessions_send tool call, or a <delegate>/<plan> directive parsed from a terminal done, never a regex over model prose. A delegation becomes a board task with a claim, an execution ledger, and a status; the result rounds back to the board and reflects to the leader. The board is the dispatcher; the worktree is the isolated world the dispatched work happens in.
3. Builder ≠ judge verification, plus hard governance
A generator self-grading is a known failure mode, and “the agent said it’s done” is not evidence. Clawboo makesdone mean verified through the builder ≠ judge principle: the agent that did the work never certifies its own work. The only two signals into done for file-mutating work are a machine truth (the task’s own verify command, judged by its exit code) and a structurally-independent reviewer (a read-only critic in a detached, push-less checkout who literally cannot mutate the builder’s branch). See verification.
The gate is intrinsic to the board state machine, not an opt-in step; any transition to done with a non-promotable verdict is rejected for every caller (the REST route, the Tasks-MCP tool, the orchestrator), the only escape being an explicit, audited humanOverride. When the fix loop runs out of budget, a task lands completed_with_debt rather than deadlocking, but that exit is itself gated: debt over a green gate ships with a paper trail; debt over a red gate routes to a human.
Alongside verification, governance makes a runaway agent impossible, not just visible:
- A budget kill-switch. A USD cap per agent, mission (delegation tree), or team. In
capmode, crossing 100% aborts the live run and releases the task; the shipped default iswarnmode (track and warn, no enforcement) so a fresh install enforces no dollars until you opt in. - Five circuit breakers. A deterministic, cross-runtime backstop that halts a run thrashing on a repeated failing call, making no progress, or burning tokens with nothing to show, keyed on typed
RuntimeEventfields, never on prose. - Caps and approvals. A depth cap (a delegation tree goes at most two levels deep), a fan-out cap, a per-run cost ceiling, and a human approval handshake for risky delegations that fails closed if the approval endpoint is unreachable.
Who it’s for
Clawboo is for a developer running a team of AI agents, not a single assistant. You feel its value when:- you want to mix runtimes on one team, a native agent that needs no Gateway, a Claude Code worker, a Hermes teammate that keeps its own skills and memory, and have them coordinate instead of running in separate windows;
- you need coordination that survives a refresh: a durable board, race-free claiming, crash recovery, and cross-runtime handoff, not a chat scrollback you have to babysit;
- you want a USD budget, a kill-switch, and a verification gate around agentic work before you let several agents loose on a repo;
- you want one place to see what the fleet is doing, a Ghost Graph and Atlas over a single event log, with traces, fleet-health, and cost.
Why this over running the runtimes standalone
You could wire the runtimes together yourself. The cost of doing so is exactly the shared plane Clawboo already built:| You need | Standalone runtimes give you | What Clawboo adds |
|---|---|---|
| Cross-runtime collaboration | A relay you script per pair, usually parent→child | A flat peer room any runtime can lead, with identity bound at attach time |
| Durable coordination | A chat transcript that vanishes on refresh | A transactional board: atomic claim, dependency chains, crash recovery |
| One shared memory | Each runtime’s private memory, siloed | A shared Memory-MCP tier all five runtimes read/write, on top of each one’s private tier |
| ”Done” means done | The agent’s own say-so | A builder ≠ judge gate intrinsic to the board state machine |
| Bounded spend and effort | Per-runtime, if any | A USD kill-switch, five circuit breakers, and depth/fan-out/cost caps |
| Visibility across the fleet | Per-runtime logs | One append-only event log every view is a projection of |
Boundaries and non-goals
- Not a new agent runtime. Clawboo coordinates the five it supports; it does not implement an agent loop or compete with them. A teammate is a
RuntimeAdapter. - Not a privilege boundary. Budgets and breakers bound spend and effort; a worktree bounds concurrency. A real privilege boundary (a container) is a documented, opt-in escalation, not something governance provides.
- OpenClaw is a connected substrate, not a Clawboo-spawned CLI. OpenClaw agents live inside an always-on Gateway Clawboo drives as an operator client; a server-side executor run is refused for a connected-substrate runtime (it has no isolated worktree to drive). It is the fifth runtime, handled differently for that architectural reason.
- Single implicit tenant today. Multi-tenant scoping (
tenant_idcolumns, a reserved budget scope) is a dormant future seam, not a shipped feature. So is a human participant posting into a room.
This documents the v0.2.0 working tree (commit
03b206a). The current npm latest is clawboo@0.1.9, so npx clawboo installs 0.1.9 until the v0.2.0 tag is published. Differences are noted in Known Issues.See also
- How it works: the end-to-end architecture overview
- Why Clawboo: the differentiators against running runtimes standalone, in depth
- Peer chat: the mixed-runtime team room
- The board: the durable, canonical task substrate
- Verification: the builder ≠ judge gate on
→ done - Governance: budgets, the kill-switch, circuit breakers, and caps
- The agent model: Boos, Boo Zero, and the five runtime classes
- Getting started: install and run your first team
- Glossary: canonical term definitions