The bottleneck in software engineering is no longer model intelligence — it's human attention. This repo gives you the primitives to decide what to build, then run the how asynchronously for hours or days, with safety rails that survive long-horizon execution.
Three roles do the work: Orchestrators plan and define correctness, Workers implement against clean context, Validators verify adversarially. Features run with scope-isolated parallelism (one agent, one worktree, one branch) to avoid stepping on each other; reads, research, and reviews always run in parallel. State survives long pauses through structured handoffs at milestone boundaries.
The skills here are harness-agnostic: they install into Claude Code (.claude/skills/), Codex (.agents/skills/), Gemini (.gemini/), and any other harness that can discover Markdown-based skill definitions. The same SKILL.md file drives the same workflow regardless of which assistant invokes it. Repo-level guidance is similarly cross-harness: AGENTS.md is a symlink to CLAUDE.md, so Codex (which looks for AGENTS.md) and Claude Code (which looks for CLAUDE.md) read the same canonical contract.
Plan work, define correctness up-front via OpenSpec scenarios, decompose into work packages, dispatch and converge.
/explore-feature— surface high-value next features from architecture artifacts and code signals/plan-feature— create an OpenSpec proposal with approaches considered + work packages/iterate-on-plan— refine the proposal via structured self-review before approval/prototype-feature— dispatch N parallel variant skeletons for human pick-and-choose convergence/implement-feature— orchestrate work-package dispatch (also a Worker for sequential tier)/plan-roadmap//autopilot-roadmap— decompose a multi-feature proposal and execute it iteratively/autopilot— drive the full plan → review → implement → validate → PR lifecycle with multi-vendor convergence
Implement work packages against clean context, scoped to non-overlapping write_allow globs.
/implement-feature— write code, tests, and docs per work package/iterate-on-implementation— refine the implementation via structured self-review/quick-task— delegate small ad-hoc tasks to any configured vendor without OpenSpec ceremony
Verify correctness adversarially across two surfaces:
- Scrutiny validators —
/parallel-review-planand/parallel-review-implementationdispatch vendor-diverse reviewers (Claude, Codex, Gemini) and merge their structured findings againstreview-findings.schema.jsonviaconsensus_synthesizer.py - Behavioral validators —
/gen-evalruns scenarios against the live deployment (HTTP/MCP/CLI surfaces); the Playwright validator covers frontend surfaces. Both produce findings in the same schema, merged into a single ranked list - Quality validators —
/bug-scrub,/security-review,/tech-debt-analysis,/simplifyfor cross-cutting health checks
Multi-agent coordination system for AI coding assistants. Provides file locking, work queues, session handoffs, and agent discovery backed by PostgreSQL (ParadeDB by default for local development; Supabase supported as an optional cloud-managed backend).
The coordinator pairs with worktree isolation (.git-worktrees/<change-id>/<agent-id>/) so that multiple agents can write in parallel without colliding on the shared checkout. Each agent gets its own checkout, its own branch (openspec/<change-id>--<agent-id>), and a heartbeat-backed lease in the worktree registry. When the work converges, merge_worktrees.py integrates the per-package branches back into the feature branch. In cloud-harness environments where each agent already gets an isolated container, worktree operations short-circuit transparently — see docs/cloud-vs-local-execution.md.
- Overview — Architecture, capabilities, and design decisions
- Quick Start — Setup, installation, and MCP integration
- Parallel Agentic Development — Worktree isolation, scope discipline, parallel DAG execution
- Specification — formal requirements
Structured feature development workflow distributed as harness-agnostic skills. Each skill is a SKILL.md file with YAML frontmatter (name, description, triggers, user_invocable, related) and Markdown body. skills/install.sh rsyncs the canonical sources at skills/ into harness-specific runtime directories (.claude/skills/, .agents/skills/ for Codex, plus per-vendor variants). The same skill drives the same workflow whether invoked via Claude Code's /skill-name, Codex's slash palette, or Gemini's tooling.
- Workflow Guide — Stage-by-stage explanation and design principles
- Skills Catalogue — Discoverable index of every skill grouped by purpose
- Project Guidelines — Workflow tables, conventions, worktree contract, git conventions
- Specification — formal requirements
Agent Coordinator: Follow the Quick Start to start PostgreSQL (ParadeDB via Docker Compose, or Supabase), install dependencies, and configure your harness's MCP integration.
Skills Framework: Run bash skills/install.sh --mode rsync --deps none --python-tools none to sync skills into the harness directories present in your repo. Each skill is then invocable via your harness's slash-command palette. Start with /plan-feature to create a proposal for your next feature, or /explore-feature if you don't yet know what to build.
Cross-repo usage: To use these skills from a different repository, see docs/cross-repo-setup.md.
agentic-coding-tools/
├── agent-coordinator/ # Multi-agent coordination system (MCP server, locking, work queue)
├── skills/ # Canonical skill sources (≈55 skills, harness-agnostic)
│ ├── plan-feature/ # Feature workflow: plan → implement → validate → cleanup
│ ├── autopilot/ # Multi-vendor convergence orchestrators
│ ├── parallel-review-*/ # Vendor-diverse adversarial review
│ ├── worktree/ # Worktree lifecycle (setup/teardown/merge/GC)
│ ├── references/ # Shared checklists cited by multiple skills
│ └── tests/ # Skill tests (excluded from install.sh rsync)
├── openspec/ # Specifications and proposals
│ ├── specs/ # 21 formal specifications
│ └── changes/ # Active and archived proposals
├── docs/ # Documentation (see CLAUDE.md § Documentation for the index)
├── evaluation/ # Benchmarking harness for coordination effectiveness
├── formal/ # Formal models and verification artifacts
├── scripts/ # Cross-cutting helper scripts
├── .agents/skills/ # Codex runtime copy (regenerated by skills/install.sh)
├── .claude/skills/ # Claude Code runtime copy (regenerated by skills/install.sh)
├── .codex/ # Codex harness config
├── .gemini/ # Gemini harness config
└── .githooks/ # pre-commit / post-merge hooks (incl. coordinator-task-status-renderer)
Important: .claude/skills/ and .agents/skills/ are runtime copies rsynced from skills/ by install.sh. Never edit them directly — changes will be overwritten. Always edit the canonical sources at skills/<skill-name>/SKILL.md.
All features are formally specified using OpenSpec. The full set lives at openspec/specs/. Headline specs:
| Spec | Description |
|---|---|
| agent-coordinator | File locking, work queue, MCP/HTTP, verification, guardrails |
| skill-workflow | Iterative refinement, parallel execution, worktree isolation |
| worktree | Worktree lifecycle, registry, branch resolution, GC |
| coordination-bridge | HTTP fallback when MCP transport is unavailable |
| agent-archetypes / agent-identity | Vendor routing and agent identity model |
| evaluation-framework / gen-eval-framework | Behavioural validation harnesses |
| live-service-testing / observability | Validation against deployed services |
| merge-pull-requests | PR triage, review, and merge from multiple sources |
| roadmap-orchestration | Multi-feature decomposition and autopilot execution |
See docs/skills-catalogue.md for the complete skill inventory and CLAUDE.md for the canonical workflow tables.
The canonical single-feature flow with all optional refinement and review gates:
/explore-feature [focus-area] (optional) → Candidate shortlist for next work
/plan-feature <description> → Proposal approval gate
/iterate-on-plan <change-id> (optional) → Refines plan before approval
/parallel-review-plan <change-id> (optional) → Independent plan review (vendor-diverse)
/prototype-feature <change-id> (optional) → N parallel variant skeletons + human pick-and-choose
/implement-feature <change-id> → PR review gate (runs spec + evidence validation)
/iterate-on-implementation <change-id> (optional) → Refinement before merge
/parallel-review-implementation <change-id> (optional) → Per-package review (vendor-diverse)
/cleanup-feature <change-id> → Done (runs deploy + security validation before merge)
Each skill auto-selects an execution tier (coordinated / local-parallel / sequential) based on coordinator availability and feature complexity. For long-running multi-feature work, /plan-roadmap + /autopilot-roadmap automate the loop with learning feedback between iterations.
For deeper detail see CLAUDE.md, docs/skills-workflow.md, and docs/parallel-agentic-development.md.
MIT