Comprehensive Observability Multi-Agent Platform for Adaptive System Solutions
AI-powered incident investigation platform that reduces MTTR by 67-90% using parallel OODA loops, ICS principles, and scientific methodology.
The Problem: Traditional incident investigation tools require senior engineers to manually connect dots between metrics, logs, and traces. Average MTTR: 2-4 hours. Knowledge concentrated in a few experts.
The Solution: COMPASS uses AI agents with scientific methodology to systematically test hypotheses in parallel, filtering out noise and presenting only high-confidence root causes to humans.
Key Differentiators:
- 🧪 Scientific rigor: systematic hypothesis disproof (not just pattern matching)
- ⚡ Parallel OODA loops: 5+ agents testing simultaneously (10x faster than sequential)
- 🤖 Bring your own LLM: OpenAI, Anthropic, or any provider (cost-controlled)
- 👥 Learning Teams approach: focus on contributing causes, not blame
- 📝 Automatic post-mortems: Markdown documentation for every investigation
- 💰 Cost-aware: $10/investigation routine, $20 critical (transparent budgets)
Current Status: Production-grade foundation ready for Database Agent implementation
🎉 Phase 5 Complete - Multi-Agent Orchestrator (Production-Ready)
Current Capabilities:
- ✅ Multi-Agent Orchestration - Sequential dispatch of Application, Database, Network agents
- ✅ Production-grade agents - ApplicationAgent and NetworkAgent with 95%+ test coverage
- ✅ CLI Interface - `investigate-orchestrator` command with budget management
- ✅ Cost Control - Per-agent budget tracking, transparent cost breakdown
- ✅ Hypothesis Ranking - Confidence-based ranking across all agents
- ✅ Graceful Degradation - Continues investigation even if agents fail
- ✅ OpenTelemetry Tracing - Distributed tracing from day 1
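Confidence-based ranking across agents amounts to merging each agent's candidate hypotheses and sorting by confidence. A minimal sketch — the `Hypothesis` dataclass and `rank_hypotheses` helper below are illustrative, not the project's actual types:

```python
from dataclasses import dataclass

@dataclass
class Hypothesis:
    agent: str         # which agent proposed it, e.g. "network"
    summary: str
    confidence: float  # 0.0 - 1.0

def rank_hypotheses(hypotheses, top_n=5):
    """Merge hypotheses from all agents and keep the most confident."""
    return sorted(hypotheses, key=lambda h: h.confidence, reverse=True)[:top_n]

candidates = [
    Hypothesis("application", "High error rate in payment service", 0.85),
    Hypothesis("network", "DNS resolution timeout detected", 0.92),
    Hypothesis("database", "Connection pool nearing exhaustion", 0.78),
]
ranked = rank_hypotheses(candidates)
for i, h in enumerate(ranked, 1):
    print(f"{i}. [{h.agent}] {h.summary} ({h.confidence:.0%})")
```

Ranking happens once, after all agents have reported, so a low-confidence agent cannot crowd out a high-confidence finding from another domain.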
Recent Achievements (Phase 5):
- Orchestrator: Sequential multi-agent coordination (15/15 tests passing)
- Competitive Review: Agent Beta promoted for architectural simplification
- Complexity Reduction: Removed ThreadPoolExecutor (saved 4 hours, zero threading bugs)
- CLI Integration: Full investigation workflow from command line
- Documentation: Comprehensive decision rationale and design docs
Previous Achievements:
- Day 4: Agent LLM/MCP integration, ADR documentation (Handoff)
- Day 3: OpenAI/Anthropic integration, fixed 8 critical bugs (Report)
- Day 2: Scientific framework with quality-weighted confidence scoring (Report)
Next: Post-implementation competitive review, then Phase 6 optimization
Last Updated: 2025-11-21
Investigate an incident using the orchestrated multi-agent system:

```bash
# Simple investigation
python -m compass.cli.main investigate-orchestrator INC-12345

# With budget and affected services
python -m compass.cli.main investigate-orchestrator INC-12345 \
  --budget 15.00 \
  --affected-services payment,checkout \
  --severity critical
```

What you get:
- Sequential dispatch of Application, Database, and Network agents
- Observations consolidated from all agents
- Top 5 hypotheses ranked by confidence
- Per-agent cost breakdown with budget utilization
Example Output:
```
🔍 Initializing investigation for INC-12345
💰 Budget: $15.00
🎯 Affected Services: payment, checkout
⚠️  Severity: critical
🔍 Observing incident (sequential agent dispatch)...
✅ Collected 12 observations
🧠 Generating hypotheses...
✅ Generated 5 hypotheses
📊 Top Hypotheses (ranked by confidence):

  1. [network] DNS resolution timeout detected
     Confidence: 92.00%

  2. [application] High error rate in payment service
     Confidence: 85.00%

  3. [database] Connection pool nearing exhaustion
     Confidence: 78.00%

💰 Cost Breakdown:
  Application: $2.1500
  Database:    $1.8500
  Network:     $0.9500
  ─────────────────────────
  Total: $4.9500 / $15.00
  Utilization: 33.0%
```
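The cost breakdown is simple arithmetic over per-agent spend against the investigation budget. A minimal sketch — the `budget_report` function and dict shape are illustrative, not the actual COMPASS API:

```python
def budget_report(per_agent_costs, budget):
    """Summarize per-agent spend against the investigation budget."""
    total = sum(per_agent_costs.values())
    return {
        "total": round(total, 4),
        "budget": budget,
        "utilization_pct": round(100 * total / budget, 1),
        "remaining": round(budget - total, 4),
    }

# Figures from the example output above
report = budget_report(
    {"application": 2.15, "database": 1.85, "network": 0.95},
    budget=15.00,
)
print(report)
```

Keeping per-agent costs separate (rather than a single running total) is what makes the transparent per-agent breakdown possible.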
Complete demo environment with real observability stack:

```bash
# 1. Start demo environment
./scripts/run-demo.sh

# 2. Trigger an incident (missing index, lock contention, or pool exhaustion)
./scripts/trigger-incident.sh missing_index

# 3. Investigate with COMPASS (classic mode)
poetry run compass investigate \
  --service payment-service \
  --symptom "slow database queries and high latency" \
  --severity high
```

Full demo guide: DEMO.md (~10 minutes first run)
- Start here: docs/product/COMPASS_Product_Reference_Document_v1_1.md
- Understand the architecture: docs/architecture/COMPASS_MVP_Architecture_Reference.md
- Build guide: docs/guides/COMPASS_MVP_Build_Guide.md
- Development workflow: docs/guides/compass-tdd-workflow.md
```
compass/
├── docs/                 # All documentation
│   ├── architecture/     # System architecture documents
│   ├── product/          # Product strategy and requirements
│   ├── guides/           # Build guides and workflows
│   ├── reference/        # Quick references and indexes
│   └── research/         # Research papers (PDFs)
│
├── src/                  # Source code (in development)
│   ├── compass/          # Main Python package
│   │   ├── core/         # OODA loop, scientific framework
│   │   ├── agents/       # Agent implementations
│   │   ├── cli/          # CLI interface
│   │   ├── api/          # API server
│   │   └── integrations/ # MCP integrations
│   └── tests/            # Test suite
│
├── planning/             # Planning conversations
│   ├── conversations/    # Original HTML chats
│   └── transcripts/      # Extracted text transcripts
│
├── examples/             # Example configurations and templates
│   ├── configurations/   # Sample YAML configs
│   └── templates/        # Agent templates
│
├── deployment/           # Deployment configurations
│   ├── k8s/              # Kubernetes manifests
│   └── docker/           # Docker files
│
└── scripts/              # Utility scripts
```
COMPASS uses AI agents organized according to Incident Command System (ICS) principles to investigate incidents using parallel OODA loops and scientific methodology.
Key Differentiators:
- Parallel OODA Loops: 5+ agents test hypotheses simultaneously
- Scientific Rigor: Systematic hypothesis disproof before human escalation
- Learning Culture: Learning Teams methodology vs traditional RCA
- Human-in-the-Loop: Level 1 autonomy - AI proposes, humans decide
Technology Stack:
- Language: Python only (readability over complexity)
- Database: PostgreSQL + pgvector
- Observability: LGTM stack (Loki, Grafana, Tempo, Mimir)
- Deployment: Kubernetes (Tilt for local dev)
- LLM: Provider agnostic (OpenAI, Anthropic, Copilot, Ollama)
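Provider agnosticism usually comes down to a narrow interface that each backend adapts to. A minimal sketch using `typing.Protocol` — the names here are hypothetical, and real adapters would wrap the OpenAI/Anthropic/Ollama SDKs:

```python
from typing import Protocol

class LLMProvider(Protocol):
    """Minimal provider interface: any backend that can complete a prompt."""
    def complete(self, prompt: str) -> str: ...

class FakeProvider:
    """Stand-in for a real provider client, useful in tests."""
    def complete(self, prompt: str) -> str:
        return f"echo: {prompt}"

def investigate(provider: LLMProvider, symptom: str) -> str:
    """Agent logic depends only on the protocol, never on a vendor SDK."""
    return provider.complete(f"Propose a hypothesis for: {symptom}")

print(investigate(FakeProvider(), "slow database queries"))
```

Because `Protocol` uses structural typing, a new provider only has to implement `complete` with a matching signature; no shared base class or registration step is required.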
Agent Hierarchy (ICS-based):

```
Orchestrator
├── Database Manager → Workers
├── Network Manager → Workers
├── Application Manager → Workers
└── Infrastructure Manager → Workers
```
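Sequential dispatch with graceful degradation (a failing agent is recorded and skipped, not fatal) can be sketched like this. The callable-based `agents` mapping and all names are illustrative assumptions, not the orchestrator's real interface:

```python
def dispatch_sequential(agents, incident):
    """Run each agent in turn; a failing agent is recorded and skipped."""
    observations, failures = [], []
    for name, run_agent in agents.items():
        try:
            observations.extend(run_agent(incident))
        except Exception as exc:
            failures.append((name, str(exc)))  # degrade gracefully, keep going
    return observations, failures

def broken_database_agent(incident):
    raise RuntimeError("connection timeout")

agents = {
    "application": lambda inc: [f"{inc}: error rate spike in payment service"],
    "database": broken_database_agent,
    "network": lambda inc: [f"{inc}: DNS latency above baseline"],
}
observations, failures = dispatch_sequential(agents, "INC-12345")
print(observations)
print(failures)
```

The investigation still returns two agents' observations even though the database agent failed, which is the behavior the "Graceful Degradation" capability describes.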
OODA Loop Phases:
- Observe: Parallel data gathering
- Orient: Hypothesis generation and ranking
- Decide: Human decision points
- Act: Evidence gathering and hypothesis testing
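The four phases above can be wired together as a single loop iteration. A toy sketch, assuming each phase is a pluggable callable (this is not the actual COMPASS implementation):

```python
def ooda_cycle(incident, observe, orient, decide, act):
    """One OODA iteration over an incident."""
    observations = observe(incident)   # Observe: gather telemetry
    hypotheses = orient(observations)  # Orient: generate and rank hypotheses
    chosen = decide(hypotheses)        # Decide: human decision point
    return act(chosen)                 # Act: gather evidence, test hypothesis

# Toy phase implementations, purely for illustration
result = ooda_cycle(
    "INC-12345",
    observe=lambda inc: [f"{inc}: p99 latency up 4x"],
    orient=lambda obs: ["slow query missing index", "DNS timeout"],
    decide=lambda hyps: hyps[0],  # stand-in for human approval of the top pick
    act=lambda hyp: {"hypothesis": hyp, "evidence": "EXPLAIN shows sequential scan"},
)
print(result)
```

Keeping `decide` as an injected callable is one way to express Level 1 autonomy: the loop structure never acts without the decision step in the middle.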
- Product Overview:
  - COMPASS_Product_Reference_Document_v1_1.md - Complete product specification
- Architecture:
  - COMPASS_MVP_Architecture_Reference.md - MVP architecture
  - COMPASS_MVP_Technical_Design.md - Technical design details
- Build Guides:
  - COMPASS_MVP_Build_Guide.md - Step-by-step build instructions
  - compass-tdd-workflow.md - TDD development process
- Reference:
  - compass-quick-reference.md - Quick reference guide
  - COMPASS_CONVERSATIONS_INDEX.md - Searchable index of all planning conversations
  - INDEXING_SYSTEM_SUMMARY.md - How to use the conversation index
Scientific Framework:
- COMPASS_SCIENTIFIC_FRAMEWORK_DOCS.md
- compass_scientific_framework.py - Core implementation

Enterprise Features:
- COMPASS_Enterprise_Knowledge_Architecture.md
- compass_enterprise_knowledge_guide.md - Enterprise user guide
Human-AI Interface:
Research Papers (in docs/research/):
- ICS-Based Multi-Agent AI Systems for Incident Investigation
- Evaluation of Learning Teams vs Root Cause Analysis
- Problems with Root Cause Analysis
- Product vision and requirements
- Complete architecture design
- Scientific framework specification
- Multi-agent coordination design
- Enterprise knowledge integration design
- CLI interface design
- Prototype code (scientific framework, database agent)
- Comprehensive documentation
- Test framework design
- MVP implementation (not started)
Phase 1: Foundation (Weeks 1-2)
- Basic LGTM integration
- Single agent (database)
- CLI interface
- Cost tracking
Phase 2: Trust (Weeks 3-4)
- Hypothesis confidence scoring
- Evidence linking
- Graceful failure handling
Phase 3: Value (Weeks 5-6)
- Pattern learning
- Personal runbooks
- Metrics tracking
All planning conversations are indexed and searchable:
```bash
# Search the conversation index
grep -i "topic_name" docs/reference/COMPASS_CONVERSATIONS_INDEX.md

# Example: Find information about cost management
grep -i "cost" docs/reference/COMPASS_CONVERSATIONS_INDEX.md
```

See docs/reference/INDEXING_SYSTEM_SUMMARY.md for detailed usage.
- Getting Started: docs/guides/
- Architecture Details: docs/architecture/
- Product Strategy: docs/product/
- Research Background: docs/research/
- Planning History: planning/
From docs/guides/claude.md:
- Production-First: Every component production-ready from inception
- Test-Driven Development: TDD rigorously from day 1
- OODA Loop Focus: Optimize for iteration speed over perfect analysis
- Scientific Method: Systematically disprove hypotheses before presenting
- Human Authority: Humans decide, AI advises and accelerates
- Cost Management: Token budget caps, transparent pricing
- Learning Culture: Focus on contributing causes, not blame
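The "systematically disprove hypotheses" principle can be sketched as a filter: a hypothesis reaches a human only if no disproof test eliminates it. All names below are illustrative, not the framework's real API:

```python
def surviving_hypotheses(hypotheses, disproof_tests):
    """Scientific-method filter: keep only hypotheses no test disproves."""
    return [
        hyp for hyp in hypotheses
        if not any(is_disproved(hyp) for is_disproved in disproof_tests)
    ]

hypotheses = ["missing index", "network partition", "connection pool exhaustion"]
disproof_tests = [
    # e.g. cross-zone health checks came back clean, ruling out a partition
    lambda hyp: hyp == "network partition",
]
print(surviving_hypotheses(hypotheses, disproof_tests))
```

Framing checks as attempts to *disprove* (rather than confirm) is what keeps pattern-matching noise from reaching the human decision point.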
See development guides:
- compass-tdd-workflow.md - Test-driven development
- compass-claude-code-instructions.md - Claude Code workflow
- compass-day1-startup.md - Day 1 setup guide
[To be determined]
[To be added]
Ready to build! See docs/guides/COMPASS_MVP_Build_Guide.md to get started.