MAKER Framework

Overview

MAKER stands for Maximal Agentic decomposition, first-to-ahead-by-K Error correction, and Red-flagging. It is an AI orchestration framework designed to enhance the reliability and accuracy of AI systems through consensus-based voting, proactive validation, and intelligent task decomposition. The framework is based on the paper "Solving a million-step LLM task with zero errors" by Cognizant AI Lab.

Key Features

  • Dual-Phase Orchestration: Separates planning and execution into distinct phases, each with its own AI models and voting mechanisms for specialized task handling.

  • Consensus-Based Voting System: Implements a "first-to-ahead-by-K" voting algorithm where multiple AI agents vote on proposed plans and execution outputs. A decision is accepted only when the margin of agreement exceeds K votes, ensuring high-confidence outcomes.

  • Adaptive Task Decomposition: Breaks down complex prompts into granular, executable steps with dependency tracking. Steps can specify required prerequisites and format requirements for precise control flow.

  • Real-Time Red-Flag Validation: Employs extensible validators that automatically retry requests when outputs fail quality checks, preventing invalid responses from propagating through the system.

  • Batch Processing with Configurable Parallelism: Processes steps in configurable batch sizes with interleaved concurrent voting to optimize throughput while maintaining quality control.

  • Event-Driven Progress Tracking: Provides rich event hooks for real-time monitoring and visualization of planning and execution progress.

  • Rejection Feedback Loop: When votes reject a proposal, the system automatically incorporates rejection reasons into the next iteration, enabling continuous refinement.

  • Format-Aware Execution: Supports custom output formats (JSON, plaintext, etc.) that can be injected into specific steps requiring structured data.

Working Principle

The MAKER framework operates through a sophisticated dual-phase orchestration system that separates planning and execution, each employing consensus-based voting and error correction mechanisms. Here's how it works:

Architecture Overview

The framework consists of three main components:

  1. Executor: The main entry point that coordinates both planning and execution phases
  2. PlanningOrchestrator: Manages task decomposition and plan validation
  3. ExecutionOrchestrator: Handles step execution and output verification

Each orchestrator uses dedicated AI clients:

  • Planning Client: Generates task decomposition proposals
  • Plan Voting Client: Validates proposed plans through consensus
  • Execution Client: Executes individual steps
  • Execution Voting Client: Validates execution outputs

Phase 1: Planning Phase

Task Decomposition: The planning phase breaks down complex tasks into granular, executable steps through an iterative process:

  1. The Planning Client proposes a batch of steps (configurable batch size, default 2)
  2. Each proposed step includes:
    • Task: Description of the step to perform
    • RequiredSteps: Dependencies (indices of prerequisite steps)
    • RequiresFormat: Flag indicating if the step needs format information
    • ExtraContext: Additional context for step execution

Consensus Voting (First-to-Ahead-by-K): After steps are proposed, the Plan Voting Client evaluates them:

  • Multiple voting agents review the proposal concurrently; votes accumulate until a margin of K separates the outcomes
  • Votes can be: "Yes" (approve), "No" (reject with reason), or "End" (task complete)
  • The voting continues until one of these conditions is met:
    • Approval: Yes votes >= No votes + K (the proposal is accepted)
    • Rejection: No votes >= Yes votes + K (the proposal is rejected)
    • Completion: End votes = K (all voters agree the task is complete)
    • Contentious: Total votes >= K × 4 without consensus (the voting round fails)
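The first-to-ahead-by-K rule can be expressed as a short decision function. The following is an illustrative Python sketch of the four conditions above, not the framework's actual .NET implementation:

```python
def tally(votes, k):
    """Apply the first-to-ahead-by-K rule to an ordered stream of votes.

    Vote values follow the framework: "Yes", "No", or "End".
    Returns "approved", "rejected", "complete", "contentious", or "pending".
    """
    yes = no = end = 0
    for total, vote in enumerate(votes, start=1):
        yes += vote == "Yes"
        no += vote == "No"
        end += vote == "End"
        if end == k:
            return "complete"      # all voters agree the task is done
        if yes >= no + k:
            return "approved"      # Yes leads No by at least K
        if no >= yes + k:
            return "rejected"      # No leads Yes by at least K
        if total >= k * 4:
            return "contentious"   # too many votes without consensus
    return "pending"               # margin not yet reached; keep voting
```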

Rejection Feedback Loop: When a proposal is rejected:

  1. Rejection reasons from "No" votes are collected
  2. The framework retries with the rejection feedback incorporated into the next proposal
  3. If retries exceed the maximum (default 5), planning restarts from scratch
  4. This ensures continuous refinement until a high-quality plan emerges

Plan Completion: The planning phase continues proposing and voting on new steps until a step with Task = "End" is accepted, indicating the plan is complete.

Phase 2: Execution Phase

Batch Processing: Execution processes the planned steps in configurable batches (default 2 steps per batch):

  1. Steps are executed in order, respecting RequiredSteps dependencies
  2. Each batch maintains a cumulative state object that evolves as steps complete
  3. Format information is injected into prompts when RequiresFormat = true

Step Execution: For each batch of steps:

  1. The Execution Client receives:

    • The original task description
    • Current batch of steps to execute
    • Previous state (output from prior batches)
    • Output format specification (if RequiresFormat = true for any step in the batch)
    • Extra context (if ExtraContext is specified for any step in the batch)
    • Rejection feedback (if previous attempt failed)
  2. The client produces a new state representing the updated solution

Context Injection: Steps can provide additional information through two specialized fields:

  • Format Context (RequiresFormat): When set to true, the global output format specification is injected into the execution prompt, guiding the AI to produce correctly structured output (e.g., JSON schema, specific data formats)
  • Extra Context (ExtraContext): Allows individual steps to carry step-specific instructions, constraints, or reference data that supplement the main task description. This is particularly useful for steps requiring specialized knowledge or particular execution guidelines

Both context types are dynamically inserted into the execution prompt only when present, keeping prompts lean while enabling precise control over complex execution scenarios.
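Conditional injection along these lines might look like the following. This is a hypothetical Python sketch: the prompt layout and section labels are assumptions based on the description above, not the framework's real prompt template:

```python
def build_prompt(task, steps, prev_state="", fmt=None, feedback=None):
    """Assemble an execution prompt, injecting context only when present."""
    parts = [f"Task: {task}", "Steps to execute:"]
    parts += [f"  {i}. {s['task']}" for i, s in enumerate(steps)]
    if prev_state:
        parts.append(f"Previous state:\n{prev_state}")
    # The format spec is injected only if some step in the batch requires it
    if fmt and any(s.get("requiresFormat") for s in steps):
        parts.append(f"Output format:\n{fmt}")
    # Step-specific extra context rides along only when non-empty
    extras = [s["extraContext"] for s in steps if s.get("extraContext")]
    if extras:
        parts.append("Extra context:\n" + "\n".join(extras))
    if feedback:
        parts.append("Previous attempt rejected because:\n" + "\n".join(feedback))
    return "\n\n".join(parts)
```

Absent sections are omitted entirely, which keeps prompts lean exactly as described above.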

Output Validation: After each execution, the Execution Voting Client validates the output:

  • Uses the same first-to-ahead-by-K voting mechanism as planning
  • Compares the new state against the previous state and task requirements
  • Voters evaluate whether the steps were correctly executed
  • Rejection reasons guide retry attempts

Retry Mechanism: If execution is rejected:

  1. Rejection reasons are incorporated into the next execution attempt
  2. The same batch is re-executed with feedback (up to 5 retries by default)
  3. If max retries exceeded, state is reset and one final attempt is made
  4. This ensures eventual convergence to valid outputs
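The retry loop above can be sketched as follows (illustrative Python; `execute` and `validate` are caller-supplied stand-ins for the Execution Client and Execution Voting Client, not framework APIs):

```python
def execute_batch(execute, validate, batch, state, max_retries=5):
    """Retry a batch with rejection feedback; make one final attempt after a reset.

    `execute(batch, state, feedback)` returns a new state string;
    `validate(new_state)` returns (accepted: bool, reasons: list[str]).
    """
    feedback = []
    for _ in range(max_retries):
        new_state = execute(batch, state, feedback)
        accepted, reasons = validate(new_state)
        if accepted:
            return new_state
        feedback = reasons            # fold rejection reasons into the next attempt
    # Retries exhausted: discard accumulated feedback and make one final attempt
    return execute(batch, state, [])
```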

Red-Flag Validation System

Throughout both phases, the framework employs extensible validators that act as quality gates:

Guarded Requests: Every AI client request is wrapped in a "guarded" call that:

  1. Sends the prompt to the AI client
  2. Extracts JSON from code blocks if present (using regex matching)
  3. Runs the response through all configured validators
  4. If validation fails (throws AIRedFlagException):
    • The failure message is appended to the original prompt
    • The request is automatically retried
    • This loop continues until validation passes
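A guarded request might be sketched as below (illustrative Python; the framework's actual validators implement `IAIRedFlagValidator` in .NET, and the `max_attempts` cap here is an added safeguard for the sketch, since the framework itself retries until validation passes):

```python
import re

class RedFlagError(Exception):
    """Raised by a validator when a response fails a quality check."""

def min_length_validator(text, minimum=100):
    # Mirrors the built-in minimum-length check described below
    if len(text) < minimum:
        raise RedFlagError(f"response shorter than {minimum} characters")

def guarded_request(send, prompt, validators, max_attempts=10):
    """Send a prompt, extract JSON from code blocks, and retry on red flags."""
    for _ in range(max_attempts):
        response = send(prompt)
        # Pull the payload out of a fenced code block when one is present
        match = re.search(r"```(?:json)?\s*(.*?)```", response, re.DOTALL)
        if match:
            response = match.group(1).strip()
        try:
            for validate in validators:
                validate(response)
            return response          # all validators passed
        except RedFlagError as err:
            # Append the failure message to the prompt and retry
            prompt = f"{prompt}\n\nYour previous answer was rejected: {err}"
    raise RedFlagError("exceeded maximum guarded retries")
```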

Built-in Validators:

  • AIRedFlagMinLengthValidator: Ensures responses meet minimum length requirements
    • Planning: 100 characters minimum
    • Voting: 2 characters minimum

Extensibility: Custom validators can be added by implementing IAIRedFlagValidator, enabling domain-specific quality checks (e.g., JSON schema validation, constraint verification, content safety).

Interleaved Concurrent Voting

To optimize throughput while maintaining quality, the framework uses interleaved execution:

  1. K voting requests are generated simultaneously
  2. Results are processed using the TaskUtils.Interleaved() utility
  3. This allows votes to be evaluated as they complete (not waiting for all K votes)
  4. Voting can terminate early once the K-margin threshold is reached
  5. Invalid votes trigger immediate replacement vote generation
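The interleaved behavior can be approximated with asyncio. This is an illustrative Python sketch of the early-exit and top-up behavior, standing in for the framework's .NET TaskUtils.Interleaved() utility:

```python
import asyncio

async def first_to_ahead_by_k(cast_vote, k):
    """Keep K votes in flight; settle as soon as one side leads by K."""
    yes = no = 0
    pending = {asyncio.ensure_future(cast_vote()) for _ in range(k)}
    while pending:
        done, pending = await asyncio.wait(pending, return_when=asyncio.FIRST_COMPLETED)
        for finished in done:        # evaluate votes as they complete
            if finished.result() == "Yes":
                yes += 1
            else:
                no += 1
            if abs(yes - no) >= k:
                for vote in pending:  # margin decided: cancel in-flight votes
                    vote.cancel()
                return "approved" if yes > no else "rejected"
        # Top up to K in-flight votes, capped at K x 4 total votes
        while len(pending) < k and yes + no + len(pending) < k * 4:
            pending.add(asyncio.ensure_future(cast_vote()))
    return "contentious"
```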

State Management

The execution phase maintains a cumulative state:

  1. Initial State: Empty string
  2. Step Execution: Each batch transforms the state
  3. State Evolution: New state is validated against previous state
  4. Format Awareness: Steps marked with RequiresFormat receive the output format specification
  5. Final State: The last batch's state is the final solution
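The state evolution above amounts to a fold over batches (illustrative Python; `execute_batch` is a hypothetical stand-in for one validated batch execution):

```python
def run_plan(plan, execute_batch, batch_size=2):
    """Fold batches of steps over a cumulative state string."""
    state = ""                                 # initial state: empty string
    for i in range(0, len(plan), batch_size):
        batch = plan[i:i + batch_size]
        state = execute_batch(batch, state)    # each batch transforms the state
    return state                               # the last batch's state is the solution
```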

Event-Driven Monitoring

The framework provides rich event hooks for real-time observation:

Planning Events:

  • OnStepsProposed: Fired when new steps are proposed
  • OnStepsAdded: Fired when steps are accepted and added to the plan
  • OnStepsRejected: Fired when a proposal is rejected
  • OnPlanVoteChanged: Fired as each vote is received

Execution Events:

  • OnExecutionStarted: Fired when a batch begins execution
  • OnStateChanged: Fired when state is updated
  • OnExecutionVoteChanged: Fired as each execution vote is received

These events enable real-time visualization, progress tracking, and debugging without affecting the core orchestration logic.

Error Correction Mechanisms

The framework employs multiple layers of error correction:

  1. Red-Flag Validators: Catch malformed responses before they enter the voting system
  2. Consensus Voting: Reject low-quality proposals through democratic agreement
  3. Rejection Feedback: Incorporate failure reasons into retry attempts
  4. Retry Limits: Prevent infinite loops while allowing sufficient correction attempts
  5. State Reset: Clear corrupted state after exhausting retries
  6. Contentious Detection: Identify when voters fundamentally disagree and fail gracefully

Scalability and Performance

Configurable Parallelism:

  • Batch size determines how many steps are planned/executed at once
  • K value controls the consensus threshold (higher K = more agents = higher confidence)
  • Interleaved voting processes votes as they arrive, not in rigid batches

Optimal Settings:

  • Smaller batches (1-2 steps): More granular control, slower execution
  • Larger batches (5-10 steps): Faster execution, less quality control
  • Higher K values (8-15): Higher confidence, more AI calls
  • Lower K values (3-5): Faster decisions, lower confidence

The framework balances speed and reliability through these tunable parameters, adapting to different use cases from high-stakes tasks requiring zero errors to rapid prototyping scenarios.

Use as MCP Server

The easiest way to use MAKER is via the published npm package (@igniteui/maker-mcp), which runs the MCP server in any MCP-compatible client (Claude Desktop, VS Code, Cursor, etc.) with no .NET installation required. The native binary (~50 MB) is downloaded and cached on first run.

Prerequisites

One-time registry setup

This package is published on GitHub Packages. Run these once on your machine:

npm config set @igniteui:registry https://npm.pkg.github.com

Then open ~/.npmrc (Windows: %USERPROFILE%\.npmrc) and add:

//npm.pkg.github.com/:_authToken=YOUR_GITHUB_PAT

You can also place these settings in a project-level .npmrc file:

@igniteui:registry=https://npm.pkg.github.com
//npm.pkg.github.com/:_authToken=${GITHUB_TOKEN}

Installation

Claude Desktop

  1. Open your Claude Desktop config file:

    • Windows: %APPDATA%\Claude\claude_desktop_config.json
    • macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
  2. Add the maker server to the mcpServers block:

{
  "mcpServers": {
    "maker": {
      "command": "npx",
      "args": ["-y", "@igniteui/maker-mcp", "--stdio"],
      "env": {
        "Executor__AIProviderKeys__OpenAI": "<your-openai-key>"
      }
    }
  }
}
  3. Restart Claude Desktop. The first startup downloads the binary for your platform (~30 s on a typical connection).

VS Code (GitHub Copilot)

Add to your .vscode/mcp.json (workspace) or user MCP settings:

{
  "servers": {
    "maker": {
      "command": "npx",
      "args": ["-y", "@igniteui/maker-mcp", "--stdio"],
      "env": {
        "Executor__AIProviderKeys__OpenAI": "<your-openai-key>"
      }
    }
  }
}

Switch Copilot Chat to Agent mode and click the 🔧 tools icon to confirm maker_plan, maker_execute, and maker_plan_and_execute are listed.

Other MCP Clients

Any client that supports the MCP STDIO transport can use:

npx -y @igniteui/maker-mcp --stdio

Pass API keys as environment variables (see Configuration below).

Configuration

All configuration is supplied through environment variables. The naming convention uses double-underscore (__) as the section separator.

AI Provider Keys

| Environment Variable | Description |
| --- | --- |
| Executor__AIProviderKeys__OpenAI | OpenAI API key |
| Executor__AIProviderKeys__Anthropic | Anthropic API key |
| Executor__AIProviderKeys__Google | Google AI API key |

You only need to set keys for the providers you actually use.

Model Selection

Each of the four internal clients can target a different provider and model:

| Environment Variable | Default | Description |
| --- | --- | --- |
| Executor__Clients__Planning__Provider | OpenAI | Provider for step proposal |
| Executor__Clients__Planning__Model | gpt-5.1 | Model for step proposal |
| Executor__Clients__PlanVoting__Provider | OpenAI | Provider for plan voting |
| Executor__Clients__PlanVoting__Model | gpt-5.1 | Model for plan voting |
| Executor__Clients__Execution__Provider | OpenAI | Provider for step execution |
| Executor__Clients__Execution__Model | gpt-5.1 | Model for step execution |
| Executor__Clients__ExecutionVoting__Provider | OpenAI | Provider for execution voting |
| Executor__Clients__ExecutionVoting__Model | gpt-5.1 | Model for execution voting |

Valid Provider values: OpenAI, Anthropic, Google.

Example: mix providers to balance cost and quality:

"env": {
  "Executor__AIProviderKeys__OpenAI":    "<openai-key>",
  "Executor__AIProviderKeys__Anthropic": "<anthropic-key>",
  "Executor__Clients__Planning__Provider":        "Anthropic",
  "Executor__Clients__Planning__Model":           "claude-opus-4-5",
  "Executor__Clients__PlanVoting__Provider":      "OpenAI",
  "Executor__Clients__PlanVoting__Model":         "gpt-4.1-mini",
  "Executor__Clients__Execution__Provider":       "Anthropic",
  "Executor__Clients__Execution__Model":          "claude-opus-4-5",
  "Executor__Clients__ExecutionVoting__Provider": "OpenAI",
  "Executor__Clients__ExecutionVoting__Model":    "gpt-4.1-mini"
}

Tuning Parameters

Each MCP tool call accepts optional parameters to control execution behavior:

| Parameter | Default | Description |
| --- | --- | --- |
| batchSize | 2 | Number of steps proposed/executed per round. Higher values (5-10) mean fewer rounds and faster execution but larger prompts and less granular control. Lower values (1-2) provide more control but slower execution. |
| k | 10 | Voting consensus threshold: the margin by which one outcome must lead to win the vote. Higher values (15-20) require stronger agreement, using more tokens but increasing confidence. Lower values (3-5) enable faster decisions with less agreement. |

These parameters are specified in natural language when calling the tools (see examples below).

Available MCP Tools

| Tool | Description |
| --- | --- |
| maker_plan | Decompose a task into a validated, ordered step list |
| maker_execute | Execute a step list produced by maker_plan |
| maker_plan_and_execute | Plan and execute in one call with live progress |

maker_plan

Decomposes a task into a validated, ordered list of steps without executing them. Returns a JSON array of Step objects.

Parameters

| Name | Type | Default | Description |
| --- | --- | --- | --- |
| prompt | string | (required) | The task or goal to plan |
| batchSize | integer | 2 | Steps proposed per voting round |
| k | integer | 10 | Consensus threshold |

Returns a JSON array:

[
  {
    "task": "Analyse the existing component API surface",
    "requiredSteps": [],
    "requiresFormat": false,
    "extraContext": ""
  },
  {
    "task": "Identify breaking changes relative to the previous major version",
    "requiredSteps": [0],
    "requiresFormat": false,
    "extraContext": ""
  }
]

maker_execute

Executes a step list produced by maker_plan. Each batch is validated by the voting system before the state is advanced.

Parameters

| Name | Type | Default | Description |
| --- | --- | --- | --- |
| stepsJson | string | (required) | JSON array from maker_plan |
| prompt | string | (required) | The original task (provides context) |
| batchSize | integer | 2 | Steps executed per round |
| k | integer | 10 | Consensus threshold |

Returns the final accumulated result as a string.

maker_plan_and_execute

Convenience tool that runs both phases in a single call. Streaming progress events are sent between phases.

Parameters

| Name | Type | Default | Description |
| --- | --- | --- | --- |
| prompt | string | (required) | The task or goal |
| batchSize | integer | 2 | Steps per round (planning and execution) |
| k | integer | 10 | Consensus threshold |

Returns the final accumulated result as a string.

Usage Examples

Simple one-shot task (default settings)

Ask your MCP client (e.g. Claude):

Use maker_plan_and_execute to write a detailed comparison of REST vs GraphQL for a technical blog post.

This uses the defaults: batchSize=2 and k=10.

Inspect the plan before executing

  1. Use maker_plan to create a plan for migrating a PostgreSQL schema to a multi-tenant design.
  2. Show me the steps.
  3. Use maker_execute with those steps to carry out the migration script.

Fast execution with larger batches

For faster execution at the cost of less granular control:

Use maker_plan_and_execute with batchSize=5 and k=10 to generate unit tests for my authentication module.

High-confidence execution with stricter consensus

For critical tasks requiring maximum confidence:

Use maker_plan_and_execute with batchSize=2 and k=15 to review this smart contract for security vulnerabilities.

Low-cost exploratory task

For quick prototyping or low-stakes tasks, reduce both parameters to minimize token usage:

Use maker_plan_and_execute with batchSize=3 and k=3 to draft a project README for a Node.js CLI tool.

Caching

The native binary is cached after the first download so subsequent starts are instant.

| Platform | Default cache location |
| --- | --- |
| Windows | %LOCALAPPDATA%\maker-mcp\{version}\{rid}\ |
| macOS / Linux | ~/.cache/maker-mcp/{version}/{rid}/ |

Override with the MAKER_MCP_CACHE environment variable:

"env": {
  "MAKER_MCP_CACHE": "/opt/maker-mcp-cache"
}

To force a re-download, delete the cache directory and restart your MCP client.

Supported Platforms

| Platform | Architecture | RID |
| --- | --- | --- |
| Windows | x64 | win-x64 |
| macOS | x64 (Intel) | osx-x64 |
| macOS | arm64 (Apple Silicon) | osx-arm64 |
| Linux | x64 | linux-x64 |

Troubleshooting

macOS quarantine warning

If macOS blocks the binary after download, remove the quarantine attribute:

xattr -d com.apple.quarantine ~/.cache/maker-mcp/<version>/osx-arm64/maker-mcp

Binary not found / HTTP error on first run

The bootstrap script downloads from GitHub Releases. Make sure the version in package.json has a corresponding GitHub Release with the platform tarballs attached. Check your network/proxy settings if the download fails.

All votes reject every proposal

  • Verify your API key is valid and has sufficient quota.
  • Confirm the chosen model name is correct for the selected provider.
  • Try lowering k to 3 to make consensus easier to reach while debugging.

Unsupported platform error

Only the four RIDs listed above are supported. ARM Linux is not currently packaged. Open an issue at github.com/IgniteUI/MAKER to request additional platforms.
