Added Conversation Compression System #1343
Conversation
tim-inkeep commented on Dec 19, 2025
- New compression services: Add ConversationCompressor and update conversation history handling to use getConversationHistoryWithCompression() (see the sketch after this list)
- Code organization: Extract generation logic into helper methods (setupGenerationContext, loadToolsAndPrompts, buildConversationHistory, etc.) to reduce complexity in main generateResponse method
- JSON post-processing: Add withJsonPostProcessing utility for better response handling
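As a rough illustration of the first bullet, here is a hypothetical call site for the new compression-aware history helper. Its real signature is not shown in this description, so the options object below is an assumption modeled on the parameters passed to compressConversationIfNeeded later in this PR.

```typescript
// Hypothetical call site; the option names mirror parameters used elsewhere
// in this PR and are not a confirmed signature.
const history = await getConversationHistoryWithCompression({
  conversationId,
  tenantId,
  projectId,
  summarizerModel,
});
```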
🦋 Changeset detected. Latest commit: 9e5ecb5. The changes in this PR will be included in the next version bump. This PR includes changesets to release 9 packages.
Claude finished @tim-inkeep's task

Code Review: Conversation Compression System

Having conducted a comprehensive analysis of this conversation compression system PR, I'll provide feedback focusing on the most critical architectural and engineering considerations.
Architecture & System Design Review

✅ Strong Architectural Decisions

1. Inheritance-Based Compression Hierarchy (sketched below)
2. Clean Integration with Existing Message Flow
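To make item 1 concrete, here is a minimal sketch of what the hierarchy might look like, reconstructed only from the identifiers visible in this review (BaseCompressor, ConversationCompressor, processedToolCalls, cumulativeSummary, estimateTokens). The placeholder types and the method body are assumptions, not the PR's actual code.

```typescript
// Sketch only: placeholder types stand in for the PR's real interfaces.
type ConversationSummary = { text: string };
type CompressionResult = { messages: unknown[]; summary: ConversationSummary | null };

abstract class BaseCompressor {
  protected processedToolCalls = new Set<string>();
  protected cumulativeSummary: ConversationSummary | null = null;

  // Rough character-based token estimate, as in the snippet reviewed below.
  protected estimateTokens(content: unknown): number {
    const text = typeof content === 'string' ? content : JSON.stringify(content);
    return Math.ceil(text.length / 4);
  }

  abstract compress(messages: unknown[]): Promise<CompressionResult>;
}

class ConversationCompressor extends BaseCompressor {
  async compress(messages: unknown[]): Promise<CompressionResult> {
    // A real implementation would summarize older turns and persist tool
    // results as artifacts; this sketch just passes the history through.
    return { messages, summary: this.cumulativeSummary };
  }
}
```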
🚨 Critical Areas for Improvement

1. Memory Management & Potential Memory Leaks

```typescript
// BaseCompressor.ts:36-37
protected processedToolCalls = new Set<string>();
protected cumulativeSummary: ConversationSummary | null = null;
```

Issue: These class-level sets and objects will grow indefinitely during long-running conversations. There's no cleanup mechanism.

Recommendation: Implement periodic cleanup based on message age or a TTL mechanism:
```typescript
// Add to BaseCompressor
private cleanup() {
  // Cap how many processed tool-call IDs are retained; a simple stand-in for a
  // TTL so the set cannot grow without bound during long conversations.
  this.processedToolCalls = new Set(Array.from(this.processedToolCalls).slice(-100));
}
```

2. Race Conditions in Compression Logic

```typescript
// conversations.ts:500-506
const newMessagesCompressed = await compressConversationIfNeeded(messagesAfterCompression, {
  conversationId, tenantId, projectId, summarizerModel, streamRequestId,
});
```

Issue: Multiple concurrent requests could trigger compression simultaneously for the same conversation, potentially creating duplicate compression summaries.

Recommendation: Implement conversation-level locking or atomic compression checks using database constraints.
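One possible shape for that guard, within a single process, is a per-conversation mutex. This is only a sketch with a hypothetical helper name; it does not protect across multiple server instances, where a database-level constraint or advisory lock would still be needed.

```typescript
// Hypothetical helper: serialize compression runs per conversation in-process.
const compressionLocks = new Map<string, Promise<unknown>>();

async function withConversationLock<T>(
  conversationId: string,
  task: () => Promise<T>
): Promise<T> {
  // Chain onto any compression already in flight for this conversation.
  const previous = compressionLocks.get(conversationId) ?? Promise.resolve();
  const run = previous.catch(() => undefined).then(task);
  compressionLocks.set(conversationId, run);
  try {
    return await run;
  } finally {
    // Only clear the entry if no newer run has replaced it.
    if (compressionLocks.get(conversationId) === run) {
      compressionLocks.delete(conversationId);
    }
  }
}
```

The call shown above could then be wrapped as `withConversationLock(conversationId, () => compressConversationIfNeeded(messagesAfterCompression, { ... }))`.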
3. Token Estimation Accuracy

```typescript
// BaseCompressor.ts:58-62
protected estimateTokens(content: any): number {
  const text = typeof content === 'string' ? content : JSON.stringify(content);
  return Math.ceil(text.length / 4);
}
```

Issue: The 4:1 character-to-token ratio is a rough approximation that can be significantly inaccurate, especially for code, JSON, or multilingual content. This could lead to under/over-compression.

Recommendation: Use the actual model's tokenizer when available, or implement model-specific estimation:

```typescript
// Consider using tiktoken for OpenAI models or similar libraries for other providers
protected estimateTokens(content: any, model?: string): number {
  const text = typeof content === 'string' ? content : JSON.stringify(content);
  // Use the actual tokenizer when available
  if (this.baseModel?.provider === 'openai' && tiktoken) {
    return tiktoken.encode(text).length;
  }
  // Fall back to model-specific ratios
  return this.getModelSpecificEstimate(text, model);
}
```

4. Error Boundary & Fallback Handling

Recommendation: Add fallback mechanisms:

```typescript
async compress(messages: any[]): Promise<CompressionResult> {
  try {
    return await this.performCompression(messages);
  } catch (error) {
    logger.error({ error }, 'Compression failed, falling back to simple truncation');
    return this.fallbackToSimpleCompression(messages);
  }
}
```
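The fallbackToSimpleCompression referenced above is not part of this PR. One minimal sketch: drop the oldest messages until the remainder fits a rough token budget. The maxContextTokens field and the exact CompressionResult shape are assumptions here.

```typescript
// Sketch: keep the newest messages that fit under a rough token budget.
private fallbackToSimpleCompression(messages: any[]): CompressionResult {
  const budget = this.maxContextTokens ?? 8000; // assumed field and default
  const kept: any[] = [];
  let used = 0;
  for (const message of [...messages].reverse()) {
    const cost = this.estimateTokens(message);
    if (used + cost > budget) break;
    kept.unshift(message);
    used += cost;
  }
  return { messages: kept, summary: this.cumulativeSummary } as CompressionResult;
}
```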
🏗️ Code Organization & Maintainability

1. Agent.ts Refactoring Quality

Recommendation: Consider further extraction:

```typescript
// Extract phase execution into separate methods
private async executePhase1(config: Phase1Config): Promise<GenerationResult>
private async executePhase2(config: Phase2Config): Promise<StructuredResult>
```

2. Tool Result Artifact Creation Logic (BaseCompressor.ts:112-325, saveToolResultsAsArtifacts)

This method is doing too many things: format conversion, filtering, deduplication, and artifact creation. It's 200+ lines and violates the single responsibility principle.

Recommendation: Break into smaller, focused methods:

```typescript
private async processMessage(message: any): Promise<ToolCallArtifact[]>
private async createArtifactForToolCall(toolCall: ToolCall): Promise<string>
private shouldSkipToolCall(toolName: string): boolean
```
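To show how those pieces could fit back together, here is a rough orchestration of the existing method in terms of the suggested helpers; the return type is an assumption and the helper bodies are elided.

```typescript
// Sketch: the top-level method becomes a thin loop over per-message processing.
private async saveToolResultsAsArtifacts(messages: any[]): Promise<ToolCallArtifact[]> {
  const artifacts: ToolCallArtifact[] = [];
  for (const message of messages) {
    // processMessage would handle format conversion, filtering (via
    // shouldSkipToolCall), deduplication, and artifact creation
    // (via createArtifactForToolCall) for a single message.
    artifacts.push(...(await this.processMessage(message)));
  }
  return artifacts;
}
```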
🔒 Security & Data Handling

1. Tool Result Content Sanitization (BaseCompressor.ts:436-456, removeStructureHints)

While structure hints are removed, there's no validation for potentially sensitive content in tool results before compression.

Recommendation: Add content sanitization before compression:

```typescript
private sanitizeToolResult(result: any): any {
  // Remove potential secrets, API keys, etc.
  return this.recursivelyCleanSensitiveData(result);
}
```
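recursivelyCleanSensitiveData is not defined in this PR; a minimal sketch of what such a pass could look like, shown as a standalone function for brevity. The key patterns are illustrative only, not an exhaustive definition of "sensitive".

```typescript
// Hypothetical helper: recursively replace values whose keys look sensitive.
const SENSITIVE_KEY = /(api[-_]?key|secret|token|password|authorization)/i;

function recursivelyCleanSensitiveData(value: unknown): unknown {
  if (Array.isArray(value)) {
    return value.map(recursivelyCleanSensitiveData);
  }
  if (value !== null && typeof value === 'object') {
    return Object.fromEntries(
      Object.entries(value as Record<string, unknown>).map(([key, child]) =>
        SENSITIVE_KEY.test(key) ? [key, '[REDACTED]'] : [key, recursivelyCleanSensitiveData(child)]
      )
    );
  }
  return value;
}
```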
2. Artifact Metadata Consistency

⚡ Performance Considerations

1. N+1 Query Pattern

```typescript
// BaseCompressor.ts:195-199
const existingArtifacts = await getLedgerArtifacts(dbClient)({
  scopes: { tenantId: this.tenantId, projectId: this.projectId },
  toolCallId: block.toolCallId,
});
```

This creates individual database queries for each tool call, leading to N+1 performance issues.

Recommendation: Batch the artifact lookups:

```typescript
// Collect all toolCallIds first, then query in batch
const toolCallIds = extractToolCallIds(messages);
const existingArtifacts = await getLedgerArtifactsBatch(dbClient)({
  scopes: { tenantId: this.tenantId, projectId: this.projectId },
  toolCallIds,
});
```

2. Context Size Calculation Efficiency

📋 Testing & Documentation Gaps

1. Missing Edge Case Tests
2. Documentation for Model-Specific Behavior

🔄 API Contracts & Backwards Compatibility

1. Message Format Evolution

Recommendation: Add database migration for existing conversations:

```sql
-- Add migration to handle existing conversations without compression metadata
ALTER TABLE conversations ADD COLUMN compression_version INTEGER DEFAULT 1;
```

🎯 Specific Recommendations for Implementation
📊 Overall Assessment

This is a well-architected feature that addresses a real need for conversation context management. The inheritance hierarchy and integration approach are sound. However, the implementation has several production-readiness concerns around memory management, race conditions, and error handling that should be addressed before merge. The refactoring of Agent.ts improves maintainability, and the compression system provides a solid foundation for scaling to longer conversations.

Severity Level: Medium-High. The core functionality is solid, but the production reliability concerns (memory leaks, race conditions, error handling) need to be addressed to ensure system stability under load.