Skip to content

Conversation

@HerveRiviere
Copy link
Collaborator

@HerveRiviere HerveRiviere commented Nov 4, 2025

"TopicPartitionWriters" Map is being cleared on every close call, which is correct if we consider the stop the world scenario. But in Incremental mode there can be case that only few of the partitions have been revoked and task has some partitions with it to read from, In that scenario it will be throwing NullPointerException at the following part of the code.

As a consequence the add or the removal of an instance create a high pressure on all instances as they need to recover from last commited file for all the task of the cluster

Applying unmerged commit from original project : confluentinc#712

Example of CPU usage before the patch when we add / remove one instance
image

Will update the PR with screenshot after the patch

"TopicPartitionWriters" Map is being cleared on every close call, which is correct if we consider the stop the world scenario. But in Incremental mode there can be case that only few of the partitions have been revoked and task has some partitions with it to read from, In that scenario it will be throwing NullPointerException at the following part of the code.

As a consequence the add or the removal of an instance create a high pressure on all instances as they need to recover from last commited file for all the task of the cluster

Applying unmerged commit from original project : confluentinc#712
@HerveRiviere HerveRiviere merged commit 94937b4 into criteo-fork Nov 13, 2025
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants