fix: adjust diskNormalized strategy to scale cost exponentially with disk utilization by jtuglu1 · Pull Request #19422 · apache/druid

jtuglu1 · 2026-05-06T19:22:15Z

Description

The existing linear penalization factor is still ineffective in large skew scenarios where the CostBalancerStrategy's cost forces a move/load (even with the utilization-based penalty). This switches the penalty to scale exponentially with the disk utilization, ensuring that near-full historicals are penalized.

Release note

Adjust diskNormalized strategy to scale cost exponentially with disk utilization

This PR has:

…sed on disk utilization

FrankChen021

Severity	Findings
P0	0
P1	1
P2	1
P3	0
Total	2

This is an automated review by Codex GPT-5

FrankChen021 · 2026-05-07T12:01:44Z

-    double normalizedCost = cost * usageRatio;
+    final double usageRatio = (double) server.getSizeUsed() / maxSize;
+    final double headroom = Math.max(EPSILON, 1.0 - usageRatio);
+    double normalizedCost = cost / headroom;


[P1] Existing threshold test now fails

Changing normalization to cost / headroom makes the existing testThresholdBlocksMarginalMove scenario choose DEST: source is roughly 38K / 0.20 * 0.95 = 180.5K, while dest is 40K / 0.26 = 153.8K. The test still asserts null, so the server test suite should fail unless the threshold scenario or algorithm is adjusted.

FrankChen021 · 2026-05-07T12:01:44Z

 * A {@link BalancerStrategy} which normalizes the cost of placing a segment on a
- * server as calculated by {@link CostBalancerStrategy} by multiplying it by the
- * server's disk usage ratio.
+ * server as calculated by {@link CostBalancerStrategy} by dividing by the


[P2] Public docs still describe the old formula

The implementation and Javadoc now divide by available headroom, but docs/design/coordinator.md and docs/configuration/index.md still say diskNormalized multiplies cost by diskUsed / maxSize. That leaves user-facing behavior documentation incorrect for this config option.

gianm

Have you tried this out in a real cluster? I wonder if we should have a property to restore the old behavior in case something undesirable happens.

I also struggle with the naming. The term "normalized" makes me think that costs will be scaled somehow into some bounded range. The adjusted cost function in the PR is doing something different from normalization, it's blowing up the cost as servers fill up.

I wonder if this could all be cleaner by having the cost adjustments be implemented as options to the cost balancer rather than entirely distinct balancer strategies. What are your thoughts on this?

Btw, the docs would need to be updated with this formula change, since the "multiplies the cost of placing a segment on a server by the server's disk usage ratio (diskUsed / maxSize)" is no longer what happens.

jtuglu1 force-pushed the fix-disk-normalized-strategy branch from b3521f9 to 9064f7e Compare May 6, 2026 19:22

fix: adjust diskNormalized strategy to exponentially penalize cost ba…

fe0d0d3

…sed on disk utilization

jtuglu1 force-pushed the fix-disk-normalized-strategy branch from 9064f7e to fe0d0d3 Compare May 6, 2026 19:28

FrankChen021 reviewed May 7, 2026

View reviewed changes

gianm reviewed May 18, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: adjust diskNormalized strategy to scale cost exponentially with disk utilization#19422

fix: adjust diskNormalized strategy to scale cost exponentially with disk utilization#19422
jtuglu1 wants to merge 1 commit into
apache:masterfrom
jtuglu1:fix-disk-normalized-strategy

jtuglu1 commented May 6, 2026 •

edited

Loading

Uh oh!

FrankChen021 left a comment

Uh oh!

FrankChen021 May 7, 2026

Uh oh!

FrankChen021 May 7, 2026

Uh oh!

gianm left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

jtuglu1 commented May 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Release note

Uh oh!

FrankChen021 left a comment

Choose a reason for hiding this comment

Uh oh!

FrankChen021 May 7, 2026

Choose a reason for hiding this comment

Uh oh!

FrankChen021 May 7, 2026

Choose a reason for hiding this comment

Uh oh!

gianm left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jtuglu1 commented May 6, 2026 •

edited

Loading