Interval tree for managing segment metadata in memory by pirvtech · Pull Request #19138 · apache/druid

pirvtech · 2026-03-11T21:06:55Z

Segment metadata stored in memory on the Historicals is used to look up the segments that match an interval, for both querying and segment loading. Currently this lookup is a serial scan that goes through all segment metadata in ascending start-time order to find the matching segments.

This change introduces an interval tree as a more efficient way to store segment metadata in memory. It speeds up segment searches and can potentially cut search times from O(n) to O(log n).

- Core changes are in processing/src/main/java/org/apache/druid/timeline/VersionedIntervalTimeline.java
- The interval tree implementation is in processing/src/main/java/org/apache/druid/timeline/IntervalTree.java
- Documentation comments have been included in important files and sections of code, and unit tests have been added.

This has been reviewed internally and is running in a production Druid cluster.
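For readers unfamiliar with the data structure, here is a minimal sketch of the general interval tree idea. It is not the IntervalTree.java from this PR: the `Span` class is a hypothetical stand-in for a Joda `Interval`, all names are made up for illustration, and rebalancing is omitted, so the logarithmic behaviour only holds if the tree happens to stay balanced. Each node remembers the largest end time in its subtree, which lets a search skip whole subtrees that cannot overlap the query.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical stand-in for an interval: half-open [start, end) in millis. */
final class Span
{
  final long start;
  final long end;

  Span(long start, long end)
  {
    this.start = start;
    this.end = end;
  }

  boolean overlaps(Span other)
  {
    return start < other.end && other.start < end;
  }
}

/** Illustrative, unbalanced tree keyed on start and augmented with the max end per subtree. */
class IntervalTreeSketch
{
  private static final class Node
  {
    final Span span;
    long maxEnd; // largest end time found anywhere in this subtree
    Node left;
    Node right;

    Node(Span span)
    {
      this.span = span;
      this.maxEnd = span.end;
    }
  }

  private Node root;

  void add(Span span)
  {
    root = insert(root, span);
  }

  private static Node insert(Node node, Span span)
  {
    if (node == null) {
      return new Node(span);
    }
    if (span.start < node.span.start) {
      node.left = insert(node.left, span);
    } else {
      node.right = insert(node.right, span);
    }
    node.maxEnd = Math.max(node.maxEnd, span.end);
    return node;
  }

  /** Returns all stored spans overlapping the query, in ascending start order. */
  List<Span> overlapping(Span query)
  {
    List<Span> result = new ArrayList<>();
    collect(root, query, result);
    return result;
  }

  private static void collect(Node node, Span query, List<Span> out)
  {
    // Prune: nothing in this subtree ends after the query starts.
    if (node == null || node.maxEnd <= query.start) {
      return;
    }
    collect(node.left, query, out);
    if (node.span.overlaps(query)) {
      out.add(node.span);
    }
    // Every span in the right subtree starts at or after this node's start,
    // so it can only overlap if this node starts before the query ends.
    if (node.span.start < query.end) {
      collect(node.right, query, out);
    }
  }
}
```

A serial scan would test every stored interval, whereas the two pruning checks above let the search ignore subtrees that lie entirely before or after the query.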

jtuglu1 · 2026-03-11T22:09:20Z

Hi – thanks. Do you have benchmarks for this?

processing/src/main/java/org/apache/druid/timeline/IntervalTree.java

+  }
+
+  @Override
+  public T remove(Object key)


pirvtech · 2026-03-11T23:03:15Z

> Hi – thanks. Do you have benchmarks for this?

I will add the benchmarks.

kfaraz

Thanks for putting this together, @pirvtech!
I have often felt the need for such a data structure too, even outside the VIT (VersionedIntervalTimeline).

Thoughts on perf

I doubt that a single query is really going to benefit from this change, since doing a contains() or an overlaps() check on, say, 25k intervals (which is a fairly large number of intervals for a typical Druid cluster) would not be very compute-intensive.
- You can think of 25k intervals as roughly 3 years worth of HOUR-granularity data.
But in high concurrency, this would still be beneficial since the VIT does all of its computations inside a giant lock. The shorter we hold the lock, the better.
Either way, we should add benchmarks as @jtuglu1 suggested for queries as well as VIT itself.

Notes on the implementation

I have left some inline comments.

Along with that, the approach would be much cleaner if you do something like this instead:

- Add an IntervalNavigableMap<T> interface which extends NavigableMap<Interval, T>. This interface should have the following new methods:
  - entriesContaining(Interval interval)
  - entriesOverlapping(Interval interval)
  - entriesMatching(Interval interval) (Is this really needed?)
  - Alternatively, instead of methods that return a sub map, you could have methods that return a matching entry set.
- Instead of HashMap and NavigableMap, the VIT class should use this new interface.
- Add a class IntervalTreeMap<T> which extends TreeMap<Interval, T>, implements IntervalNavigableMap<T>, and provides default implementations for the new methods.
  - For example, for the entriesContaining() method, we return the whole map.
- Add a new class FastSearchIntervalMap<T> which performs the optimised search.
- Based on the value of the fastSearch flag passed to the constructor of VIT, choose the implementation of the map.

This would ensure that there are minimal changes to VIT and we can easily swap out different map implementations.
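The shape of the suggestion above could look roughly like the following sketch. It is only one reading of the proposal: `SimpleInterval` is a hypothetical stand-in for Joda's `Interval`, the sub-map return type is one of the two options mentioned, `entriesMatching` and `FastSearchIntervalMap` are left out, and the start-then-end ordering is an assumption.

```java
import java.util.Comparator;
import java.util.NavigableMap;
import java.util.TreeMap;

/** Hypothetical stand-in for an interval: half-open [start, end) in millis. */
final class SimpleInterval
{
  final long start;
  final long end;

  SimpleInterval(long start, long end)
  {
    this.start = start;
    this.end = end;
  }
}

/** A NavigableMap keyed by intervals, with interval-aware lookups added. */
interface IntervalNavigableMap<T> extends NavigableMap<SimpleInterval, T>
{
  /** Candidate entries whose key may contain the given interval. */
  NavigableMap<SimpleInterval, T> entriesContaining(SimpleInterval interval);

  /** Candidate entries whose key may overlap the given interval. */
  NavigableMap<SimpleInterval, T> entriesOverlapping(SimpleInterval interval);
}

/** Baseline implementation: no narrowing, the caller filters the whole map. */
class IntervalTreeMap<T> extends TreeMap<SimpleInterval, T> implements IntervalNavigableMap<T>
{
  // Assumed ordering: by start, then by end.
  static final Comparator<SimpleInterval> BY_START_THEN_END =
      Comparator.comparingLong((SimpleInterval i) -> i.start)
                .thenComparingLong(i -> i.end);

  IntervalTreeMap()
  {
    super(BY_START_THEN_END);
  }

  @Override
  public NavigableMap<SimpleInterval, T> entriesContaining(SimpleInterval interval)
  {
    return this; // default behaviour: return the whole map
  }

  @Override
  public NavigableMap<SimpleInterval, T> entriesOverlapping(SimpleInterval interval)
  {
    return this; // default behaviour: return the whole map
  }
}
```

With this split, the timeline would only depend on the interface, and an optimised implementation could return a narrower candidate set from the same two methods without any change to the calling code.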

kfaraz · 2026-03-12T06:22:36Z

processing/src/main/java/org/apache/druid/java/util/common/guava/Comparators.java

    }
  };

+  private static final Comparator<Interval> INTERVAL_BY_START = new Comparator<>()


Can't we just reuse the existing comparators INTERVAL_BY_START_THEN_END and INTERVAL_BY_END_THEN_START?

kfaraz · 2026-03-12T06:36:58Z

processing/src/main/java/org/apache/druid/timeline/VersionedIntervalTimeline.java

+  private static IntervalTreeMatchMode intervalTreeMatchMode = IntervalTreeMatchMode.NONE;
+
+  static {
+    String mode = System.getProperty("experimental.timeline.intervalTreeMatchMode");


Please don't use system properties here directly. Instead, pass a flag into the constructor of VersionedIntervalTimeline indicating whether the improved data structures should be used or not.

The property name should be more like druid.segment.timeline.fastIntervalSearch, with possible values true and false. (I don't think we need 3 modes.)

The class creating the timeline should pass in the correct value for the flag.
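A minimal sketch of what this request amounts to, under loud assumptions: `TimelineSketch` and `TimelineFactorySketch` are hypothetical names, and `java.util.Properties` is only a stand-in for however the creating class actually obtains its configuration in Druid. The point is that the timeline itself never reads global state; it just receives a boolean.

```java
import java.util.Properties;

/** Hypothetical stand-in for the timeline: the flag arrives through the constructor. */
class TimelineSketch
{
  private final boolean fastIntervalSearch;

  TimelineSketch(boolean fastIntervalSearch)
  {
    // A real timeline would pick its map implementation here based on the flag.
    this.fastIntervalSearch = fastIntervalSearch;
  }

  boolean usesFastIntervalSearch()
  {
    return fastIntervalSearch;
  }
}

/** Hypothetical creator: resolves the property once and passes the value down. */
class TimelineFactorySketch
{
  // Property name as suggested in the review comment.
  static final String FAST_SEARCH_PROPERTY = "druid.segment.timeline.fastIntervalSearch";

  static TimelineSketch create(Properties config)
  {
    // Defaults to false when the property is absent.
    boolean fast = Boolean.parseBoolean(config.getProperty(FAST_SEARCH_PROPERTY, "false"));
    return new TimelineSketch(fast);
  }
}
```

Passing the flag explicitly also makes both code paths easy to exercise in unit tests, since a test can construct the timeline with either value directly.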

kfaraz · 2026-03-12T06:37:16Z

processing/src/main/java/org/apache/druid/timeline/VersionedIntervalTimeline.java

+    }
+  }
+
+  {


Please move this field initialization inside the constructor.

kfaraz · 2026-03-12T06:38:58Z

processing/src/main/java/org/apache/druid/timeline/VersionedIntervalTimeline.java

+  private enum IntervalTreeMatchMode
+  {
+    NONE,
+    ENTRIES_ONLY(Capability.ENTRIES),


Is there a specific case when we would want to support improved search on entries only and not queries?
I think just 2 modes should suffice: ALL or NONE

pirvtech and others added 20 commits March 11, 2026 11:04

Speed up searching of partitions within the in memory data source state

9cefd2e

Narrowing search space for an interval when an exact match is not found

143f095

Implemented an optimized data structure to find intervals encompassin…

d38750e

…g a given interval. Using it for finding segments loaded from cache.

Added rebalancing

ecb9f82

Added imbalance threshold to control when to trigger rebalancing

6789646

Added rebalance and data content checks

b55c9e2

Updated doc

54b0a4e

Cleaned up some comments and names

982b238

Overwriting value if there is an exact interval match with add

99d6628

Addressing review comments

ac4b8e0

Added feature flag to control use of interval tree for matching segments

42cafff

Using a single interval field for storing the min to max range for a …

de9138b

…node

Generified the match function, so it can be used with different types…

ee4dcab

… of matches

Addressed review comments

236c0e0

Removed commented code

e63e660

Added addition documentation

0badf8c

Updated doc

e076a74

Removed commented code

af17043

Cast IntervalTree as a NavigableMap so it can become a drop in replac…

ef483c4

…ement for...

Using both start and end dates of the interval during comparision whe…

b3c6791

…n finding lower and higher entries

github-advanced-security bot found potential problems Mar 11, 2026

kfaraz reviewed Mar 12, 2026

pirvtech wants to merge 20 commits into apache:master from RivianVW-tech:segment-interval-tree

4 participants
