perf(fts): use block max for or tail bounds by BubbleCal · Pull Request #7435 · lance-format/lance

BubbleCal · 2026-06-24T03:39:13Z

Performance Improvement

What is the performance issue or bottleneck?

FTS OR WAND kept lead postings that were moved back into tail using the posting list's full approximate upper bound. That made the tail bound too loose inside the active block-max window and reduced the pruning benefit available from existing per-block max-score metadata.

How does this PR improve performance?

When OR WAND moves a lead posting back into tail, it now uses the current block max as the tail upper bound if the active block window still covers the next target. If the window is expired or not applicable, it falls back to the posting list's approximate upper bound so later high-score blocks remain reachable.

The PR also advances a headless OR tail to the next compressed block window when needed, while avoiding no-progress advancement for plain postings.

This intentionally does not implement impacts skip / ImpactsDISI, shared-floor import, MaxScoreCache, or score-only WAND fast paths.

Benchmark context

Ablation comparison between main-style tail upper bounds and this OR tail block-max optimization:

Case	Main-style tail bound QPS	This optimization QPS	Lift from this optimization
match_len3_k10	324.7	475.6	+46.5%
match_len3_k100	218.0	324.4	+48.8%
phrase_len3_stop1_k10	629.6	631.5	+0.3%
phrase_len3_stop1_k100	425.6	415.8	-2.3%

Validation

All Cargo commands were run with an isolated target directory.

cargo fmt --all
git diff --check
cargo test -p lance-index scalar::inverted::wand::tests::test_or_ -- --nocapture — 4 passed
cargo test -p lance-index scalar::inverted::wand::tests -- --nocapture — 34 passed
cargo check -p lance-index --tests
cargo check --workspace --tests --benches
cargo clippy --all --tests --benches -- -D warnings

codecov · 2026-06-24T04:20:52Z

Codecov Report

❌ Patch coverage is 99.32432% with 1 line in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
rust/lance-index/src/scalar/inverted/wand.rs	99.32%	1 Missing ⚠️

📢 Thoughts on this report? Let us know!

Xuanwo · 2026-06-24T04:31:05Z

+        // A low-scoring tail can be the only iterator left in the current
+        // window. Move to the next window so a later high-scoring block is still
+        // reachable instead of ending the disjunction early.
+        self.update_max_scores(up_to + 1);


When this path advances a headless tail window, update_max_scores() derives the next up_to from tail.peek() in the no-head/no-lead case. The tail heap is ordered by upper bound, so if the top tail is already on its final block while a lower-bound tail still has later compressed blocks, up_to becomes TERMINATED_DOC_ID and the OR path stops before visiting those later blocks, dropping valid matches.

perf(fts): use block max for or tail bounds

7d01935

github-actions Bot added A-index Vector index, linalg, tokenizer performance labels Jun 24, 2026

BubbleCal marked this pull request as ready for review June 24, 2026 04:10

Xuanwo reviewed Jun 24, 2026

View reviewed changes

fix(fts): preserve tail range max across wand windows

0a3d349

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf(fts): use block max for or tail bounds#7435

perf(fts): use block max for or tail bounds#7435
BubbleCal wants to merge 2 commits into
mainfrom
yang/fts-or-tail-blockmax

BubbleCal commented Jun 24, 2026 •

edited

Loading

Uh oh!

codecov Bot commented Jun 24, 2026 •

edited

Loading

Uh oh!

Xuanwo Jun 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

BubbleCal commented Jun 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Performance Improvement

What is the performance issue or bottleneck?

How does this PR improve performance?

Benchmark context

Validation

Uh oh!

codecov Bot commented Jun 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Xuanwo Jun 24, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

BubbleCal commented Jun 24, 2026 •

edited

Loading

codecov Bot commented Jun 24, 2026 •

edited

Loading