[Bugfix] V1 Fix the cursor leakage issue during request scheduling. #21173

CLFutureX · 2025-07-18T09:32:21Z

Background: While the scheduler iterates over the running requests, a preemption can remove a request that sits before the current cursor req_index. The later entries then shift position, so subsequent requests are missed during scheduling.

Solution: When the preempted request is determined to be before the current cursor req_index, adjust the cursor to move forward to avoid missing requests.
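To make the failure mode concrete, here is a minimal, self-contained sketch of the pattern described above. It is not the code from vllm/v1/core/sched/scheduler.py: the function walk_running, the preempt_when mapping, and the adjust_cursor flag are hypothetical names invented for this illustration, and the sketch assumes that "adjusting the cursor" means compensating for the one-position shift caused by removing an earlier list entry.

```python
def walk_running(running, preempt_when, adjust_cursor):
    """Toy model of iterating a running list by index while preempting.

    running:       list of request names (stand-ins for real requests)
    preempt_when:  hypothetical map {current request -> victim to preempt}
    adjust_cursor: if True, compensate the cursor when the victim sat
                   before it (the idea described in this PR)
    """
    running = list(running)  # work on a copy
    scheduled = []
    req_index = 0
    while req_index < len(running):
        request = running[req_index]
        victim = preempt_when.get(request)
        if victim is not None and victim in running:
            victim_index = running.index(victim)
            running.pop(victim_index)
            if victim in scheduled:
                scheduled.remove(victim)
            if adjust_cursor and victim_index < req_index:
                # Entries after the victim moved one slot to the left,
                # so the cursor has to follow them.
                req_index -= 1
            if victim == request:
                # The current request itself was preempted; the cursor
                # already points at the next entry.
                continue
        scheduled.append(request)
        req_index += 1
    return scheduled


# While handling "c", request "a" (index 0, before the cursor) is preempted.
print(walk_running(["a", "b", "c", "d"], {"c": "a"}, adjust_cursor=False))
# ['b', 'c']       -> "d" is never visited
print(walk_running(["a", "b", "c", "d"], {"c": "a"}, adjust_cursor=True))
# ['b', 'c', 'd']  -> "d" is visited
```

Without the adjustment, the cursor ends up one past the shifted list and the loop exits early, which matches the missed-request symptom in the background above.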

Signed-off-by: CLFutureX <[email protected]>

gemini-code-assist

Code Review

The pull request aims to fix a bug in the scheduler where requests could be missed during preemption. The proposed logic correctly identifies the condition for adjusting the loop cursor. However, the implementation contains a critical bug that will cause a TypeError at runtime when trying to find the index of the preempted request. I've provided a suggestion to fix this issue.

vllm/v1/core/sched/scheduler.py

github-actions · 2025-07-18T10:06:10Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small, essential subset of CI tests to catch errors quickly. You can run other CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

njhill

Thanks @CLFutureX, good catch!

vllm/v1/core/sched/scheduler.py

njhill

cc @WoosukKwon to confirm

njhill · 2025-07-21T17:21:33Z

@CLFutureX would you mind rebasing on latest main? I think the test failures are transient things that should now be fixed.

Signed-off-by: CLFutureX <[email protected]>

WoosukKwon

Thanks for the PR! Can we add a test about this?

CLFutureX · 2025-07-23T03:26:38Z

> @CLFutureX I think the rebase wasn't done quite right - the PR shouldn't have all of these commits

Yes, I'm sorry. I accidentally rebased the code onto the wrong branch yesterday, but I've since reverted it and adjusted the branch properly.

[Bugfix] fix the cursor leakage issue.

b60dc81

Signed-off-by: CLFutureX <[email protected]>

CLFutureX requested review from WoosukKwon, robertgshaw2-redhat, njhill, ywang96, comaniac and alexm-redhat as code owners July 18, 2025 09:32

mergify bot added the v1 label Jul 18, 2025

gemini-code-assist bot reviewed Jul 18, 2025

njhill reviewed Jul 18, 2025

njhill approved these changes Jul 18, 2025

CLFutureX force-pushed the fix_running_index branch from 2100ff3 to a352810 July 20, 2025 09:01

njhill added the ready label (ONLY add when PR is ready to merge/full CI is needed) Jul 21, 2025

CLFutureX requested review from hmellor, jeejeelee, mgoin, KuntaiDu, DarkLight1337, tlrmchlsmth, simon-mo, youkaichao, houseroad and aarnphm as code owners July 22, 2025 02:07

mergify bot added the documentation, ci/build, frontend, llama and multi-modality labels Jul 22, 2025

mergify bot removed the tpu label Jul 23, 2025

CLFutureX force-pushed the fix_running_index branch from 1fc2ec2 to b60dc81 July 23, 2025 02:59

CLFutureX added 3 commits July 23, 2025 11:03

[Bugfix] fix the cursor leakage issue.

0d20531

Signed-off-by: CLFutureX <[email protected]>

code optimizer

c22dca4

Signed-off-by: CLFutureX <[email protected]>

Signed-off-by: CLFutureX [email protected]

ecf142b

WoosukKwon approved these changes Jul 23, 2025

DarkLight1337 requested review from alexm-redhat and removed request for tlrmchlsmth, mgoin, KuntaiDu, jeejeelee, hmellor, simon-mo, youkaichao, aarnphm, houseroad, alexm-redhat and DarkLight1337 July 23, 2025 08:25

njhill added the bug label Jul 23, 2025
