Skip to content

Comments

Compatibility with inference other than vllm < 0.10.2#700

Open
tdene wants to merge 2 commits intoNVIDIA-NeMo:mainfrom
tdene:tde/new_vllm_compat
Open

Compatibility with inference other than vllm < 0.10.2#700
tdene wants to merge 2 commits intoNVIDIA-NeMo:mainfrom
tdene:tde/new_vllm_compat

Conversation

@tdene
Copy link

@tdene tdene commented Feb 15, 2026

Inside vllm_model/app.py, there is a comment bloc that says:

       """
       START TODO remove this when NeMo RL upgrades to vLLM 0.10.2 support for prompt token ids
       """

The associated code always assumes that the user is using vLLM < 0.10.2. This removes that assumption, allowing NeMo Gym to work easily with vLLM >= 0.10.2 or any other inference framework.

@copy-pr-bot
Copy link

copy-pr-bot bot commented Feb 15, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@tdene tdene marked this pull request as draft February 15, 2026 21:57
@tdene tdene force-pushed the tde/new_vllm_compat branch from 9d30e72 to 94afba2 Compare February 15, 2026 23:01
@tdene tdene marked this pull request as ready for review February 15, 2026 23:01
Signed-off-by: Teodor-Dumitru Ene <tene@nvidia.com>
@tdene tdene force-pushed the tde/new_vllm_compat branch from 94afba2 to 111ef6b Compare February 15, 2026 23:08
Signed-off-by: Teodor-Dumitru Ene <tene@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant