Conversation

@yaochengji
Collaborator

@yaochengji yaochengji commented Oct 15, 2025

Purpose

Move the legacy torch_xla requirement out of tpu.txt to avoid conflicts with the new tpu-inference backend.

@yaochengji yaochengji requested a review from simon-mo October 15, 2025 19:30
@mergify

mergify bot commented Oct 15, 2025

Documentation preview: https://vllm--26938.org.readthedocs.build/en/26938/

@mergify mergify bot added the documentation, ci/build, and tpu labels Oct 15, 2025
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request moves the torch_xla dependency from requirements/tpu.txt to the installation documentation to avoid conflicts with the new tpu-inference backend. While this change is correct for manual source builds, it introduces a critical issue in the Docker build process for TPUs. The docker/Dockerfile.tpu relies on requirements/tpu.txt and will no longer install torch_xla, breaking the legacy TPU backend fallback. I've left a critical comment with a suggested fix for the Dockerfile.
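
To make the failure mode concrete, here is a minimal sketch of a fail-fast check that an image or entrypoint could run before selecting the legacy backend. The helper names `is_installed` and `describe_backend_dependency` are hypothetical and are not part of vLLM; only `importlib.util.find_spec` from the Python standard library is assumed.

```python
import importlib.util


def is_installed(module_name: str) -> bool:
    """Return True if a top-level module can be located without importing it."""
    return importlib.util.find_spec(module_name) is not None


def describe_backend_dependency(module_name: str = "torch_xla") -> str:
    """Summarize whether the module the legacy backend needs is present.

    Hypothetical helper for illustration; the message text is an assumption.
    """
    if is_installed(module_name):
        return f"{module_name} is available"
    return f"{module_name} is missing; install it separately before using the legacy backend"


if __name__ == "__main__":
    print(describe_backend_dependency())
```

Run inside a freshly built image, a check like this would surface the missing package at build or startup time with a clear message, rather than as an import error deep in the runtime.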

@chatgpt-codex-connector chatgpt-codex-connector bot left a comment

💡 Codex Review

Here are some automated review suggestions for this pull request.

Comment on lines 12 to 14
setuptools==78.1.0
nixl==0.3.0
tpu_info==0.4.0

P1: Keep torch_xla installation in TPU image build

Removing torch_xla[tpu, pallas]==2.8.0 from the TPU requirements means the Docker build that uses this file (docker/Dockerfile.tpu) will now uninstall torch_xla and never reinstall it. The Dockerfile still runs pip uninstall -y torch torch_xla torchvision before pip install -r requirements/tpu.txt, so the resulting image no longer contains torch_xla and any TPU runtime code that imports it will fail at startup. Consider either leaving torch_xla in this requirements file or explicitly installing it again inside the Dockerfile.
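
As an illustration of the point above, the following sketch scans requirement lines for a given package, which is one way a CI step could notice that a file the Docker build depends on no longer pins `torch_xla`. The function names are hypothetical, the parsing is deliberately simplified (it is not a full requirements-file parser), and the sample lines are the ones quoted in this review.

```python
import re


def normalize(name: str) -> str:
    """Normalize a project name so that '-', '_' and '.' compare as equal."""
    return re.sub(r"[-_.]+", "-", name).lower()


def find_requirement(lines, package):
    """Return the first requirement line that names `package`, or None.

    Simplified: skips blank lines, comments, and option lines such as
    '-r other.txt' or '--find-links ...'.
    """
    wanted = normalize(package)
    for raw in lines:
        line = raw.split("#", 1)[0].strip()
        if not line or line.startswith("-"):
            continue
        match = re.match(r"[A-Za-z0-9][A-Za-z0-9._-]*", line)
        if match and normalize(match.group(0)) == wanted:
            return line
    return None


# Tail of the requirements file as quoted in the review, after the change.
after_change = ["setuptools==78.1.0", "nixl==0.3.0", "tpu_info==0.4.0"]
# The same tail with the removed pin restored, for comparison.
before_change = after_change + ["torch_xla[tpu, pallas]==2.8.0"]

print(find_requirement(before_change, "torch_xla"))
print(find_requirement(after_change, "torch_xla"))
```

With the pin removed, the second lookup finds nothing, so a build that uninstalls `torch_xla` and then installs only from this list has no step that puts it back.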

@mgoin
Member

mgoin commented Oct 15, 2025

@yaochengji as the bot says, will this affect the dockerfile and should we update that too?

Signed-off-by: Chengji Yao <[email protected]>
Signed-off-by: Chengji Yao <[email protected]>
@yaochengji
Collaborator Author

> @yaochengji as the bot says, will this affect the dockerfile and should we update that too?

@mgoin thanks for the reminder! Yes, we need to fix the Dockerfile so that the torch_xla backend still builds correctly.

@yaochengji yaochengji marked this pull request as draft October 22, 2025 03:06


2 participants