
feat: uv lock rule instead of genrule #2657

Open
wants to merge 16 commits into main from uv-lock-rule-instead-of-genrule
Conversation

Collaborator

@aignas aignas commented Mar 11, 2025

This change implements `uv pip compile` as a rule.

To make things easier to debug, we also provide
a runnable rule that takes the same arguments and updates
the output file in the source tree automatically.

The main design is to have a regular lock rule that
returns a custom provider containing all of the recipe
ingredients needed to construct an executable rule. Running
the debugging rule target depends on having bash, or bat
files on Windows.

There are integration tests that exercise the locker. However,
things that are untested:

  • Windows support: the current CI Windows runners do not support
    running the uv binary. Help from Windows users is needed.
  • Running the integration tests within RBE does not seem to work,
    but locking when using RBE still works - there is a native_test
    exercising this.
  • Keyring integration for pulling packages from private
    index servers; https://docs.astral.sh/uv/configuration/authentication/
    should be supported.

Work towards #1325
Work towards #1975
Related #2663

Collaborator Author

@aignas aignas left a comment

OK, after doing a self-review, I think I want to have:

  • A script for launching uv from bash that would be compatible with UNIX.
  • A script for launching uv from powershell that would be compatible with Windows (generative AI may help with initial translation here).
  • Rewrite the internals a little to drop the Python usage (or at least most of it).

@aignas aignas force-pushed the uv-lock-rule-instead-of-genrule branch from fcc17c3 to 7ddfd28 Compare March 13, 2025 13:52
@aignas aignas changed the title uv lock rule instead of genrule feat: uv lock rule instead of genrule Mar 13, 2025
@aignas aignas force-pushed the uv-lock-rule-instead-of-genrule branch from 928f1e2 to 2c29ae2 Compare March 13, 2025 15:23
@aignas aignas marked this pull request as ready for review March 13, 2025 15:23
@aignas aignas requested a review from rickeylev as a code owner March 13, 2025 15:23
@bazel-contrib bazel-contrib deleted a comment from aignas Mar 13, 2025
@rickeylev
Collaborator

The core of the implementation looks good to me (a rule that looks up toolchains to run a build action; lock_run uses the info that lock provides via a provider). I have a variety of smaller comments about some particulars, but gtg now, so I'll have to wait until, ah, probably the weekend sometime.

The maybe_out behavior is interesting/clever, I just noticed that and will take a closer look.

Collaborator

@rickeylev rickeylev left a comment

Got about halfway through.

Collaborator

@rickeylev rickeylev left a comment

Also, to clarify, this is the sort of interface I imagined:

  1. bazel build //:requirements generates a requirements file using a build action
  2. bazel run //:requirements.update generates a requirements file using a build action and copies it into the local client to update the requirements file
  3. bazel run //:requirements.run -- <args> more of a direct invocation; runs uv directly; this is for debugging or experimenting with settings without having to modify the BUILD file.

The reason for (2) to use the output of (1) is that that's where the magic of Bazel happens. By this I mean: build actions have isolated/deterministic/hermetic capabilities, can run remotely, are more amenable to having transitions applied, and are better about not producing different output per user machine.
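In BUILD-file terms, the three entry points above might look roughly like this (the load path, rule name, and attribute names here are illustrative sketches, not the final API):

```starlark
# Hypothetical usage; names are illustrative.
load("@rules_python//python/uv:lock.bzl", "lock")

lock(
    name = "requirements",       # (1) bazel build //:requirements
    srcs = ["pyproject.toml"],
    out = "requirements.txt",
)

# The rule would also define:
#   //:requirements.update  (2) bazel run target that copies the built
#                               lock file back into the source tree
#   //:requirements.run     (3) bazel run //:requirements.run -- <args>
#                               invokes uv directly, for debugging
```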

@aignas
Collaborator Author

aignas commented Mar 16, 2025

OK, addressed the comments, PTAL.

I see that Windows is failing, because something is not compatible with the CI Windows version. I wonder if this means we should just tell users that Windows may be unsupported?

@aignas aignas force-pushed the uv-lock-rule-instead-of-genrule branch 2 times, most recently from daab523 to c67cdcd Compare March 18, 2025 09:23
@aignas
Collaborator Author

aignas commented Mar 19, 2025

Open questions:

  • Currently, keyring and other authentication configuration needs to happen via the system, and this has not been fully tested. This might be relevant for #2663 (compile_pip_requirements does not use credential helper), where users need to be able to configure credential helpers for pulling packages from internal mirrors.
  • Should we use python_toolchain or a target pointing at the current Python interpreter? Passing //python:none could act as a way to indicate that Python should come from the system, i.e. python. This would mean that we could use //python/bin:interpreter, which could include dependencies used during locking (e.g. keyring and similar). Right now I am not sure how to get that wired through. Maybe we should have a few more tests that set up a mirror that needs the keyring dep to connect; otherwise it is hard not to break the behaviour. But I would like to merge this as is for now, because the PR is getting big and it is hard to keep track of everything.

@rickeylev
Collaborator

Sorry, some short-notice deadlines came up. I'll be able to have another look Wednesday evening or after.

@rickeylev
Collaborator

Should we use python_toolchain or a target to the current python interpreter?
Passing an interpreter means additional deps (keyring) can be captured

Since uv is coming from a toolchain, and we need python to match uv, I think both should come from a toolchain. A behavior unique to toolchains is that a group of toolchains can get resolved with the same config state. Getting the same behavior using labels, or a mix of a label and toolchain, might be tricky (it might be possible using exec groups?).

    return [
        DefaultInfo(
            executable = executable,
            runfiles = ctx.runfiles(transitive_files = info.srcs),
Collaborator

I think there's a subtle config mismatch here: info.srcs contains uv in exec config, but here it's going to run in target config.

I'm OK with ignoring this for now, though, to keep progress moving along.

Collaborator Author

I have added a transition, but I am still unsure how to ensure that this will be in the exec configuration.

I can follow this up with a separate PR.

Collaborator

The transition LGTM.

When a transition is applied to the rule itself, it decides what the "target" configuration is for the current target. It doesn't affect the exec config directly.

When toolchain resolution occurs, Bazel finds a toolchain that is compatible with the current target configuration. e.g. if python_version=3.12.1 is in the target configuration, then Bazel looks for a matching exec_tools toolchain with target_compatible_with=3.12. The e.g. exec_interpreter attribute will be in the exec config, but that's fine; the toolchain is claiming all its pieces are intended to produce output valid for 3.12.

HTH
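A minimal sketch of such a rule-level (incoming-edge) transition, following the mechanics described above. The flag label and the allowlist follow common Bazel conventions; the rule and attribute names are illustrative, not the PR's actual code:

```starlark
# Illustrative sketch only; not the PR's actual implementation.
def _python_version_impl(settings, attr):
    # Pin the *target* configuration's python_version for this target.
    return {"//python/config_settings:python_version": attr.python_version}

_python_version_transition = transition(
    implementation = _python_version_impl,
    inputs = [],
    outputs = ["//python/config_settings:python_version"],
)

lock = rule(
    implementation = _lock_impl,
    # An incoming-edge transition: it changes the target configuration,
    # and toolchain resolution then matches toolchains against it. Pieces
    # of the resolved toolchain sit in the exec config, which is fine.
    cfg = _python_version_transition,
    attrs = {
        "python_version": attr.string(),
        "_allowlist_function_transition": attr.label(
            default = "@bazel_tools//tools/allowlists/function_transition_allowlist",
        ),
    },
)
```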

    args.run_shell.add("--no-progress")
    args.run_shell.add("--quiet")

    ctx.actions.run_shell(
Collaborator

I'm having trouble understanding why this step needs the copy step as part of its execution.

Isn't this the same behavior?

srcs = list(ctx.attr.srcs)
if existing_file:
  srcs.append(existing_file)
output = declare_file(name + ".out")
ctx.actions.run([uv, "--output={output}"] + srcs, inputs=srcs, output=output)

uv is going to overwrite whatever --output specifies, right?

Collaborator Author

uv expects the previous output to be at the location defined by --output={output}. If you pass it as a source, you will get extra log lines referencing requirements-existing.txt, which is not what you want here.

Comment on lines +376 to +438
# FIXME @aignas 2025-03-17: should we have one more target that transitions
# the python_version to ensure that if somebody calls `bazel build
# :requirements` that it is locked with the right `python_version`?
Collaborator

A separate target, no. An attribute with rule-level cfg transition, yes.

This also enables tricks like this:

lock(python_version = "3.10", srcs = select({":py310": ["requirements_310.txt"], ...}))

Similarly, because an attr.output is not used, the outputs can be varied to e.g. include the python version (or platform, etc) into the output file name.
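Concretely, the trick described above could allow something like the following (the config_setting label and file names here are hypothetical):

```starlark
# Illustrative only: a rule-level transition pins python_version, and a
# select() picks matching inputs.
lock(
    name = "requirements_3_10",
    python_version = "3.10",
    srcs = select({
        "//python/config_settings:is_python_3.10": ["requirements_310.in"],
        "//conditions:default": ["requirements.in"],
    }),
    # Because no attr.output is used, the rule itself can name the output,
    # e.g. requirements_3_10.txt, embedding the version (or platform).
)
```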

Collaborator Author

Thanks for the nudge. I have added the transition to the lock rule and I think the overall design is now simpler.

I have also created an internal expand_template rule that makes the wiring of files easier. I have added the Python version to the file name; that might turn out to be a good or a bad decision, and we will see. :)

@aignas
Collaborator Author

aignas commented Mar 20, 2025

RBE is failing with an error:

bazel-out/k8-opt-exec-ST-150d2d5f4ddd/bin/python/private/python3: error while loading shared libraries: /b/f/w/bazel-out/k8-opt-exec-ST-150d2d5f4ddd/bin/python/private/../lib/libpython3.11.so.1.0: cannot open shared object file: No such file or directory

Not sure exactly why this is happening because I am passing interpreter.files_to_run to the action.

I think the documentation mentioning to use .files_to_run may be wrong.

EDIT: this guess was wrong. I am fiddling with the cfg for the uv_toolchain now. I think it should be exec instead of target.

EDIT2: the uv_toolchain has nothing to do with that, because the error is coming from python not finding the .so file, which to me suggests that the files are missing, but that should not be the case.

This change implements the uv pip compile as a rule.

In order to also make things easier to debug we provide
a runnable rule that has the same arguments and updates
the source tree output file automatically.

The main design is to have a regular lock rule and then
it returns a custom provider that has all of the recipe
ingredients to construct an executable rule. The execution
depends on having bash or PowerShell; however, the
PowerShell script is not yet complete and requires some
help from the community.

Work towards bazel-contrib#1975.

Address all of the comments
@aignas aignas force-pushed the uv-lock-rule-instead-of-genrule branch from f6a052f to 7a37d5d Compare March 22, 2025 12:34
@aignas
Collaborator Author

aignas commented Mar 22, 2025

It seems that the current_py_exec_toolchain had a bug that prevented me from using it in RBE. My analysis went as follows:

  1. Set up a minimal RBE on my machine as mentioned in Third party dependencies are incorrect when using RBE because host != exec #2241
  2. Add print statements to inspect what is in the sandbox:
    i. The python toolchain with all of the files was there
    ii. The current_py_exec_toolchain symlink was not a symlink and instead it was a copy.
  3. Given that the symlink was dereferenced, there are multiple ways to solve this:
    i. Use a dangling symlink, like the one pointing outside the sandbox
    that we use when a non-hermetic toolchain is selected.
    ii. Somehow symlink all of the directories so that the lib and other
    folders can be found.

I chose method 3.i because it only needs one extra symlink and does not
require exposing extra data through py_runtime. E.g. I would most likely
need to expose the contents of the bin folder, plus the lib, etc folders,
for everything to work properly.

Maybe the 3.ii solution could be also beneficial for creating #2156 and setting up
the venv inside the py_executable rules, but we can look at that later.

EDIT: it seems that I have to find yet another option, as 3.i
breaks the integration tests.

EDIT2: I wonder if I am hitting bazelbuild/bazel#23620

EDIT3: OK, so in the end the solution was to forward the runtime field from
the TARGET_TOOLCHAIN_TYPE to the EXEC_TOOLS_TOOLCHAIN_TYPE, which means that
we stop relying on the symlinks created by current_interpreter_executable. To
be honest, I am not sure it makes sense to keep it for anything other than
returning the toolchain; the interpreter symlink will not work properly
in RBE, and it is quite difficult to debug when that is the case.

@amaranthjinn

Would this feature fix the issue #2640?

@aignas
Collaborator Author

aignas commented Mar 25, 2025

Would this feature fix the issue #2640?

It would not, because the linked issue is about building wheels rather than locking them.

@rickeylev
Collaborator

python can't find its .so files when a "regular" symlink action is used ... bazel #23620

Yeah, I'm pretty sure that's what you're seeing. I haven't looked at the PR code yet, but what I recall was you can't simply ctx.actions.symlink(<output>, <underlying interpreter>) because Bazel RBE is prone to creating a copy instead of a symlink. It works locally because Python has a behavior where it will check if argv[0] is a symlink, and if so, realpath() to find the actual location of the python interpreter (and thus all the runtime's files). I think this is to support stuff like venvs, or creating a convenience symlink to the interpreter in one place while it's actually installed in another (e.g. /usr/lib/python3 is a symlink to /usr/lib/python3.10, or whatever).

Using declare_symlink, or a wrapper script, should work, though. I'll have a look at the PR now.

@rickeylev
Collaborator

Beh. Looking at current_interpreter_executable.bzl, I think changing L93 to use declare_symlink() should work?

I tried setting up a local RBE to test it, but couldn't get bazel and the RBE talking. I'll have to try again when I have more time

@aignas
Collaborator Author

aignas commented Mar 25, 2025

Beh. Looking at current_interpreter_executable.bzl, I think changing L93 to use declare_symlink() should work?

I tried setting up a local RBE to test it, but couldn't get bazel and the RBE talking. I'll have to try again when I have more time

If you use declare_symlink then you cannot use target_file; you need to use target_path. In my tests, only target_path worked. This is documented in https://bazel.build/rules/lib/builtins/actions#symlink.
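A minimal sketch contrasting the two flavours of ctx.actions.symlink discussed here (the `interpreter` attribute is a hypothetical label attribute with allow_single_file):

```starlark
# Sketch only: the two symlink flavours per the actions.symlink docs.
def _impl(ctx):
    # declare_file + target_file: RBE may materialize the "symlink" as a
    # copy, which breaks Python's realpath()-based install discovery.
    out_file = ctx.actions.declare_file(ctx.label.name)
    ctx.actions.symlink(output = out_file, target_file = ctx.file.interpreter)

    # declare_symlink + target_path: guaranteed to stay a symlink (possibly
    # dangling), but it only accepts a path string, never a File.
    out_link = ctx.actions.declare_symlink(ctx.label.name + "_link")
    ctx.actions.symlink(output = out_link, target_path = ctx.file.interpreter.path)

    return [DefaultInfo(files = depset([out_file, out_link]))]
```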

    ],
    # It seems that the CI remote executors for the RBE do not have network
    # connectivity. Is it only our setup or is it a property of RBE?
    tags = ["no-remote-exec"],
Collaborator Author

FYI this is where the RBE is disabled for our tests

@rickeylev
Collaborator

I got an RBE setup locally. Yeah, current_interpreter_executable (the thing that backs ExecTools.exec_interpreter) does indeed look entirely broken with RBE. Argh >.<

So...

  • An executable rule has to define its own output file. It can't forward on another file. Hence e.g. current_interpreter_executable has to call e.g. declare_file/symlink
  • If it uses declare_file(), it can't use symlink(), because RBE will create a copy, and Python can no longer traverse back to its actual install location.
  • If it uses declare_symlink, then symlink() has to write either a bin-relative path (the file that is a sibling of the runfiles directory), or a runfiles-relative path (the copy of the executable within the runfiles tree). The former allows e.g. ctx.actions.run(executable=...) to work; the latter allows e.g. ctx.actions.run_shell("<runfiles path to the executable>") to work.

The only way I can think of to make both work is, essentially, to make the output executable a wrapper. e.g. shell code that figures out how to locate the file it wants to run and execs it.

Or, maybe if py_runtime() is directly executable? i.e. sets executable=True, and returns DefaultInfo(executable=...), and the whole target gets forwarded on.

The reason I'm so keen on having an executable=True attribute (i.e. a thing with a FilesToRun provider) that is fed to ctx.actions.run() is that it is supposed to be the proper abstraction -- "run the interpreter, let its executable rule figure out the details". This is in comparison to e.g. having to directly use PyRuntime, then manually pass PyRuntime.files etc. to ctx.actions.run.

Well, its late now, so gotta log off.

@aignas
Collaborator Author

aignas commented Mar 25, 2025

Hmmm... since I have a question on RB slack about the requires-network, I can wait on this.

I kind of see where you are coming from - just running python should not be that hard and should not require py_runtime. I'll think about it as well; gotta go.

@rickeylev
Collaborator

I poked this some more and have some ideas, but want to explore them a bit more.

I don't want to block this PR on them, though. How about for now, revert the changes to the exec toolchain stuff. The uv rule can still get at the PyRuntime object via exec_interpreter: exec_interpreter[ToolchainInfo].py3_runtime.
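Inside the rule implementation, that access path might look roughly like this (the toolchain type label, module name, and action arguments are illustrative assumptions, not the final code):

```starlark
# Sketch: reach PyRuntime via the exec tools toolchain, per the suggestion
# above, instead of changing the toolchain internals.
def _lock_impl(ctx):
    exec_tools = ctx.toolchains["@rules_python//python:exec_tools_toolchain_type"]
    # exec_interpreter is a Target; index it with ToolchainInfo as suggested.
    runtime = exec_tools.exec_interpreter[platform_common.ToolchainInfo].py3_runtime

    out = ctx.actions.declare_file(ctx.label.name + ".txt")
    ctx.actions.run(
        executable = runtime.interpreter,
        tools = [runtime.files],  # the runtime's support files (lib/, etc.)
        arguments = ["-m", "some_locker", "--output", out.path],  # placeholder
        outputs = [out],
        mnemonic = "UvLock",
    )
    return [DefaultInfo(files = depset([out]))]
```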

Collaborator

@rickeylev rickeylev left a comment

Just remove the exec tools toolchain changes as I mentioned in the other comment, otherwise LGTM

-    return [SentinelInfo()]
+    return [
+        SentinelInfo(),
+        # Also output ToolchainInfo
Collaborator

Suggested change:
-    # Also output ToolchainInfo
+    # Also output ToolchainInfo to allow it to be used for no-op toolchains

Comment on lines +148 to +151
    progress_message = "Creating a requirements.txt with uv: //{}:{}".format(
        ctx.label.package,
        ctx.label.name,
    ),
Collaborator

nit: Use %{label} instead

Suggested change:
-    progress_message = "Creating a requirements.txt with uv: //{}:{}".format(
-        ctx.label.package,
-        ctx.label.name,
-    ),
+    progress_message = "Creating a requirements.txt with uv: %{label}",

Comment on lines +191 to +192
doc = """\
""",
Collaborator

nit: add doc or just omit the doc attribute


Comment on lines +38 to +39
# It seems that the CI remote executors for the RBE do not have network
# connectivity. Is it only our setup or is it a property of RBE?
Collaborator

It's not intrinsic to RBE, so it must be something with our RBE setup.
