[RLlib] APPO accelerate vol 02: Various enhancements. #50162

sven1977 · 2025-01-31T14:40:56Z

APPO accelerate vol 02: Various enhancements.

Remove complex LearnerGroup.update... logic with async-tags and overload of ray.wait requests.
Allow Learner actors to have GPUs AND CPUs.

Why are these changes needed?

Related issue number

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: sven1977 <[email protected]>

simonsays1980

LGTM. Some minor nits.

simonsays1980 · 2025-01-31T15:38:04Z

rllib/algorithms/impala/impala_learner.py

        # Default is to have a learner thread.
        if not hasattr(self, "_learner_thread_in_queue"):
            self._learner_thread_in_queue = deque(maxlen=self.config.learner_queue_size)

+        # Create and start the GPU loader thread(s).
+        if self.config.num_gpus_per_learner > 0:
+            self._gpu_loader_threads = [


Let's name the threads for better debugging.

simonsays1980 · 2025-01-31T15:39:43Z

rllib/algorithms/impala/impala_learner.py

+class _GPULoaderThread(threading.Thread):
+    def __init__(
+        self,
+        *,


Use the name argument here and pass it to super

simonsays1980 · 2025-01-31T15:42:35Z

rllib/algorithms/impala/impala_learner.py

@@ -166,9 +225,8 @@ def step(self):
                ma_batch_on_gpu = self._in_queue.sample()
            else:
                # Queue is empty: Sleep a tiny bit to avoid CPU-thrashing.


Usually the scheduler makes a good job in providing runtime to each process. This while loop could slow things down and in worst case block.

I do think we have to sleep here (or at least be careful with removing this logic).

The "queue" here is a deque, which doesn't have the GIL-release logic of a Queue.get(). Looping here with while True: without sleeping would certainly harm performance.

…_accelerate_02_various_enhancements

Signed-off-by: sven1977 <[email protected]>

…_accelerate_02_various_enhancements

Signed-off-by: sven1977 <[email protected]>

)

wip

3002dd4

Signed-off-by: sven1977 <[email protected]>

sven1977 requested a review from simonsays1980 as a code owner January 31, 2025 14:40

sven1977 enabled auto-merge (squash) January 31, 2025 14:42

github-actions bot added the go add ONLY when ready to merge, run all tests label Jan 31, 2025

sven1977 disabled auto-merge January 31, 2025 14:42

sven1977 assigned simonsays1980 Jan 31, 2025

sven1977 added rllib RLlib related issues rllib-system system issues, runtime env, oom, etc rllib-newstack labels Jan 31, 2025

LINT

5d23077

Signed-off-by: sven1977 <[email protected]>

simonsays1980 approved these changes Jan 31, 2025

View reviewed changes

sven1977 added 7 commits February 1, 2025 05:21

Merge branch 'master' of https://github.com/ray-project/ray into appo…

02602ec

…_accelerate_02_various_enhancements

wip

4c04ffa

Signed-off-by: sven1977 <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray into appo…

05e5adf

…_accelerate_02_various_enhancements

wip

3245569

Signed-off-by: sven1977 <[email protected]>

wip

2398215

Signed-off-by: sven1977 <[email protected]>

wip

91e1d0d

Signed-off-by: sven1977 <[email protected]>

wip

49d3213

Signed-off-by: sven1977 <[email protected]>

sven1977 requested review from maxpumperla and a team as code owners February 2, 2025 17:29

sven1977 enabled auto-merge (squash) February 2, 2025 18:59

sven1977 merged commit 8c792db into ray-project:master Feb 2, 2025
6 checks passed

sven1977 deleted the appo_accelerate_02_various_enhancements branch February 2, 2025 19:21

eddyxu pushed a commit to lancedb/ray that referenced this pull request Feb 3, 2025

[RLlib] APPO accelerate vol 02: Various enhancements. (ray-project#50162

86c9967

)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RLlib] APPO accelerate vol 02: Various enhancements. #50162

[RLlib] APPO accelerate vol 02: Various enhancements. #50162

sven1977 commented Jan 31, 2025 •

edited

Loading

simonsays1980 left a comment

simonsays1980 Jan 31, 2025

sven1977 Feb 1, 2025

simonsays1980 Jan 31, 2025

sven1977 Feb 1, 2025

simonsays1980 Jan 31, 2025

sven1977 Feb 1, 2025

[RLlib] APPO accelerate vol 02: Various enhancements. #50162

[RLlib] APPO accelerate vol 02: Various enhancements. #50162

Conversation

sven1977 commented Jan 31, 2025 • edited Loading

Why are these changes needed?

Related issue number

Checks

simonsays1980 left a comment

Choose a reason for hiding this comment

simonsays1980 Jan 31, 2025

Choose a reason for hiding this comment

sven1977 Feb 1, 2025

Choose a reason for hiding this comment

simonsays1980 Jan 31, 2025

Choose a reason for hiding this comment

sven1977 Feb 1, 2025

Choose a reason for hiding this comment

simonsays1980 Jan 31, 2025

Choose a reason for hiding this comment

sven1977 Feb 1, 2025

Choose a reason for hiding this comment

sven1977 commented Jan 31, 2025 •

edited

Loading