Move RLOOTrainer to trl.experimental.rloo #4484

behroozazarkhalili · 2025-11-06T02:16:35Z

Summary

This PR migrates RLOOTrainer to the experimental module as part of the TRL V1 refactoring effort.

Changes

- Create trl.experimental.rloo module with RLOOTrainer and RLOOConfig
- Add deprecation stubs in trl.trainer with FutureWarning (removal in TRL 0.29.0)
- Update imports in tests, examples (3 files), and scripts
- Update documentation:
  - Move RLOO from Trainers to Experimental section in _toctree.yml
  - Add deprecation notice to rloo_trainer.md
  - Update index.md to show experimental.rloo.RLOOTrainer with 🧪 emoji
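
For readers unfamiliar with the stub pattern, here is a minimal sketch of what a deprecation stub of this kind can look like. It is not the code from this PR: `ExperimentalRLOOTrainer` is a hypothetical stand-in for the class that now lives in trl.experimental.rloo, and the warning text is an assumption. Only the FutureWarning category and the 0.29.0 removal target come from the description above.

```python
import warnings


class ExperimentalRLOOTrainer:
    """Hypothetical stand-in for the trainer now located in trl.experimental.rloo."""

    def __init__(self, *args, **kwargs):
        self.args = args
        self.kwargs = kwargs


class RLOOTrainer(ExperimentalRLOOTrainer):
    """Sketch of a stub kept at the old import path.

    It holds no implementation of its own: it warns, then delegates
    everything to the experimental class it subclasses.
    """

    def __init__(self, *args, **kwargs):
        # Assumed message wording; the real stub may phrase this differently.
        warnings.warn(
            "RLOOTrainer has moved to trl.experimental.rloo and is scheduled "
            "for removal from trl.trainer in TRL 0.29.0.",
            FutureWarning,
            stacklevel=2,  # attribute the warning to the caller, not the stub
        )
        super().__init__(*args, **kwargs)
```

Subclassing rather than re-exporting keeps `isinstance` checks against the experimental class working for objects built through the old path.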

Testing

- All existing tests continue to work with deprecation warnings
- Backward compatibility maintained through deprecation stubs
- Import paths verified in tests and examples
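
One way to check the backward-compatibility behavior described above is to record warnings around a call through the old path. The sketch below uses only the standard library; `make_legacy_trainer` is a hypothetical stand-in for constructing the trainer through trl.trainer, and its message text is an assumption, not the wording used in this PR.

```python
import warnings


def make_legacy_trainer():
    """Hypothetical stand-in for building RLOOTrainer via the old trl.trainer path."""
    warnings.warn(
        "RLOOTrainer moved to trl.experimental.rloo; the old path is "
        "scheduled for removal in TRL 0.29.0.",
        FutureWarning,
        stacklevel=2,
    )
    return object()


def collect_future_warnings(factory):
    """Call factory() and return the messages of every FutureWarning it emits."""
    with warnings.catch_warnings(record=True) as caught:
        # "always" prevents Python from suppressing repeated warnings.
        warnings.simplefilter("always")
        factory()
    return [str(w.message) for w in caught if issubclass(w.category, FutureWarning)]


messages = collect_future_warnings(make_legacy_trainer)
assert len(messages) == 1
assert "trl.experimental.rloo" in messages[0]
```

In a pytest suite the same check is usually written with `pytest.warns(FutureWarning)`; the standard-library form is shown here to keep the sketch dependency-free.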

Contributes to #4374
Fixes #4468

- Create trl.experimental.rloo module with RLOOTrainer and RLOOConfig
- Add deprecation stubs in trl.trainer with FutureWarning (removal in TRL 0.29.0)
- Update imports in tests, examples, and documentation
- Move RLOO to Experimental section in docs/_toctree.yml

Contributes to #4374
Fixes #4468

HuggingFaceDocBuilderDev · 2025-11-06T02:19:10Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

- Update dataset_formats.md: RLOOTrainer → experimental.rloo.RLOOTrainer
- Update example_overview.md: RLOOTrainer → experimental.rloo.RLOOTrainer
- Update rloo_trainer.md: all trainer references to experimental path
- Move test file to tests/experimental/test_rloo_trainer.py
- Update test imports to use parent directory reference

Follows pattern from XPO PR #4485

behroozazarkhalili added 4 commits November 5, 2025 19:09

Fix ruff linting errors for RLOOTrainer migration

bf2adc6

Merge main into refactor/move-rloo-to-experimental

a9f2151

Fix RLOO trainer wrapper file corruption

63a99cb

Remove implementation code that was incorrectly merged into the deprecation wrapper. The wrapper should only contain the deprecation warning and delegate to the experimental module.
