Skip to content

Conversation

@TheWisp
Copy link

@TheWisp TheWisp commented Nov 1, 2025

Summary

Removes normalize_inputs and normalize_targets calls from the predict_reward method that were accidentally left behind during the processor pipeline refactoring in PR #1452.

Details

  • The __init__ and forward methods were properly updated in commit 78b8661 to remove normalization
  • The predict_reward method was missed, leaving calls to non-existent methods
  • This completes the migration where normalization moved from internal policy methods to external processor pipelines

Fixes

AttributeError: 'Classifier' object has no attribute 'normalize_inputs'

Testing

Verified by checking git history that all other policies (ACT, Diffusion, SAC) had their normalization removed in the same way during PR #1452.

Related

…ict_reward

Remove normalize_inputs and normalize_targets calls from the predict_reward
method that were accidentally left behind during the processor pipeline
refactoring in PR huggingface#1452.

This completes the migration where normalization was moved from internal
policy methods to external processor pipelines. The __init__ and forward
methods were updated in commit 78b8661, but predict_reward was missed.

Fixes AttributeError: 'Classifier' object has no attribute 'normalize_inputs'
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant