Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
StoneT2000 committed Aug 9, 2024
1 parent 43fd378 commit b15eff6
Showing 1 changed file with 6 additions and 0 deletions.
6 changes: 6 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -164,6 +164,12 @@ XLA_PYTHON_CLIENT_PREALLOCATE=false python scripts/collect_demos.py exps/path/to
num_envs=8 num_episodes=1000
```

### Learning without Action Labels, just Environment States

There are situations where a demonstration collected do not have good action labels (e.g. too noisy), or the actions are in a undesired action space. RFCL lets you train on purely environment states in demonstrations, which lets you learn RL policies in any feasible action space and is independent of action label quality.

This will be less sample-efficient but can work when given sufficient compute and is likely the only open-sourced sim-based algorithm capable of learning without actions in demonstrations, just environment states.

## Testing on New Environments 🌏 / Customization 🎨

### Adding a new Environment
Expand Down

0 comments on commit b15eff6

Please sign in to comment.