Adding support for adaptive behavior cloning #153

abhiksingla · 2022-06-29T00:21:26Z

Why are these changes needed?

This change lets the number of behavior cloning steps (the pre-training stage for offline RL training) scale with the size of the dataset. In the previous implementation, behavior cloning always ran for a fixed number of steps, regardless of how much data was available. With this change, the number of behavior cloning iterations is set dynamically from the dataset size.
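As a rough illustration of the idea (not the code in this PR), the step count could be derived from the dataset size and a number of epochs. The function name `adaptive_bc_steps` and the parameters `num_samples`, `batch_size`, `bc_epochs`, and `min_steps` are hypothetical and chosen only for this sketch; the actual implementation may compute the value differently.

```python
import math


def adaptive_bc_steps(num_samples: int, batch_size: int,
                      bc_epochs: int = 1, min_steps: int = 1) -> int:
    """Hypothetical helper: number of behavior cloning steps for a dataset.

    Assumes one step consumes one batch, so one epoch over the data takes
    ceil(num_samples / batch_size) steps.
    """
    if num_samples < 0 or batch_size <= 0 or bc_epochs < 0:
        raise ValueError("num_samples and bc_epochs must be >= 0, batch_size > 0")
    steps_per_epoch = math.ceil(num_samples / batch_size)
    # Never return fewer than min_steps, even for an empty or tiny dataset.
    return max(min_steps, steps_per_epoch * bc_epochs)


# A larger dataset yields proportionally more pre-training steps.
small = adaptive_bc_steps(num_samples=10_000, batch_size=256, bc_epochs=5)
large = adaptive_bc_steps(num_samples=100_000, batch_size=256, bc_epochs=5)
```

Expressing the budget in epochs rather than raw steps keeps the amount of pre-training per sample roughly constant as the dataset grows or shrinks.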

Related issue number

Checks

- I've run scripts/format.sh to lint the changes in this PR.
- I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
- Testing Strategy
  - Unit tests
  - Release tests
  - This PR is not tested :(

RuofanKong

looks good.

Adding support for adaptive behavior cloning

e5e298b

abhiksingla requested a review from a team as a code owner June 29, 2022 00:21

abhiksingla requested review from RuofanKong and akanso June 29, 2022 00:21

Abhik Singla added 2 commits June 28, 2022 17:24

cleanup

ef8147b

Support for bc epochs

b3a46ac

abhiksingla requested a review from dmlyubim June 29, 2022 01:13

Abhik Singla added 2 commits June 28, 2022 18:53

fix

f1d5c98

remove redundant experimental tag

4c8a8cc

RuofanKong approved these changes Jul 11, 2022

abhiksingla requested a review from Kiko-Aumond July 14, 2022 17:29


3 participants
