PyTorch implementation of AlgaeDICE, as described in the paper:

"AlgaeDICE: Policy Gradient from Arbitrary Experience" by Ofir Nachum, Bo Dai, Ilya Kostrikov, Yinlam Chow, Lihong Li, and Dale Schuurmans.

The paper is available on arXiv: https://arxiv.org/abs/1912.02074

The original TensorFlow implementation is available at https://github.com/google-research/google-research/tree/master/algae_dice
You can cite the code base:

```
@misc{pytorchrl,
  author = {Arnob, SY},
  title = {PyTorch Implementations of DICE Algorithms},
  year = {2020},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/SaminYeasar/PyTorch-implementation-DICE-algorithms}},
}
```
Run AlgaeDICE on HalfCheetah:

```bash
python -m algae_dice.train_eval --logtostderr --save_dir=$HOME/algae/ \
  --env_name=HalfCheetah-v2 --seed=42
```
- Double-Q learning and a mixed critic update are important for training AlgaeDICE (see the first sketch after this list).
- Unlike the original implementation, there is no separate buffer for storing initial states; instead, every sampled state is treated as an initial state for the agent (see the second sketch below). A similar assumption is made [here](https://arxiv.org/abs/1912.05032).
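To make the first bullet concrete, here is a minimal sketch of a twin-critic ("double-Q") AlgaeDICE loss in PyTorch. It assumes the p = 2 conjugate f*(x) = x²/2 from the paper and one plausible reading of the "mixed" update: the residual loss is averaged over both critics, while the actor would use their pessimistic minimum (as in TD3/SAC). All names (`nu1`, `nu2`, `policy`, `algae_critic_loss`) and hyperparameter values are illustrative, not this repo's exact API.

```python
import torch

# Illustrative hyperparameters; not necessarily this repo's defaults.
GAMMA, ALPHA = 0.99, 0.01

def f_star(x):
    # Convex conjugate f*(x) = x^2 / 2, the p = 2 case from the paper.
    return 0.5 * x ** 2

def algae_critic_loss(nu1, nu2, policy, batch):
    """Twin-critic AlgaeDICE loss, averaged ("mixed") over both critics."""
    s, a, r, s_next = batch        # transitions sampled from the replay buffer
    a_next = policy(s_next)        # next action under the current policy
    total = 0.0
    for nu in (nu1, nu2):
        # Bellman residual of the nu-network under the current policy.
        delta = r + GAMMA * nu(s_next, a_next) - nu(s, a)
        # Initial-state term: every sampled state doubles as an initial state.
        init_term = (1.0 - GAMMA) * nu(s, policy(s)).mean()
        total = total + init_term + ALPHA * f_star(delta / ALPHA).mean()
    return total / 2.0
```

The critics minimize this objective while the policy is updated to maximize it, which is the adversarial max-min structure described in the paper; for the policy update, the pessimistic minimum `torch.min(nu1(s, a), nu2(s, a))` is the usual double-Q choice.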
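For the second bullet: rather than maintaining a dedicated buffer of episode-start states for the (1 − γ) · E[ν(s0, a0)] term of the objective, the batch of sampled states stands in for the initial states directly. A minimal sketch, assuming hypothetical `nu` and `policy` callables operating on PyTorch tensors:

```python
import torch

def initial_state_term(nu, policy, states, gamma=0.99):
    """(1 - gamma) * E[nu(s0, a0)], where every state in the sampled batch
    plays the role of an initial state s0 and the initial action a0 is
    drawn from the current policy."""
    return (1.0 - gamma) * nu(states, policy(states)).mean()
```

Under this simplification, the same replay batch feeds both the Bellman-residual term and the initial-state term, so no separate episode-start sampling path is needed.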