Skip to content

Latest commit

 

History

History
24 lines (13 loc) · 396 Bytes

README.md

File metadata and controls

24 lines (13 loc) · 396 Bytes

Reinforcement.Learning

Contributions are welcome

Progress

    • Deep Q Network
    • Dueling Q Network
    • Policy Gradient: REINFORCE
    • Advantage Actor-Critic
    • Deep Deterministic Policy Gradient

TODO

    • Asynchronous Advantage Actor-Critic (A3C)
    • Estimate the concrete performance of each algorithms

Licence

MIT Licence