ReLoD: The Remote-Local Distributed System for Real-time Reinforcement Learning on Vision-Based Robotics Tasks
ReLoD uses a wired local and a wireless remote computer to perform real-time learning, an appealing setting for industrial learning systems. It is a generalist RL system for learning with real robots from scratch! Check out how ReLoD learns to perform vision-based tasks on UR5 and Roomba (iRobot Create 2): Youtube video
- Soft Actor Critic (SAC)
- Proximal Policy Optimization (PPO)
N.B: All vision-based experiments use Random Augmented Data (RAD) to improve sample efficiency
UR-Reacher |
Franka-VisualReacher |
|---|
Create-Reacher |
Vector-ChargerDetector |
|---|
| Hyper-parameter | Value |
|---|---|
| Replay buffer | 100K |
| Actor step size | 3e-4 |
| Critic step size | 3e-4 |
| Entropy coefficient step size | 3e-4 |
| Batch size | 256 |
| Discount factor | 0.99 |
| Update every |
2 |
| Num. update epochs every |
1 |
| Actor MLP hidden sizes | [512 512] |
| Critic MLP hidden sizes | [512 512] |
| Warm-up time steps | 1000 |
| Adam optimizer betas | [0.9, 0.999] |
| Initial temperature | 0.1 |
| Neural network activation | ReLU |
- Download Mujoco and license files to ~/.mujoco
- Install miniconda or anaconda
- Create a virtual environment:
conda create --name myenv python=3.6 # Python 3.6 is necessary
conda activate myenv- Add the following to ~/.bashrc:
conda activate myenv
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/home/<username>/.mujoco/mjpro210/bin # Change based on mujoco version
export MUJOCO_GL="egl" # System specificand run:
source ~/.bashrc- Install packages with:
pip install -r requirements.txt
pip install . python task_ur5_visual_reacher.py --work_dir "./results" --mode 'l' --seed 0 --env_steps 200100 The code for the Franka task can be found in this branch.
Wang, Y.⋆, Vasan, G.⋆, & Mahmood, A. R. (2023). Real-time reinforcement learning for vision-based robotics utilizing local and remote computers. In Proceedings of the 2023 International Conference on Robotics and Automation (ICRA).



