The name rl_toolbox comes from the many implementations of RL algorithms that I tested before settling on the PPO implementation from Stable Baselines3.
This repo contains the environment as well as the helper algorithms needed to train a quadruped in simulation and deploy the resulting neural networks on a real machine.
- Gym environment to simulate a quadruped (specifically Idef'X) using the Erquy simulator.
- Helper functions to compute inverse kinematics (IK) and build a small library of motions for the RL policy to build on.
- Small reimplementation of Stable Baselines3's PPO that enforces a symmetrical gait and avoids very large KL divergence between updates.
- Transfer algorithm to train a student policy that only has access to measurable physical quantities against a teacher policy trained with RL on the full observation space. Inspiration heavily drawn from this paper.
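The symmetry constraint above can be expressed as an auxiliary loss added to the PPO objective: the mirror of the action taken in a mirrored state should match the action taken in the original state. A minimal sketch, assuming linear mirroring operators `M_obs` and `M_act` (illustrative names, not the repo's API), with a toy 2-D space where mirroring swaps the two components:

```python
import numpy as np

def symmetry_loss(policy_fn, obs, M_obs, M_act):
    """Mean squared difference between the action taken in the
    mirrored state (mapped back through the action mirror) and the
    action taken in the original state. Added to the PPO surrogate
    loss, this term drives the policy toward a symmetric gait."""
    a = policy_fn(obs)                  # actions in original states
    a_mirror = policy_fn(obs @ M_obs.T) # actions in mirrored states
    return float(np.mean((a_mirror @ M_act.T - a) ** 2))

# Toy mirroring: swap left/right components of a 2-D obs/action space.
M = np.array([[0.0, 1.0],
              [1.0, 0.0]])

obs = np.random.default_rng(0).normal(size=(8, 2))
sym_policy = lambda o: o                      # identity: fully symmetric
asym_policy = lambda o: o @ np.diag([1.0, 2.0])  # treats legs differently

print(symmetry_loss(sym_policy, obs, M, M))   # ~0 for the symmetric policy
print(symmetry_loss(asym_policy, obs, M, M))  # > 0 for the asymmetric one
```

A symmetric policy commutes with the mirroring operators, so its loss vanishes; any left/right bias shows up as a positive penalty.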
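The teacher-student transfer can be sketched as a regression: the teacher acts from privileged observations (e.g. true contact states), while the student, restricted to measurable quantities, is trained to imitate the teacher's actions. A toy sketch with a linear student (all names illustrative, not the repo's code):

```python
import numpy as np

def distill_step(student, teacher_actions, student_obs, lr=0.1):
    """One gradient step of least-squares regression pulling the
    (toy, linear) student policy toward the teacher's actions on the
    observations the student can actually measure."""
    pred = student_obs @ student["W"]            # student's actions
    err = pred - teacher_actions                 # imitation error
    grad = student_obs.T @ err / len(student_obs)
    student["W"] -= lr * grad
    return float(np.mean(err ** 2))

# Toy data: pretend the teacher's behaviour is recoverable from the
# measurable observations alone.
rng = np.random.default_rng(0)
student_obs = rng.normal(size=(64, 3))
teacher_actions = student_obs @ rng.normal(size=(3, 2))

student = {"W": np.zeros((3, 2))}
losses = [distill_step(student, teacher_actions, student_obs)
          for _ in range(200)]
print(losses[0], losses[-1])  # imitation error shrinks over training
```

In practice the student is a neural network and the regression targets come from rolling out the student's own trajectories (DAgger-style), but the structure of the update is the same.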
The baseline policy obtained this way is a simple walking policy, robust to small pushes on the robot.