Reinforcement learning algorithms and experiments in Python

This repository contains Python code that can be used in order to experiment with reinforcement learning in Python. The code is organized in several components that can be mix and matched. For instance, different kinds of RL algorithms (Q-Learning, advantage, etc) can be tested on a specific world or problem. An algorithm can also be configured to use one of the possible models (Q-Learning can store the Q values in a simple dictionary, or using different kinds of function approximation methods).

AbstractWorld: Environment and behavior of an agent. The world defines the number of possible actions, and produces observations and rewards when actions are carried out.
AbstractLearning: Observes states and rewards and choose actions to perform.
AbstractModel: Stores and retrieve values. For instance, a model is used to associate Q values to (state, action) pairs. A model can be discrete or based on function approximation.

Dependencies

This project uses several machine-learning Python libraries. Most of them are optional, the program being able to run (with limited functionality) without them. Here is the list of dependencies, with instructions about how to install them.

NumPy : Available on PyPi (numpy)
Matplotlib : Available on PyPi (matplotlib)
Theano (optional) : Available on PyPi (Theano)
Keras (optional) : Available on PyPi (Keras)
FANN2 (optional) : Available on PyPi (fann2)
rlglue-py3 (optional) : https://github.com/steckdenis/rlglue-py3 . Python bindings for Python exist for some time but were never ported to Python3
rlglue-py (optional) : Python 2 version of rl-glue, can be used if you run this code with Python 2
rospy (optional) : Allows ROSWorld to be used. If your rospy is based on Python 2, then this project will also have to be run using Python 2.

Name		Name	Last commit message	Last commit date
Latest commit History 67 Commits
learning		learning
model		model
texplore		texplore
world		world
.gitignore		.gitignore
README.md		README.md
main.py		main.py
run_all.serial.sh		run_all.serial.sh
run_all.sh		run_all.sh
vrep_inverted_pendulum.ttt		vrep_inverted_pendulum.ttt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reinforcement learning algorithms and experiments in Python

Dependencies

About

Releases

Packages

Languages

steckdenis/rlpy

Folders and files

Latest commit

History

Repository files navigation

Reinforcement learning algorithms and experiments in Python

Dependencies

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages