# One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based Offline Reinforcement Learning
Official code to reproduce the experiments in the paper "One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based Offline Reinforcement Learning".
- Install MuJoCo 2.1.0 to `~/.mujoco/mujoco210`.
- Create a conda environment and install 1R2R:
```
cd 1R2R
conda create --name 1R2R python=3.7
conda activate 1R2R
pip install -e .
pip install -r requirements.txt
```
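As an optional sanity check (not part of the instructions above), the following Python sketch confirms that MuJoCo and the Python bindings load from the new environment. It assumes `gym` and `mujoco_py` are installed via `requirements.txt`, and `HalfCheetah-v2` is just an example task name.

```python
# Optional sanity check: verify that MuJoCo 2.1.0 and mujoco_py load correctly.
# Assumes gym and mujoco_py are pulled in by requirements.txt (an assumption,
# not something stated in this README).
import gym
import mujoco_py  # fails if ~/.mujoco/mujoco210 is missing or LD_LIBRARY_PATH is not set

env = gym.make("HalfCheetah-v2")  # any MuJoCo task works; this one is just an example
obs = env.reset()
print("MuJoCo is working; observation shape:", obs.shape)
```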
The datasets introduced for the stochastic domains can be found on the HuggingFace Hub as well as Google Drive. By default, the code expects the datasets to be located in the folder `1R2R/datasets`.
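To confirm the downloaded datasets are where the code expects them, a minimal sketch (the directory name comes from the note above; no file format is assumed, so it simply lists what is present):

```python
# List whatever is in the expected dataset directory so a missing or
# misplaced download is caught before launching a run.
from pathlib import Path

dataset_dir = Path("1R2R/datasets")  # default location expected by the code
files = sorted(p for p in dataset_dir.glob("*") if p.is_file())
if not files:
    raise FileNotFoundError(f"No dataset files found in {dataset_dir.resolve()}")
for f in files:
    print(f"{f.name}: {f.stat().st_size / 1e6:.1f} MB")
```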
Configuration files can be found in `examples/config/`. For example, to run the stochastic hopper-medium-replay task with high noise, use the following:
```
1R2R run_example examples.development --config examples.config._1R2R.stochastic_mujoco.hopper_high_noise_medium_replay --seed 0 --gpus 1
```
If importlib is unable to import the desired config file, this can be resolved by adding the repository root to the PYTHONPATH:

```
export PYTHONPATH="${PYTHONPATH}:/path/to/1R2R"
```
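To verify that the config module used in the run command above resolves correctly once `PYTHONPATH` is set, a quick check (using the same module path as the example):

```python
# Confirm that the example config module is importable; if this fails with
# ModuleNotFoundError, PYTHONPATH does not include the repository root.
import importlib

cfg = importlib.import_module(
    "examples.config._1R2R.stochastic_mujoco.hopper_high_noise_medium_replay"
)
print("Loaded config from:", cfg.__file__)
```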
By default, TensorBoard logs are generated in the `logs` directory. The code is also set up to log using Weights and Biases (WandB). To enable WandB, set `log_wandb` to `True` in the configuration file.
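The TensorBoard logs in `logs` can be viewed with the standard `tensorboard --logdir logs` command. Before enabling `log_wandb`, it can help to confirm that WandB is installed and authenticated; a minimal sketch (the project name below is a placeholder, not something the code requires):

```python
# Quick check that WandB credentials are in place before enabling
# "log_wandb" in a config file. The project name is a placeholder.
import wandb

wandb.login()  # prompts for an API key only if none is cached
run = wandb.init(project="1r2r-sanity-check", mode="offline")
run.log({"sanity_check": 1.0})
run.finish()
```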
If you use this code in your research, please cite the paper:

```
@article{rigter2023,
  title={One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based Offline Reinforcement Learning},
  author={Rigter, Marc and Lacerda, Bruno and Hawes, Nick},
  journal={Advances in Neural Information Processing Systems},
  year={2023}
}
```