Skip to content

Latest commit

 

History

History
12 lines (8 loc) · 596 Bytes

File metadata and controls

12 lines (8 loc) · 596 Bytes

Trust-Region-Policy-Optimization

My attepmt at a TRPO implementation in pytorch. :)

The implementation is inspired from UC Berkeley's Deep RL Bootcamp's assignments and the following TRPO implementations by ikostrikov , mjacar and the original implementation by John Schulman.

Running Instructions

python main.py

All parameters exist in trpo_agent.py