Added Saving functionality. #3

rudrasohan · 2018-11-30T05:22:25Z

Why?
There was no previous option to support the saving of trained networks(ddpg). Hence to add that functionality.

What?
Built a saving function where you can manually specify a dir for the network to save and can also load the desired model back.

Testing

For saving:
Specify the path of the save directory in the field specify/path in the ddpg.py. Then run the code as usual. python -m baselines.run --alg='ddpg' --env='Madras-v0'

For loading:
python -m baselines.run --alg='ddpg' --env='Madras-v0' --load_path=/specify/path/to/save/file

NOTE:
The save function takes in a dir whereas the load function takes in a file.
This PR also includes #1 as that will be required for madras-env to be working.

…to comp

buridiaditya · 2018-11-30T13:50:26Z

baselines/ddpg/ddpg.py

@@ -42,6 +42,8 @@ def learn(network, env,
          tau=0.01,
          eval_env=None,
          param_noise_adaption_interval=50,
+          load_path = None,
+          save_path = '<specify/path>'


Missed a comma

buridiaditya · 2018-11-30T13:51:13Z

baselines/ddpg/ddpg.py

@@ -269,5 +274,10 @@ def as_scalar(x):
                with open(os.path.join(logdir, 'eval_env_state.pkl'), 'wb') as f:
                    pickle.dump(eval_env.get_state(), f)

+            os.mkdirs(logdir,exist_ok=True)


Error in python mkdirs does not exist

buridiaditya

Error :
Traceback (most recent call last): File "/usr/home/brahma/anaconda3/envs/torcs/lib/python3.6/runpy.py", line 193, in _run_module_as_main "__main__", mod_spec) File "/usr/home/brahma/anaconda3/envs/torcs/lib/python3.6/runpy.py", line 85, in _run_code exec(code, run_globals) File "/usr/home/brahma/BURIDI/MADRAS/baselines/baselines/run.py", line 226, in <module> main(sys.argv) File "/usr/home/brahma/BURIDI/MADRAS/baselines/baselines/run.py", line 198, in main model, env = train(args, extra_args) File "/usr/home/brahma/BURIDI/MADRAS/baselines/baselines/run.py", line 81, in train **alg_kwargs File "/usr/home/brahma/BURIDI/MADRAS/baselines/baselines/ddpg/ddpg.py", line 98, in learn agent.load(load_path) TypeError: 'NoneType' object is not callable

Recreation:

Ran the training with save path mentioned it created files with epoch numbers.
Load any of the file to recreate the error. Loading using this command
python -m baselines.run --alg='ddpg' --env='Madras-v0' --load_path='file_path'
and also tried directly setting the load_path in the code.

hari-sikchi · 2018-12-07T06:05:30Z

If you select the noise type as parameter noise, then there is a variable in agent called agent.param_noise.current_stddev, which is not a tensor, so when you save the trained agent and do the restoring, this parameter will not be recovered.
If you save/restore the policy as in Saving and restoring DDPG agent openai/baselines#162, if normalize_observations=True (the default value), then the mean/std used for normalization will not be recoverable.

Restating some issues here for convenience. @rudrasohan

Supports for Madras Env

67df72f

rudrasohan added the enhancement New feature or request label Nov 30, 2018

rudrasohan self-assigned this Nov 30, 2018

rudrasohan requested review from MehaKaushik and buridiaditya November 30, 2018 05:22

buridiaditya mentioned this pull request Nov 30, 2018

Supports for Madras Env #1

Closed

rudrasohan added 2 commits November 30, 2018 15:56

Merge branch 'master' of https://github.com/buridiaditya/baselines in…

2d5593f

…to comp

added saving function for ddpg

bba5cfc

buridiaditya reviewed Nov 30, 2018

View reviewed changes

rudrasohan added 2 commits December 1, 2018 04:21

fixed typos

3e28aee

fixed typos 1

f66db73

buridiaditya requested changes Dec 1, 2018

View reviewed changes

rudrasohan added 3 commits December 1, 2018 23:23

fixed loader

44e1b53

prevent file overwriting

a33f0e1

removed specific path

0ddec01

buridiaditya approved these changes Dec 5, 2018

View reviewed changes

buridiaditya merged commit 198bbed into madras-simulator:master Dec 5, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added Saving functionality. #3

Added Saving functionality. #3

rudrasohan commented Nov 30, 2018 •

edited

Loading

buridiaditya Nov 30, 2018

buridiaditya Nov 30, 2018

buridiaditya left a comment

hari-sikchi commented Dec 7, 2018

Added Saving functionality. #3

Added Saving functionality. #3

Conversation

rudrasohan commented Nov 30, 2018 • edited Loading

buridiaditya Nov 30, 2018

Choose a reason for hiding this comment

buridiaditya Nov 30, 2018

Choose a reason for hiding this comment

buridiaditya left a comment

Choose a reason for hiding this comment

hari-sikchi commented Dec 7, 2018

rudrasohan commented Nov 30, 2018 •

edited

Loading