You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello, I setup a custom environment with my observation and action spaces as such:
high = np.array([1.5, 1.5, 1.5,1.5])
action_space = spaces.Box(low=np.array([0,0]), high=np.array([0.05,2*np.pi]), dtype=np.float32)
observation_space = spaces.Box(low=-high, high=high, dtype=np.float32)
However, when using my environment with PPO2, it seems to be willingly violating these. I tried debugging the script and stopped it within model.py and policies.py but the values remain unchanged. At line 52 of policies,py, though, the sampled action violates the provided space.
Can anyone assist me in figuring out what's going on? Just let me know if you need anything else. Thanks!
The text was updated successfully, but these errors were encountered:
joellutz's solution looks like it is for ddpg, so it might not work with ppo2. If it doesn't, a clipping wrapper on the environment as proposed by olegklimov may be a good solution.
Hello, I setup a custom environment with my observation and action spaces as such:
high = np.array([1.5, 1.5, 1.5,1.5])
action_space = spaces.Box(low=np.array([0,0]), high=np.array([0.05,2*np.pi]), dtype=np.float32)
observation_space = spaces.Box(low=-high, high=high, dtype=np.float32)
However, when using my environment with PPO2, it seems to be willingly violating these. I tried debugging the script and stopped it within model.py and policies.py but the values remain unchanged. At line 52 of policies,py, though, the sampled action violates the provided space.
Can anyone assist me in figuring out what's going on? Just let me know if you need anything else. Thanks!
The text was updated successfully, but these errors were encountered: