I was using the MuJoCo environment 'InvertedPendulum-v1' in OpenAI Gym, and I noticed an interesting issue. I checked the bounds of the environment's action space.
It showed that valid actions should lie within the range [-3, 3]. However, an action outside that range still appears to work. For example,
action = 40  # well outside the reported bounds of [-3, 3]
observation, reward, done, info = env.step(action)  # still appears to work
So I wonder whether there is a reason for this behavior, and whether it applies to all MuJoCo environments.
@RuofanKong I think MuJoCo internally clips the action to 3 (the maximum) in that case. However, I couldn't find confirmation of this in the source code. It seems logical, but it would be great if someone could confirm!
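Whether or not the simulator clips internally, clipping explicitly before calling `env.step` removes the ambiguity. Below is a minimal sketch in plain Python. `clip_action` is a hypothetical helper, not part of Gym, and the bounds are the ones reported above for 'InvertedPendulum-v1'. With NumPy, `np.clip(action, env.action_space.low, env.action_space.high)` should do the same job.

```python
def clip_action(action, low, high):
    """Clamp each component of `action` into [low[i], high[i]].

    Hypothetical helper for illustration; not part of Gym or MuJoCo.
    """
    return [min(max(a, lo), hi) for a, lo, hi in zip(action, low, high)]


# Bounds reported in this issue for InvertedPendulum-v1 (one action dimension).
low, high = [-3.0], [3.0]

# The out-of-range action from the example is clamped to the upper bound.
clipped = clip_action([40.0], low, high)
print(clipped)
```

Clipping on the agent side also means the action the policy is trained on matches the one passed to the simulator, whatever the simulator does with out-of-range values.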
Other MuJoCo tasks, such as "Swimmer-v1" and "Walker2d-v1", also have bounded action spaces, specifically [-1, 1] in every dimension. I cannot confirm whether the clipping assumption is right here.
I proposed in another issue, #1442, that there could be a wrapper that checks for invalid observation, reward, and action values and raises an error for the user. There's some more discussion on baselines about how the agent should deal with this: openai/baselines#121
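As a rough illustration of the action-checking part of such a wrapper, here is a sketch that raises on out-of-bounds actions instead of passing them through. Everything in it is an assumption for illustration: `ActionBoundsChecker` is a hypothetical name, not the wrapper proposed in #1442, and `_StubEnv` stands in for a real environment so the snippet runs without Gym or MuJoCo. It only assumes the wrapped object exposes `action_space.low`, `action_space.high`, and `step`.

```python
class ActionBoundsChecker:
    """Hypothetical wrapper: rejects actions outside the declared bounds."""

    def __init__(self, env):
        self.env = env
        self.low = list(env.action_space.low)
        self.high = list(env.action_space.high)

    def step(self, action):
        values = list(action)
        if len(values) != len(self.low):
            raise ValueError(
                f"expected {len(self.low)} action values, got {len(values)}"
            )
        for i, (a, lo, hi) in enumerate(zip(values, self.low, self.high)):
            # NaN fails this comparison too, so it is rejected as well.
            if not (lo <= a <= hi):
                raise ValueError(f"action[{i}]={a} is outside [{lo}, {hi}]")
        return self.env.step(action)


class _StubSpace:
    # Bounds reported in this issue for InvertedPendulum-v1.
    low = [-3.0]
    high = [3.0]


class _StubEnv:
    """Stand-in for a Gym environment; returns a fixed transition."""

    action_space = _StubSpace()

    def step(self, action):
        return [0.0], 1.0, False, {}


env = ActionBoundsChecker(_StubEnv())
env.step([2.0])  # within bounds, forwarded to the wrapped environment

try:
    env.step([40.0])  # the out-of-range action from the example
except ValueError as err:
    print("rejected:", err)
```

Raising makes the mismatch visible during development. Whether to raise, warn, or clip is a design choice that the linked baselines discussion covers in more detail.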