The action space of the "mountaincarcontinuous" environment accepts actions in the range [-1, 1]. However, the code does not enforce this. If the user passes an action such as 3, the environment still returns the next state and reward without complaint. This can hide bugs, for example when the user has made a mistake during discretization. I think it would be good to add this check to the code base and warn the user.
I believe we have avoided checking the validity of observations, rewards, and actions in existing environments. It should be possible to construct a wrapper that checks these and throws an exception if they are outside the expected range.
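A minimal sketch of such a wrapper, limited to action checking, could look like the following. It is written in plain Python so it runs on its own. `ActionRangeChecker` and `StubEnv` are hypothetical names made up for illustration, not part of the library, and the stub only stands in for an environment with a `step(action)` method.

```python
class StubEnv:
    """Stand-in environment (hypothetical); accepts any action, like the issue describes."""

    def step(self, action):
        # Return a dummy (observation, reward, done, info) tuple.
        return 0.0, 0.0, False, {}


class ActionRangeChecker:
    """Hypothetical wrapper that rejects actions outside [low, high]."""

    def __init__(self, env, low, high):
        self.env = env
        self.low = low
        self.high = high

    def step(self, action):
        # Accept either a scalar or a list/tuple of scalars.
        values = action if isinstance(action, (list, tuple)) else [action]
        for value in values:
            if not (self.low <= value <= self.high):
                raise ValueError(
                    f"action {action!r} is outside [{self.low}, {self.high}]"
                )
        return self.env.step(action)

    def __getattr__(self, name):
        # Forward everything else (reset, render, ...) to the wrapped env.
        return getattr(self.env, name)


env = ActionRangeChecker(StubEnv(), low=-1.0, high=1.0)
env.step([0.5])      # within bounds, passed through to the wrapped env
try:
    env.step([3.0])  # out of bounds
except ValueError as err:
    print(err)
```

Raising an exception is one choice; issuing a warning or clipping the action instead would be a small change inside `step`, depending on how strict the check should be.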