We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
i am testing my AC code using Pendulum-v0 environment and got this error:
gym\envs\classic_control\pendulum.py:88: RuntimeWarning: invalid value encountered in remainder return (((x+np.pi) % (2*np.pi)) - np.pi)
Afterwhich my rewards during training were [nan]: Episode 23 of 100, score [nan]. Would anyone know what might be the cause for this?
The text was updated successfully, but these errors were encountered:
Are you clipping your actions to within the allowed range for the action space of the environment? https://github.com/openai/gym/blob/master/gym/envs/classic_control/pendulum.py#L20
Sorry, something went wrong.
Closing due to lack of activity. Possibly related issue: openai/baselines#121
No branches or pull requests
i am testing my AC code using Pendulum-v0 environment and got this error:
gym\envs\classic_control\pendulum.py:88: RuntimeWarning: invalid value encountered in remainder
return (((x+np.pi) % (2*np.pi)) - np.pi)
Afterwhich my rewards during training were [nan]: Episode 23 of 100, score [nan]. Would anyone know what might be the cause for this?
The text was updated successfully, but these errors were encountered: