action constraints in PPO #320

Open
niklasnolte opened this issue Mar 1, 2020 · 4 comments
Labels
type:feature request (New feature or request)

Comments

@niklasnolte

Hi, what is the best way to implement action constraints in a PPOAgent?
For a QPolicy I can use observation_and_action_constraint_splitter. Is there something equivalent for PPO policies?
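
For context, the splitter mentioned here is a callable that takes the full observation and returns a tuple of (network input, action constraint). A minimal plain-Python sketch, assuming the environment emits a dict observation; the keys `"observation"` and `"mask"` and the function name `split_observation` are hypothetical choices for illustration, not names required by TF-Agents:

```python
def split_observation(full_observation):
    """Split a dict observation into (network_input, action_mask).

    Assumes (hypothetically) that the environment packs both parts into one
    dict under the keys "observation" and "mask".
    """
    network_input = full_observation["observation"]
    action_mask = full_observation["mask"]  # 1 = allowed, 0 = disallowed
    return network_input, action_mask


# Example observation with three discrete actions, the middle one disallowed.
obs = {"observation": [0.2, -1.3], "mask": [1, 0, 1]}
net_in, mask = split_observation(obs)
```

A function of this shape is what gets passed as the `observation_and_action_constraint_splitter` argument for Q-based agents and policies.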

@oars
Contributor

oars commented Mar 2, 2020

For an environment with discrete actions, you could follow a similar pattern. Continuous actions are a bit trickier; one option is to use truncated normals for the action distribution.
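
For the discrete case, one common way to follow this pattern is to mask the policy's logits so that disallowed actions receive zero probability before sampling. The sketch below is plain Python rather than TF-Agents code; `masked_softmax` and `sample_action` are hypothetical helper names, and the mask is assumed to be a 0/1 list with one entry per action.

```python
import math
import random


def masked_softmax(logits, mask):
    """Return action probabilities with masked-out actions forced to zero."""
    if not any(mask):
        raise ValueError("mask must allow at least one action")
    # Disallowed actions get a logit of -inf, so exp() maps them to 0.
    masked = [l if m else -math.inf for l, m in zip(logits, mask)]
    top = max(masked)  # subtract the max for numerical stability
    exps = [math.exp(l - top) for l in masked]
    total = sum(exps)
    return [e / total for e in exps]


def sample_action(logits, mask, rng=random):
    """Sample an action index from the masked distribution."""
    probs = masked_softmax(logits, mask)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


probs = masked_softmax([1.0, 2.0, 3.0], [1, 0, 1])
action = sample_action([1.0, 2.0, 3.0], [1, 0, 1])
```

For PPO the same mask has to be applied when log-probabilities are recomputed during training, otherwise the probability ratio compares two different distributions.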

oars added the type:feature request label on Mar 2, 2020

@kuanghuei
Contributor

kuanghuei commented Mar 2, 2020

See #216 for continuous actions.
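
The truncated-normal idea raised earlier in the thread can be sketched as follows. This is a standalone illustration under stated assumptions, not code from #216 or from TF-Agents: the function names are hypothetical, sampling uses simple rejection, and the log-probability is renormalized by the probability mass inside the bounds.

```python
import math
import random


def _normal_cdf(z):
    """Standard normal CDF."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def sample_truncated_normal(mean, std, low, high, rng=random, max_tries=10_000):
    """Draw from N(mean, std) restricted to [low, high] by rejection."""
    for _ in range(max_tries):
        x = rng.gauss(mean, std)
        if low <= x <= high:
            return x
    raise RuntimeError("no sample landed inside [low, high]")


def truncated_normal_log_prob(x, mean, std, low, high):
    """Log-density of the truncated normal at x (-inf outside the bounds)."""
    if not low <= x <= high:
        return -math.inf
    z = (x - mean) / std
    log_pdf = -0.5 * z * z - math.log(std * math.sqrt(2.0 * math.pi))
    # Renormalize by the probability mass that falls inside the bounds.
    mass = _normal_cdf((high - mean) / std) - _normal_cdf((low - mean) / std)
    return log_pdf - math.log(mass)


x = sample_truncated_normal(0.0, 1.0, -0.5, 0.5)
```

Rejection sampling is only practical when the bounds keep a reasonable share of the probability mass; for tight bounds far from the mean, an inverse-CDF sampler would be a better fit.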

@niklasnolte
Author

niklasnolte commented Mar 7, 2020

I have a discrete case, but I was wondering about the technical part: is there a feature that implements the observation_and_action_constraint_splitter functionality for PPOAgents?
Given that you added the label, I guess not yet.

@JasonHuang2000

JasonHuang2000 commented Apr 3, 2022

Hello @niklasnolte, I want to train a PPO agent on a custom environment with a discrete action space and compare its performance with a DQN agent. Have you figured out how to apply action constraints to a PPO agent? Thanks a lot!

Edit: for anyone who encounters the same problem, I found a possible solution in #452.
