action constraints in PPO #320

niklasnolte · 2020-03-01T18:26:31Z

Hi, what is the best way to implement action constraints in a PPOAgent?
For a QPolicy i can use observation_and_action_constraint_splitter. Is there something equivalent for ppo policies?

The text was updated successfully, but these errors were encountered:

oars · 2020-03-02T16:48:23Z

For an environment that has discrete actions you could do a similar pattern. In the case of continuous actions it gets a bit more tricky, you could use truncated normals for the distribution for example.

kuanghuei · 2020-03-02T22:09:06Z

#216 - for continuous actions

niklasnolte · 2020-03-07T17:21:59Z

i have a discrete pattern case. but i was wondering about the technical part, meaning is there a feature that implements the observation_and_action_constraint_splitter functionality for PPOAgents?
Given that you added the label, i guess not yet.

JasonHuang2000 · 2022-04-03T09:05:56Z

Hello @niklasnolte, I want to train my PPO agent on an custom environment with discrete action space, while its performance will be compared with DQN agent. Have you figured out how to apply action constraints on PPO agent? Thanks a lot!

edit: For anyone who encountered the same problem, I found a possible solution in #452.

oars added the type:feature request New feature or request label Mar 2, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

action constraints in PPO #320

action constraints in PPO #320

niklasnolte commented Mar 1, 2020

oars commented Mar 2, 2020

kuanghuei commented Mar 2, 2020 •

edited

Loading

niklasnolte commented Mar 7, 2020 •

edited

Loading

JasonHuang2000 commented Apr 3, 2022 •

edited

Loading

action constraints in PPO #320

action constraints in PPO #320

Comments

niklasnolte commented Mar 1, 2020

oars commented Mar 2, 2020

kuanghuei commented Mar 2, 2020 • edited Loading

niklasnolte commented Mar 7, 2020 • edited Loading

JasonHuang2000 commented Apr 3, 2022 • edited Loading

kuanghuei commented Mar 2, 2020 •

edited

Loading

niklasnolte commented Mar 7, 2020 •

edited

Loading

JasonHuang2000 commented Apr 3, 2022 •

edited

Loading