Replies: 2 comments 3 replies
-
Hi shaoxiang, have you found a solution? I also find it hard to use Dict observations. |
Beta Was this translation helpful? Give feedback.
-
FANG-Zhiwei is right, num_states is the number of priviledged observations used for an actor-critic policy. (these are observations used to speed up training that are not available to the final policy, such as measurements that are available in simulation but not in the real world. Are you using a managed or direct environment? Can you elaborate on the issue you're having? I'm working in on direct environment right now and it seems fairly straight-forward to bring observations together from different sources. |
Beta Was this translation helpful? Give feedback.
-
I see that the demos in the Lab are all single_observation_space, but my experiment is multiple observation_space. Combine images and vectors into a multi-input observation space. I want to understand how to develop it. Can anyone help me? Also, I find it difficult to understand the meaning of num_states and there is no relevant explanation.
The following is the approach I found for Stable Baselines3:
Stable Baselines3 supports handling of multiple inputs by using
Dict
Gym space. This can be done usingMultiInputPolicy
, which by default uses theCombinedExtractor
features extractor to turn multipleinputs into a single vector, handled by the
net_arch
network.Beta Was this translation helpful? Give feedback.
All reactions