Some questions about the network layer of parking env (SAC+HER)
See original GitHub issueHi, i tried to run the parking env
and got the success rate of over 95%. But I still have some questions about the model structure.
The model as follows :
model = HER('MlpPolicy', env, SAC, n_sampled_goal=4,
goal_selection_strategy='future',
verbose=1, buffer_size=int(1e6),
learning_rate=1e-3, gamma=0.9, batch_size=256,
policy_kwargs=dict(layers=[256, 256, 256]))
I’m not really understanding about policy_kwargs
. I think it is the parameter to setup model or create policy and target TF objects, but i am not sure. This model does not only have one network makes me confuse.
May i ask you about the neural network architecture of this env ? If you have time to answer, I hope to figure out how many layers are there and how many neurons in each layer.
Looking forward to your reply, thanks.
Issue Analytics
- State:
- Created 3 years ago
- Comments:7 (4 by maintainers)
Top Results From Across the Web
Neural-Network-Based Dynamic Distribution Model of Parking ...
A neural-network-based dynamic parking distribution model was developed considering seven influencing factors: driving duration, walking distance, parking fee, ...
Read more >Machine Learning Technologies for Secure Vehicular ...
One of the vital questions that the researchers need to address is how to effectively handle the privacy of large groups of data...
Read more >Outside parking Wireless Coverage Best Access Points and ...
Experts, I hope all is well, what would be a recommended Access Point/Antenna to cover a large parking lot. I was thinking a...
Read more >A Reservation-based Smart Parking System
concerns on traffic congestion and environmental protection. • Walking Distance: Walking distance ... As the middle layer between the sensor network and web....
Read more >A novel parking-lot intelligent selection algorithm based on ...
This paper aims to meet the requirements for the evaluation and selection of the optimal parking lot near the destination, and establishes a ......
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
No, for this particular task I already had satisfying results with
Mlp
so I did not try more sophisticated models (plus the observation is quite small and unstructured).For the intersection environment however, I compared MLP with CNN, as well as an attention-based model, see paper and results (with a runnable colab notebook).
env.observation_space
defines the inputs of the policy, whileenv.action_space
defines its outputs.env.observation_space
is typically a continuousBox
, and you can get its dimension by callingenv.observation_space.shape
env.action_space
can be aBox
, but also aDiscrete
space, in which case you can get its size by callingenv.action_space.n