question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Some questions about the network layer of parking env (SAC+HER)

See original GitHub issue

Hi, i tried to run the parking env and got the success rate of over 95%. But I still have some questions about the model structure. The model as follows :

model = HER('MlpPolicy', env, SAC, n_sampled_goal=4,
            goal_selection_strategy='future',
            verbose=1, buffer_size=int(1e6),
            learning_rate=1e-3, gamma=0.9, batch_size=256,
            policy_kwargs=dict(layers=[256, 256, 256]))          

I’m not really understanding about policy_kwargs. I think it is the parameter to setup model or create policy and target TF objects, but i am not sure. This model does not only have one network makes me confuse. May i ask you about the neural network architecture of this env ? If you have time to answer, I hope to figure out how many layers are there and how many neurons in each layer.

Looking forward to your reply, thanks.

Issue Analytics

  • State:closed
  • Created 3 years ago
  • Comments:7 (4 by maintainers)

github_iconTop GitHub Comments

1reaction
eleurentcommented, Aug 5, 2020

No, for this particular task I already had satisfying results with Mlp so I did not try more sophisticated models (plus the observation is quite small and unstructured).

For the intersection environment however, I compared MLP with CNN, as well as an attention-based model, see paper and results (with a runnable colab notebook).

1reaction
eleurentcommented, Aug 3, 2020

env.observation_space defines the inputs of the policy, while env.action_space defines its outputs.

  • env.observation_space is typically a continuous Box, and you can get its dimension by calling env.observation_space.shape
  • env.action_space can be a Box, but also a Discrete space, in which case you can get its size by calling env.action_space.n
Read more comments on GitHub >

github_iconTop Results From Across the Web

Neural-Network-Based Dynamic Distribution Model of Parking ...
A neural-network-based dynamic parking distribution model was developed considering seven influencing factors: driving duration, walking distance, parking fee, ...
Read more >
Machine Learning Technologies for Secure Vehicular ...
One of the vital questions that the researchers need to address is how to effectively handle the privacy of large groups of data...
Read more >
Outside parking Wireless Coverage Best Access Points and ...
Experts, I hope all is well, what would be a recommended Access Point/Antenna to cover a large parking lot. I was thinking a...
Read more >
A Reservation-based Smart Parking System
concerns on traffic congestion and environmental protection. • Walking Distance: Walking distance ... As the middle layer between the sensor network and web....
Read more >
A novel parking-lot intelligent selection algorithm based on ...
This paper aims to meet the requirements for the evaluation and selection of the optimal parking lot near the destination, and establishes a ......
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found