Some questions about the network layer of parking env (SAC+HER)

Hi, I tried running the parking env and got a success rate of over 95%, but I still have some questions about the model structure. The model is as follows:

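import gym
import highway_env  # importing registers the highway-env environments
from stable_baselines import HER, SAC
env = gym.make('parking-v0')  # assumed environment id for the parking env
model = HER('MlpPolicy', env, SAC, n_sampled_goal=4,
            goal_selection_strategy='future',
            verbose=1, buffer_size=int(1e6),
            learning_rate=1e-3, gamma=0.9, batch_size=256,
            policy_kwargs=dict(layers=[256, 256, 256]))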

I don't really understand policy_kwargs. I think it is the parameter used to set up the model, or to create the policy and target TF objects, but I am not sure. What confuses me is that this model has more than one network. May I ask about the neural network architecture for this env? If you have time to answer, I would like to find out how many layers there are and how many neurons are in each layer.

Looking forward to your reply, thanks.

Top GitHub Comments

eleurent commented, Aug 5, 2020

No, for this particular task I already had satisfactory results with an MLP, so I did not try more sophisticated models (plus, the observation is quite small and unstructured).

For the intersection environment however, I compared MLP with CNN, as well as an attention-based model, see paper and results (with a runnable colab notebook).

eleurent commented, Aug 3, 2020

env.observation_space defines the inputs of the policy, while env.action_space defines its outputs.

env.observation_space is typically a continuous Box, and you can get its dimensions by reading env.observation_space.shape.
env.action_space can be a Box, but also a Discrete space, in which case you can get its size by reading env.action_space.n.
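
To relate this to policy_kwargs=dict(layers=[256, 256, 256]) from the question, here is a minimal sketch of how the layer sizes of a plain fully connected network follow from the input size, the hidden layer list, and the output size. The helper mlp_layer_shapes and the values of obs_dim and act_dim are hypothetical placeholders for illustration, not values read from the parking env, and the critic taking the observation and action concatenated is an assumption about a typical SAC setup rather than a statement about the library's internals.

```python
from typing import List, Tuple


def mlp_layer_shapes(input_dim: int, hidden: List[int], output_dim: int) -> List[Tuple[int, int]]:
    """Return the (inputs, outputs) size of each fully connected layer."""
    sizes = [input_dim] + list(hidden) + [output_dim]
    return list(zip(sizes[:-1], sizes[1:]))


# Hypothetical placeholder dimensions; read the real ones from
# env.observation_space and env.action_space.
obs_dim = 18
act_dim = 2
hidden = [256, 256, 256]  # same list as policy_kwargs=dict(layers=[256, 256, 256])

# Actor (simplified): observation in, one output per action dimension.
# A real SAC actor also outputs a log standard deviation per action dimension.
actor = mlp_layer_shapes(obs_dim, hidden, act_dim)

# Critic (assumed): scores a concatenated observation-action pair with one value.
critic = mlp_layer_shapes(obs_dim + act_dim, hidden, 1)

for name, shapes in (("actor", actor), ("critic", critic)):
    print(name, shapes)
```

Under these assumptions, each network has three hidden layers of 256 units; only the first and last layers change with the environment's observation and action sizes.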
