[question] VecNormalize with hyper parameter tuning
See original GitHub issueHi, I’m trying to do some hyper parameter tuning, but am getting this error:
ValueError("Trying to set venv of already initialized VecNormalize wrapper.")
How am I supposed to use the normalization statistics from the training phase during validation without saving it to file during every trial?
def objective_fn(trial):
model_params = optimize_ppo(trial)
train_env, validation_env = initialize_envs()
norm_env = VecNormalize(train_env, norm_obs=True, norm_reward=True, training=True)
model = PPO(policy,
norm_env,
device=device,
**model_params)
train_maxlen = len(train_env.get_attr('df')[0].index) - 1
try:
model.learn(train_maxlen)
except Exception as error:
print(error)
raise optuna.structs.TrialPruned()
norm_env.set_venv(validation_env)
norm_env.training = False
norm_env.norm_reward = False
mean_reward, _ = evaluate_policy(model, norm_env, n_eval_episodes=5)
if mean_reward == 0:
raise optuna.structs.TrialPruned()
return -mean_reward
Issue Analytics
- State:
- Created 2 years ago
- Comments:6
Top Results From Across the Web
What does the hyperparameter "normalize" refer to in PPO? #64
e.g. I see A2C tunes the normalize_advantage parameter, but that's not a hyperparameter for PPO. PPO has a boolean to normalize_image, but don't ......
Read more >Reinforcement Learning Tips and Tricks - Stable Baselines
Read about RL and Stable Baselines · Do quantitative experiments and hyperparameter tuning if needed · Evaluate the performance using a separate test...
Read more >HyperParameter Tuning, Batch Normalization - Kaggle
This problem is known as covariant shift where there is a shift in the data ... Hyperparameter tuning: methods to search for optimal...
Read more >Hyperparameter tuning - GeeksforGeeks
The aim of this article is to explore various strategies to tune hyperparameters for Machine learning models.
Read more >03_hyperparameter-tuning-batch-normalization-and ...
In this video, I want to share with you some guidelines, some tips for how to systematically organize your hyperparameter tuning process, ...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
You will find your answer here 😉: https://github.com/DLR-RM/stable-baselines3/issues/473 (we have a
sync_env_normalization
for that.https://github.com/DLR-RM/stable-baselines3/blob/75b6f3b3b0f207456d9dcac2c6e86e8e2a22115f/stable_baselines3/common/vec_env/__init__.py#L59-L72
Thank you both very much!