
[Question] Error after loading model with FinRL


Question

I was trying out the FinRL library, which uses stable-baselines3 under the hood. I encountered the error below on a model.learn() call for PPO (after loading the model). I have asked this question on FinRL here, but didn't get any response there, possibly because it's more related to stable-baselines3 (at least that's what I can deduce from the stack trace below).

---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-94-53a21a75c23f> in <module>
----> 1 model.learn(total_timesteps = 20000, 
      2             eval_env = env_trade,
      3             eval_freq = 250,
      4             log_interval = 1,
      5             tb_log_name = '1_18_lastrun',

~/.local/share/virtualenvs/myvenv/lib/python3.8/site-packages/stable_baselines3/ppo/ppo.py in learn(self, total_timesteps, callback, log_interval, eval_env, eval_freq, n_eval_episodes, tb_log_name, eval_log_path, reset_num_timesteps)
    278     ) -> "PPO":
    279 
--> 280         return super(PPO, self).learn(
    281             total_timesteps=total_timesteps,
    282             callback=callback,

~/.local/share/virtualenvs/myvenv/lib/python3.8/site-packages/stable_baselines3/common/on_policy_algorithm.py in learn(self, total_timesteps, callback, log_interval, eval_env, eval_freq, n_eval_episodes, tb_log_name, eval_log_path, reset_num_timesteps)
    245                 logger.dump(step=self.num_timesteps)
    246 
--> 247             self.train()
    248 
    249         callback.on_training_end()

~/.local/share/virtualenvs/myvenv/lib/python3.8/site-packages/stable_baselines3/ppo/ppo.py in train(self)
    189                     self.policy.reset_noise(self.batch_size)
    190 
--> 191                 values, log_prob, entropy = self.policy.evaluate_actions(rollout_data.observations, actions)
    192                 values = values.flatten()
    193                 # Normalize advantage

~/.local/share/virtualenvs/myvenv/lib/python3.8/site-packages/stable_baselines3/common/policies.py in evaluate_actions(self, obs, actions)
    619         """
    620         latent_pi, latent_vf, latent_sde = self._get_latent(obs)
--> 621         distribution = self._get_action_dist_from_latent(latent_pi, latent_sde)
    622         log_prob = distribution.log_prob(actions)
    623         values = self.value_net(latent_vf)

~/.local/share/virtualenvs/myvenv/lib/python3.8/site-packages/stable_baselines3/common/policies.py in _get_action_dist_from_latent(self, latent_pi, latent_sde)
    581 
    582         if isinstance(self.action_dist, DiagGaussianDistribution):
--> 583             return self.action_dist.proba_distribution(mean_actions, self.log_std)
    584         elif isinstance(self.action_dist, CategoricalDistribution):
    585             # Here mean_actions are the logits before the softmax

~/.local/share/virtualenvs/myvenv/lib/python3.8/site-packages/stable_baselines3/common/distributions.py in proba_distribution(self, mean_actions, log_std)
    150         """
    151         action_std = th.ones_like(mean_actions) * log_std.exp()
--> 152         self.distribution = Normal(mean_actions, action_std)
    153         return self
    154 

~/.local/share/virtualenvs/myvenv/lib/python3.8/site-packages/torch/distributions/normal.py in __init__(self, loc, scale, validate_args)
     48         else:
     49             batch_shape = self.loc.size()
---> 50         super(Normal, self).__init__(batch_shape, validate_args=validate_args)
     51 
     52     def expand(self, batch_shape, _instance=None):

~/.local/share/virtualenvs/myvenv/lib/python3.8/site-packages/torch/distributions/distribution.py in __init__(self, batch_shape, event_shape, validate_args)
     51                     continue  # skip checking lazily-constructed args
     52                 if not constraint.check(getattr(self, param)).all():
---> 53                     raise ValueError("The parameter {} has invalid values".format(param))
     54         super(Distribution, self).__init__()
     55 

ValueError: The parameter loc has invalid values

Can someone give me a hint, or point me in the right direction as to what might have caused this error?

Checklist

  • I have read the documentation (required)
  • I have checked that there is no similar issue in the repo (required)

Issue Analytics

  • State: closed
  • Created: 2 years ago
  • Reactions: 1
  • Comments: 5
Top GitHub Comments

3 reactions
DishinGoyani commented, May 9, 2021

NaN values passed to Normal can cause this ValueError. My guess is that during training the model or policy is returning NaN in mean_actions, which eventually gets passed into Normal, so the exception occurs.
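For context, the check that raises this error can be reproduced directly in PyTorch: constructing a Normal distribution whose loc (mean) contains NaN fails parameter validation, just as in the traceback above. A minimal sketch (the tensor values are made up for illustration):

```python
import torch as th
from torch.distributions import Normal

# Suppose a NaN has crept into the policy's mean_actions
mean_actions = th.tensor([float("nan"), 0.0])
log_std = th.zeros(2)

# Mirrors DiagGaussianDistribution.proba_distribution() in stable-baselines3
action_std = th.ones_like(mean_actions) * log_std.exp()

try:
    # validate_args=True forces the parameter check seen in the traceback
    Normal(mean_actions, action_std, validate_args=True)
except ValueError as e:
    print(e)  # mentions that parameter "loc" has invalid values
```

So the error message itself is just the messenger; the real problem is upstream, wherever the NaN entered the network's inputs or weights.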

1 reaction
SolaWeng commented, Aug 30, 2021

I had a similar issue with a custom environment. It turns out that I had NaN in my observations. Here is the wrapper from stable-baselines3 to check where the NaN comes from:

from stable_baselines3.common.vec_env import VecCheckNan
env = VecCheckNan(env, raise_exception=True)

Also, this page from the original stable-baselines docs shows some possible causes of this issue: https://stable-baselines.readthedocs.io/en/master/guide/checking_nan.html
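Besides the VecCheckNan wrapper, a quick sanity check on the raw observation data before training often finds the culprit directly. A minimal sketch (the find_bad_entries helper and sample values are hypothetical, not part of FinRL or stable-baselines3):

```python
import numpy as np

def find_bad_entries(obs):
    """Return indices of observation entries that are NaN or infinite."""
    obs = np.asarray(obs, dtype=np.float64)
    return np.flatnonzero(~np.isfinite(obs))

# Hypothetical observation vector with a NaN and an inf slipped in
obs = np.array([1.0, np.nan, 3.0, np.inf])
print(find_bad_entries(obs))  # [1 3]
```

Running a check like this over the preprocessed market data (e.g. after indicator computation, where division by zero is common) can locate bad rows before they ever reach the policy network.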


