Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

[Enhancement] Support for fallbacks when loading cloudpickle items fail

See original GitHub issue

Related issue #171

As seen in #171, cloudpickle/pickle can easily fail when transferring models between Python versions (and in case some other shenanigans). There is currently no way to address this issue as loading of model tries to un-pickle all pickle objects and does not catch errors.

Two suggestions to deal with this, both of which would be nice to have:

Provide custom_objects dictionary (similar to what we had in SB2), which will override any objects read from file. This could be used to even prevent loading the item from the file, in which case all JSON-ed items can be used as before.
Update set_parameters function to only load parameters from the file but not the data objects (which include pickled items). This was included in SB2.

@ManifoldFR If you had other suggestions please feel free to throw them here 😃

Issue Analytics

State:
Created 3 years ago
Comments:5 (3 by maintainers)

Top GitHub Comments

1reaction

ManifoldFRcommented, Oct 1, 2020

From what I saw the problem I encountered is with the learning rate and clip ratio schedulers (for PPO), in the particular case where a constant scheduler is created using constant_fn (I don’t know about other cases though) because pickling/unpickling functions is brittle especially when they are locals of a higher-order functon. I think we can look at how PyTorch supports saving state dicts for schedulers. From the source I see they use classes and the __dict__ attribute to save/load state dicts.

In general I think the ideas for other fallbacks are good because it allows the user more flexibility.

0reactions

araffincommented, Mar 6, 2021

as a follow-up, I added support for custom objects here: https://github.com/DLR-RM/stable-baselines3/pull/336 and included that in the rl zoo: https://github.com/DLR-RM/rl-baselines3-zoo/pull/69 (so models trained with python 3.6/3.7 can be loaded with python 3.8+, but not retrained yet)