Discussion about future plans after deprecation of OpenAI Gym
See original GitHub issue❓ Questions and Help
Should we decouple CompilerGym from (past) openai/gym?
The team that has been maintaining Gym since 2021 has moved all future development to Gymnasium, a drop in replacement for Gym (import gymnasium as gym), and this repo isn’t planned to receive any future updates. Please consider switching over to Gymnasium as you’re able to do so. https://github.com/openai/gym#important-notice
Additional Context
I assume you are already aware of it, gym’s maintainer has been making breaking API changes.
def step(self, action: ActType) -> Tuple[ObsType, float, bool, bool, dict]:
I reckon the changes are quite reasonable, but previous API(returning observation, reward, done, info) served as quite rigorous protocol for reinforcement learning.
Is there already on-going discussion regarding this?
TL;DR
Most inheritance to CompilerGym from openai/gym was for typing,
rather than it’s functionality - openai/gym doesn’t really have much functionality itself.
So we can easily define our own Space
, Env
, etc
However, if we decouple, many RL libraries depend on openai/gym and might conflict
ray[rllib]
do strong type checking such as isinstance(obs_space, gym.space.Space)
I hope we(CompilerGym users) do discussion here to help FAIR team.
Issue Analytics
- State:
- Created 10 months ago
- Comments:5 (5 by maintainers)
Top GitHub Comments
Hi Anthony,
Thanks for the extra pointers. I’m in no rush to make preemptive changes, especially if those are compatibility breaking, so I’ll keep an eye on rllib developments.
No, you’ll be fine with the code snipped you posted. What I meant was that for things like vectorized environments or async environments, we may be able to more efficiently implement them as extensions to the C++ layer and then expose the functionality through new APIs. This wouldn’t break any existing code.
Cheers, Chris
There’s no specific issue thread for this, but
Looking at commit history from https://github.com/ray-project/ray/commits/releases/2.2.0/rllib
seems like it would take some time
This is chat from @sven1977
Additionally, I’m getting back to compiler optimization with Programl, and I remember async env worked seamlessly, (though I encountered some unable to pickle lock object issue with
cbench
regarding https://github.com/facebookresearch/CompilerGym/issues/304)Is there something I should be aware of? from your reply
My env would be async-ed from this env
Thanks! Anthony