
Discussion about future plans after deprecation of OpenAI Gym


❓ Questions and Help

Should we decouple CompilerGym from (past) openai/gym?

The team that has been maintaining Gym since 2021 has moved all future development to Gymnasium, a drop-in replacement for Gym (import gymnasium as gym), and this repo isn’t planned to receive any future updates. Please consider switching over to Gymnasium as you’re able to do so. https://github.com/openai/gym#important-notice
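
For reference, the drop-in usage the notice describes is just an import alias. A minimal sketch (using a classic control env rather than CompilerGym):

import gymnasium as gym

env = gym.make("CartPole-v1")
obs, info = env.reset()  # new API: reset() returns (observation, info)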

Additional Context

I assume you are already aware of this: Gym’s maintainers have been making breaking API changes.

def step(self, action: ActType) -> Tuple[ObsType, float, bool, bool, dict]:

https://github.com/Farama-Foundation/Gymnasium/blob/1956e648f951eb64c45440997f8fe484ef3c7138/gymnasium/core.py#L71

I reckon the changes are quite reasonable, but the previous API (returning observation, reward, done, info) served as a fairly rigorous protocol for reinforcement learning.
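
To make the break concrete, here is a minimal bridging sketch (the wrapper name is hypothetical, not a Gym or CompilerGym class) that adapts an old-style env so step() returns the new 5-tuple:

import gym

class OldToNewStepAPI(gym.Wrapper):
    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        # Old gym's TimeLimit wrapper flags truncation via `info`;
        # everything else counts as termination.
        truncated = info.get("TimeLimit.truncated", False)
        terminated = done and not truncated
        return obs, reward, terminated, truncated, info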

Is there already an ongoing discussion regarding this?

TL;DR: Most of CompilerGym’s inheritance from openai/gym was for typing rather than functionality; openai/gym doesn’t really have much functionality itself, so we could easily define our own Space, Env, etc.

However, if we decouple, we might conflict with the many RL libraries that depend on openai/gym. For example, ray[rllib] does strict type checking such as isinstance(obs_space, gym.spaces.Space).
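
To illustrate that conflict (OurSpace is hypothetical, standing in for a decoupled CompilerGym space type):

import gym.spaces

class OurSpace:  # decoupled: does NOT inherit from gym.spaces.Space
    pass

obs_space = OurSpace()
print(isinstance(obs_space, gym.spaces.Space))  # False -> RLlib-style checks reject it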

I hope we (CompilerGym users) can discuss this here to help the FAIR team.

Issue Analytics

  • State: open
  • Created: 10 months ago
  • Comments: 5 (5 by maintainers)

Top GitHub Comments

1 reaction
ChrisCummins commented, Nov 30, 2022

Hi Anthony,

Thanks for the extra pointers. I’m in no rush to make preemptive changes, especially if those are compatibility breaking, so I’ll keep an eye on rllib developments.

Is there something I should be aware of?

No, you’ll be fine with the code snippet you posted. What I meant was that for things like vectorized environments or async environments, we may be able to implement them more efficiently as extensions to the C++ layer and then expose the functionality through new APIs. This wouldn’t break any existing code.
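
For comparison, here is what batching looks like today from pure Python with gym’s stock vector wrapper, not any proposed CompilerGym API (just a sketch; the per-env subprocesses it spawns may run into the pickling issue you mentioned in #304):

import compiler_gym
import gym.vector

# One worker subprocess per environment; a C++-level implementation
# could avoid this per-environment process overhead.
vec_env = gym.vector.AsyncVectorEnv(
    [lambda: compiler_gym.make("llvm-ic-v0") for _ in range(4)]
)
observations = vec_env.reset()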

Cheers, Chris

0 reactions
anthony0727 commented, Nov 30, 2022

There’s no specific issue thread for this, but looking at the commit history at https://github.com/ray-project/ray/commits/releases/2.2.0/rllib, it seems like it will take some time.

This is a chat message from @sven1977:

We are in the process of updating RLlib to be fully compliant with the new gymnasium (or gym>=0.26) APIs. This will include an updated RLlib pettingzoo adapter. It’s a massive undertaking but will hopefully make it into Ray 2.2 (fingers crossed that we get all failing tests to pass by then 🙂 )

Additionally, I’m getting back to compiler optimization with ProGraML, and I remember the async env working seamlessly (though I encountered an “unable to pickle lock object” issue with cBench, regarding https://github.com/facebookresearch/CompilerGym/issues/304).

Is there something I should be aware of? This refers to the following line from your reply:

Would unlock more aggressive changes to CompilerGym APIs for things like performance (e.g. async / batched envs).

My env would be made async from this env:

import compiler_gym
from compiler_gym.wrappers import (
    CommandlineWithTerminalAction,
    RandomOrderBenchmarks,
)
from ray.tune.registry import register_env

# RllibWrapper and ENV_ID are defined elsewhere in my code.

def register_all(config, model_cls=None):
    def make_env(dummy):
        # RLlib passes a per-worker env config here; we ignore it and
        # build from our own `config` instead.
        env = compiler_gym.make('llvm-ic-v0', **config.env_config)
        # Collect training benchmark URIs from three datasets.
        train_benchmarks = []
        for i in ['cbench-v1', 'npb-v0', 'chstone-v0']:
            train_benchmarks.extend(list(env.datasets[i].benchmark_uris()))
        env = CommandlineWithTerminalAction(env)
        env = RandomOrderBenchmarks(env, train_benchmarks)
        env = RllibWrapper(env)
        # env = TimeLimit(env, max_episode_steps=128)
        return env

    register_env(ENV_ID, make_env)
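
To be clear about where the async part comes from: my assumption is that it is enabled on the RLlib side with its standard config keys, not by anything in make_env itself:

# Sketch of the RLlib trainer config I have in mind (pre-2.x dict style).
config = {
    "env": ENV_ID,
    "num_workers": 4,            # parallel rollout workers
    "num_envs_per_worker": 2,    # batched envs per worker
    "remote_worker_envs": True,  # step each env in its own Ray actor (async)
}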

Thanks! Anthony


