[rllib] Vectorization & multi-agent are broken in DDPG (both TF and Torch)
What is the problem?
./train.py --run=DDPG --env=MountainCarContinuous-v0 --config='{"num_envs_per_worker": 2}'
and
./train.py --run=DDPG --env=MountainCarContinuous-v0 --config='{"num_envs_per_worker": 2}' --torch
both currently crash. The issue seems to be that while the input to compute_actions() is a batch of N observations, only 1 action is returned as output. I ran into this while debugging a hang in a multi-agent test case (hung due to a bug in the env triggered by vectorization returning 1 action instead of N actions).
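To make the suspected contract violation concrete, here is a minimal sketch of the check that a vectorized sampler implicitly relies on: N observations in, N actions out. It does not use RLlib itself. `check_batched_actions`, `broken_policy` and `batched_policy` are hypothetical names made up for illustration, and the "broken" policy only mimics the symptom described above, not the actual DDPG code path.

```python
from typing import Callable, List, Sequence

Observation = Sequence[float]
Action = List[float]
PolicyFn = Callable[[Sequence[Observation]], Sequence[Action]]


def check_batched_actions(
    compute_actions: PolicyFn, obs_batch: Sequence[Observation]
) -> List[Action]:
    """Call a policy on a batch and verify one action came back per observation."""
    actions = list(compute_actions(obs_batch))
    if len(actions) != len(obs_batch):
        raise ValueError(
            f"expected {len(obs_batch)} actions for {len(obs_batch)} "
            f"observations, got {len(actions)}"
        )
    return actions


def broken_policy(obs_batch: Sequence[Observation]) -> List[Action]:
    # Hypothetical stand-in for the reported symptom: the batch dimension
    # is dropped and a single action is returned regardless of batch size.
    return [[0.0]]


def batched_policy(obs_batch: Sequence[Observation]) -> List[Action]:
    # Hypothetical well-behaved policy: one (dummy) action per observation.
    return [[0.0] for _ in obs_batch]
```

With two observations (as with `num_envs_per_worker: 2`), `check_batched_actions(batched_policy, obs)` returns two actions, while `check_batched_actions(broken_policy, obs)` raises `ValueError`. A batch of size 1 passes either way, which may be why the problem only shows up once vectorization is enabled.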
Issue Analytics
- State:
- Created 3 years ago
- Comments:6 (6 by maintainers)
Top GitHub Comments
git bisect run script.sh
script:
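The script body from this comment is not shown above. As a rough illustration only, and not the maintainer's actual script, a check for `git bisect run` could look like the Python sketch below. It relies on the exit codes `git bisect run` documents: 0 marks the commit good, 1 to 127 (except 125) marks it bad, and 125 skips the commit. The function names, the timeout value, and the choice to treat a hang as "bad" are all assumptions.

```python
import subprocess
import sys

# Exit codes understood by `git bisect run`.
GOOD, BAD, SKIP = 0, 1, 125


def bisect_exit_code(returncode: int) -> int:
    """Map a child process return code to a bisect verdict."""
    # Any non-zero code (including negative ones from signals) counts as bad.
    return GOOD if returncode == 0 else BAD


def run_check(cmd, timeout_s: float) -> int:
    """Run the reproduction command and translate the outcome for git bisect."""
    try:
        completed = subprocess.run(cmd, timeout=timeout_s)
    except subprocess.TimeoutExpired:
        # Assumption: a hang is a symptom of the bug, so count it as bad.
        return BAD
    except OSError:
        # Command missing or not executable at this commit: cannot test it.
        return SKIP
    return bisect_exit_code(completed.returncode)


if __name__ == "__main__" and len(sys.argv) > 1:
    # The reproduction command is passed on the command line.
    sys.exit(run_check(sys.argv[1:], timeout_s=600))
```

Saved under a hypothetical name such as `bisect_check.py`, it would be invoked as `git bisect run python bisect_check.py <reproduction command>`. The reproduction command has to terminate on its own at a good commit, otherwise the timeout would wrongly mark that commit as bad.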
Leaving this open until merged.