[question] Why are observation and achieved_goal the same in bit flipping env?
See original GitHub issueIn the observation dictionary returned from the bit flipping env, observation
and achieved_goal
are the same:
Aren’t all of the dictionary elements {observation, achieved_goal, desired_goal}
fed to the neural network as input? Since observation
and achieved_goal
are the same, isn’t the neural network being fed the same input twice? Wouldn’t it be better for observation
to be an empty list then?
Or do the neural network inputs consist only of observation
and desired_goal
?
Issue Analytics
- State:
- Created 4 years ago
- Reactions:1
- Comments:5
Top Results From Across the Web
stable_baselines3.common.envs.bit_flipping_env
[docs]class BitFlippingEnv(Env): """ Simple bit flipping env, useful to test HER. The goal is to flip all the bits to get a vector...
Read more >Bias-Reduced Hindsight Experience Replay with Virtual ...
Consider the bit-flipping environment, in- troduced in [9] with few modifications. In the bit-flipping environment, the state- and action spaces are.
Read more >Randomized algorithms for matrices and data
Numerous examples illustrating these observations will be ... A are equal to the diagonal elements of the projection matrix onto the span of...
Read more >A functional approach to movement analysis and error ...
As a matter of course, the same is true if rules are not broken but ... the question of what a certain movement...
Read more >Quantum Computation and Quantum Information
8.3.3 Bit flip and phase flip channels ... questions combining physics, computer science, and information theory. ... is to be achieved (goal 3)....
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
I’m closing this issue as the initial question was answered.
for the reason I mentioned before. It is more for correctness (e.g. if you use another implementation of HER). In practice, with the current version of SB, it does not matter.
Exactly, so why is a) only correct when the object is always at the same place? Since
achieved_goal
is fed to the agent, it doesn’t matter if the object is always at the same place or not, right?