How to get expert data ?
See original GitHub issueI have read all the issues. Thanks for responding to them. I had a query regarding extracting expert data from the replay buffer. As you’ve specified in one of the issues, due to the size of the data, 50M datapoints from the game have been split into 50 files of 1M datapoints each. I wanted to ask, does this mean that the last 10 files in [GAME_NAME]/1/replay_logs/
(buffer files ending (having suffix) with 41, 42…50) represent expert behavior while the first buffer files (ending with 1, 2,…10) represent beginner level performance ? I tried searching about my question and found this in the AI blog For example, the first k million frames from the DQN replay dataset emulate exploration data with suboptimal returns while the last k million frames are analogous to near-expert data with stochasticity
. So going by what is being said, by considering the buffer files that end with 40-50, is it safe to assume that I’m extracting expert level behavior ?
Issue Analytics
- State:
- Created a year ago
- Comments:5
Yes, the last 10 files correspond to the last 40M frames seen by a DQN agent trained for 200M frames. For expert data, I’d simply load the last buffer. For example, see this paper which did expert data selection on the Atari datasets: https://openreview.net/forum?id=AP1MKT37rJ
Also,
checkpoint_49
can also be easily extracted from the tfds version of this datasets (easy to plug and play): https://colab.research.google.com/github/google-research/rlds/blob/main/rlds/examples/tfds_rlu_atari.ipynbThanks again for clarifying things up.