Hyper-parameters for successful DQN Agent
Hi @marload,
Great repository you have here 😄! I am running your DQN script and trying to solve CartPole with it (i.e., consistently get a score >200).
I ran the script with the default parameters, but the agent is having trouble learning a successful policy: all I get are scores fluctuating between 10 and 100 over the first 800 episodes of training. There was one episode with a score >200, but it came early in training, and given that eps (the exploration rate) would still have been very high at that point, I think it must have been due to chance.
So my question is: if you have trained a successful agent with this algorithm, can you share the "working" parameters? Or is DQN just unstable by nature, and I should run the script a couple more times and hope for something better?
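For reference, here is a sketch of the kind of hyper-parameter set that is commonly reported to solve CartPole with vanilla DQN. All names and values below are illustrative assumptions on my part, not parameters taken from this repository:

```python
# Hypothetical "working" DQN hyper-parameters for CartPole.
# Illustrative values only -- not from marload's script.
HPARAMS = {
    "gamma": 0.99,          # discount factor
    "lr": 1e-3,             # Adam learning rate
    "batch_size": 64,       # minibatch size sampled from the replay buffer
    "buffer_size": 50_000,  # replay buffer capacity
    "eps_start": 1.0,       # initial exploration rate
    "eps_end": 0.01,        # floor for the exploration rate
    "eps_decay": 0.995,     # multiplicative decay applied once per episode
    "target_update": 100,   # steps between target-network syncs
}

def epsilon_at(episode: int) -> float:
    """Exploration rate after `episode` multiplicative decay steps."""
    eps = HPARAMS["eps_start"] * (HPARAMS["eps_decay"] ** episode)
    return max(HPARAMS["eps_end"], eps)
```

With a per-episode decay of 0.995, eps stays above 0.6 for the first ~100 episodes, which is why a single early >200 episode is plausibly just lucky exploration rather than a learned policy.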
I have not reviewed the code thoroughly because I wanted to see it working first, but at first glance it looks clean and simple.
Anyway, thanks for posting it on Reddit, not sure why it was deleted. I hope I can learn a thing or two from it since I am working on something similar at the moment. 😄
Have a great day!
Issue Analytics
- State:
- Created 4 years ago
- Reactions: 1
- Comments: 5 (3 by maintainers)
Thx 😃
Sorry for the late reply! I have made DQN and DRQN work correctly, and the code has also been restructured into a better architecture. Thank you for waiting!