Why is dqn playing breakout so bad? I trained 400 epochs and the final result did not exceed 6 points. #54

Closed
invisilmk opened this issue May 25, 2020 · 9 comments · Fixed by #170 or #187
Labels
bug Something isn't working

Comments

@invisilmk

invisilmk commented May 25, 2020

I trained for 400 epochs and the final score never exceeded 6 points. Which parameters need to be adjusted?

@Trinkle23897
Collaborator

Trinkle23897 commented May 25, 2020

Could you please share your configuration (e.g. network architecture, hyperparameters, ...)?

@Trinkle23897 Trinkle23897 added the question Further information is requested label May 25, 2020
@invisilmk
Author

I ran the original example code unchanged, except that I increased training to 400 epochs.

@Trinkle23897
Collaborator

That's on us: we haven't yet had time to add a tuned example to the examples folder.
For now, the better solution is to change the example code's hyperparameters and network architecture to match the settings in the DQN paper.
See #31 (comment) for more details.
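
For reference, here is a minimal sketch of what "the DQN paper's setting" could look like, assuming the Nature 2015 version of the paper (Mnih et al.) is the one meant. The dictionary keys and the helpers `conv_out` and `flattened_features` are illustrative names, not Tianshou's API; the values are the ones listed in that paper. The helper only checks that the three convolution layers reduce an 84x84 input to the 64x7x7 feature map that feeds the 512-unit hidden layer.

```python
# Sketch only: values follow the Nature 2015 DQN paper (an assumption about
# which "DQN paper" is meant); all names here are illustrative, not Tianshou's API.

DQN_HYPERPARAMS = {
    "batch_size": 32,                 # minibatch size
    "replay_buffer_size": 1_000_000,  # most recent frames kept for replay
    "frame_stack": 4,                 # agent history length
    "target_update_freq": 10_000,     # parameter updates between target syncs
    "gamma": 0.99,                    # discount factor
    "action_repeat": 4,               # each action is repeated for 4 frames
    "update_every": 4,                # actions selected between SGD updates
    "learning_rate": 0.00025,         # used with RMSProp in the paper
    "eps_start": 1.0,                 # initial exploration rate
    "eps_final": 0.1,                 # final exploration rate
    "eps_decay_frames": 1_000_000,    # frames over which epsilon is annealed
    "replay_start_size": 50_000,      # random-policy frames before learning
}

# Convolution stack as (out_channels, kernel_size, stride), no padding.
CONV_LAYERS = [(32, 8, 4), (64, 4, 2), (64, 3, 1)]
HIDDEN_UNITS = 512  # fully connected layer before the per-action outputs


def conv_out(size, kernel, stride):
    """Spatial size after an unpadded convolution."""
    return (size - kernel) // stride + 1


def flattened_features(input_size=84):
    """Number of features entering the fully connected layer."""
    size = input_size
    channels = DQN_HYPERPARAMS["frame_stack"]
    for channels, kernel, stride in CONV_LAYERS:
        size = conv_out(size, kernel, stride)
    return channels * size * size


if __name__ == "__main__":
    print(flattened_features())  # 64 * 7 * 7
```

If the flattened size in your own network does not match this number for an 84x84x4 input, the architecture differs from the paper's.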

@Trinkle23897
Collaborator

@Mehooz

@invisilmk
Author

Thanks a lot.

@duburcqa
Collaborator

@Trinkle23897 Closing ?

@Trinkle23897
Collaborator

> @Trinkle23897 Closing ?

Nope. It hasn’t been finished yet.

@davide97l

I'm waiting for this too.

@Trinkle23897 Trinkle23897 linked a pull request Jul 29, 2020 that will close this issue
@Trinkle23897 Trinkle23897 linked a pull request Aug 19, 2020 that will close this issue
@Trinkle23897
Collaborator

Trinkle23897 commented Aug 19, 2020

Here it is: we now provide a DQN script that can solve Pong in 1M environment steps (about 2000 seconds of wall time on my computer). See #187.
The main issue with the previous version was that it didn't use the wrapper provided by DeepMind, which turns out to be crucial.
Many thanks to @davide97l.
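
To illustrate why the wrapper matters, here is a rough sketch of three transformations described in the DQN paper's preprocessing: repeating each action while taking a pixel-wise max over the last two raw frames, clipping rewards to [-1, 1], and stacking the most recent frames into one observation. This is not the code from #187 and not any library's API. `ToyEnv`, `PreprocessedEnv` and their methods are hypothetical stand-ins, and frames are plain lists of numbers so the example runs without dependencies.

```python
from collections import deque


class ToyEnv:
    """Hypothetical stand-in for an Atari emulator; a frame is a flat list of ints."""

    def __init__(self, episode_length=20):
        self.episode_length = episode_length
        self.t = 0

    def reset(self):
        self.t = 0
        return [0, 0, 0, 0]

    def step(self, action):
        self.t += 1
        frame = [self.t, 0, self.t % 2, 0]        # third "pixel" flickers
        reward = 7.0 if self.t % 5 == 0 else 0.0  # large, sparse raw reward
        done = self.t >= self.episode_length
        return frame, reward, done


class PreprocessedEnv:
    """Illustrative wrapper: action repeat + max-pool, reward clip, frame stack."""

    def __init__(self, env, skip=4, stack=4):
        self.env = env
        self.skip = skip
        self.frames = deque(maxlen=stack)

    def reset(self):
        frame = self.env.reset()
        # Fill the stack with copies of the first frame.
        for _ in range(self.frames.maxlen):
            self.frames.append(frame)
        return list(self.frames)

    def step(self, action):
        total, done = 0.0, False
        last_two = deque(maxlen=2)
        for _ in range(self.skip):                # repeat the same action
            frame, reward, done = self.env.step(action)
            last_two.append(frame)
            total += reward
            if done:
                break
        # Pixel-wise max over the last two raw frames removes flicker.
        pooled = [max(pixels) for pixels in zip(*last_two)]
        self.frames.append(pooled)
        clipped = max(-1.0, min(1.0, total))      # clip reward to [-1, 1]
        return list(self.frames), clipped, done


if __name__ == "__main__":
    env = PreprocessedEnv(ToyEnv())
    obs = env.reset()
    obs, reward, done = env.step(0)
    print(len(obs), obs[-1], reward, done)
```

Without the stacking step the agent sees a single frame and cannot infer the ball's direction, and without clipping the reward scale differs from game to game, which makes one set of hyperparameters hard to reuse.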

@Trinkle23897 Trinkle23897 added bug Something isn't working and removed question Further information is requested labels Aug 19, 2020

4 participants