Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Why I can't converge the network when I use your default setting(terminal: python run_atari_dqn.py)? #6

Open
nizhihao opened this issue Feb 7, 2021 · 5 comments

Comments

@nizhihao
Copy link

nizhihao commented Feb 7, 2021

Hello, first of all thanks for sharing your amazing job. I want to know Why I can't converge the network when I use your default setting(terminal: python run_atari_dqn.py)? the average 100 reward value always in about -20.9, which can't increase as the time-step increase. And Can you share your code about how to draw good pictures like you folder ./imgs/ pictures. Thank your very much!!

@BY571
Copy link
Owner

BY571 commented Feb 7, 2021

You are running PongNoFrameskip-v4? and for how long did you let it train?

I have another repository for how to do the plottings: https://github.com/BY571/Tensorboard-Plotting

@nizhihao
Copy link
Author

nizhihao commented Feb 8, 2021

Yeah, it's the PongNoFrameskip-v4, I use the default parameters, because of I use python==2.7 and torch==1.0.0, I only modify the function random.choices() in dqn_agent.py. And I have train the code with default frame==5e6. The average100 reward graph I get from tensorboard is put up:
image

@BY571
Copy link
Owner

BY571 commented Feb 8, 2021

I checked yesterday and noticed the same. for CartPole-v0 and LunarLander-v2 it converges nicely. But as you noticed for Pong it was struggling. I'll have a look later might be something with the wrappers of the environment. Once I found out whats the problem I come back to you :)

@nizhihao
Copy link
Author

nizhihao commented Feb 8, 2021

Em, Ok , Thanks!Let me try the CartPole-v0 and LunarLander-v2 environment firstly, and then try others. thanks for your answer :)

@BY571
Copy link
Owner

BY571 commented Feb 22, 2021

how did your runs work out for cartpole and lunarlander? sry i was quite busy and couldn't dig deeper in the problem yet

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants