Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How many training steps used to obtain the pre-trained model? #70

Closed
xinghua-qu opened this issue Mar 19, 2020 · 1 comment
Closed

How many training steps used to obtain the pre-trained model? #70

xinghua-qu opened this issue Mar 19, 2020 · 1 comment

Comments

@xinghua-qu
Copy link

Is there any document illustrating how many training steps used to obtain the pre-trained model? Some pretrained model seems far less than the start-of-the-art. For instance, the dqn model on BeamRider and Qbert only achieve 948.0 and 550.0. However, using other policies (e.g., PPO2 and ACKTR), such reward values could be 10,000+.
It would be better if you can provide these pre-trained models as a trustworthy baseline for benchmarking.

@araffin araffin closed this as completed Mar 19, 2020
@araffin araffin reopened this Mar 19, 2020
@araffin
Copy link
Owner

araffin commented Mar 19, 2020

duplicate of #38

@araffin araffin closed this as completed Mar 19, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants