Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Optimization of some parameters for tictactoe. #169

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

AdrianAcala
Copy link

Here's some optimizations of for tictactoe. I had 12 threads on my CPU so I had 12 workers, but otherwise I tested it multiple times and provided great results.

CC: @theword / @JohnPPP

First run
image

Second run
image

@AdrianAcala
Copy link
Author

Let me know if you want me to also include the best model I can create.

@werner-duvaud
Copy link
Owner

werner-duvaud commented Aug 6, 2021

Hi Adrian,

Thanks!

Are your plots showing the results against the expert opponent or against the random opponent?
It looks really good.

The initial learning rate might be a bit high, I got sometimes some NaN at the beginning but when it avoids the NaN during the first 500 games, it seems pretty robust in its progress then.

Maybe adding an option to pre-fill the replay buffer with random games should help avoiding these NaNs at the start.

If you have not too bad weights it would be great to add them to the repo too!

@AdrianAcala
Copy link
Author

@werner-duvaud , the results are against the expert opponent.

Yeah. The learning rate is quite high and I tried several times to test to see if I ran into NaN which I didn't, but if you did, then we'll need to test a bit. One of the articles I read, mentioned having a periodic learning rate was helpful. In this case, it would also make a lot of sense. Maybe I can incorporate that as well as a feature flag.

My weights were really good. One time I tried playing it and the game ended abruptly. At first, I thought there was a bug, then I realized I lost. 😆

I'll train it for longer to see what we get.

@isaiah2004
Copy link

@AdrianAcala How much ram did you have?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants