Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

1.Total reward/1.Total reward is illegal; using 1.Total_reward/1.Total_reward instead. #92

Closed
Binbose opened this issue Dec 5, 2020 · 2 comments · Fixed by #104
Closed

Comments

@Binbose
Copy link

Binbose commented Dec 5, 2020

When trying to train the model on lunar lander with the default config file, I get a bunch of error messages like:

INFO:root:Summary name 1.Total reward/3.Episode length is illegal; using 1.Total_reward/3.Episode_length instead.
INFO:root:Summary name 1.Total reward/4.MuZero reward is illegal; using 1.Total_reward/4.MuZero_reward instead.
INFO:root:Summary name 1.Total reward/5.Opponent reward is illegal; using 1.Total_reward/5.Opponent_reward instead.
INFO:root:Summary name 2.Workers/1.Self played games is illegal; using 2.Workers/1.Self_played_games instead.
INFO:root:Summary name 2.Workers/2.Training steps is illegal; using 2.Workers/2.Training_steps instead.
INFO:root:Summary name 2.Workers/3.Self played steps is illegal; using 2.Workers/3.Self_played_steps instead.
INFO:root:Summary name 2.Workers/4.Reanalysed games is illegal; using 2.Workers/4.Reanalysed_games instead.
INFO:root:Summary name 2.Workers/5.Training steps per self played step ratio is illegal; using 2.Workers/5.Training_steps_per_self_played_step_ratio instead.
INFO:root:Summary name 2.Workers/6.Learning rate is illegal; using 2.Workers/6.Learning_rate instead.
INFO:root:Summary name 3.Loss/1.Total weighted loss is illegal; using 3.Loss/1.Total_weighted_loss instead.
INFO:root:Summary name 3.Loss/Value loss is illegal; using 3.Loss/Value_loss instead.
INFO:root:Summary name 3.Loss/Reward loss is illegal; using 3.Loss/Reward_loss instead.
INFO:root:Summary name 3.Loss/Policy loss is illegal; using 3.Loss/Policy_loss instead.

however the model is training (without improving much though). What do these messages mean?

@werner-duvaud
Copy link
Owner

Hi,
Can you please make sure to be using the latest python packages, especially the TensorboardX package ?
If it persists, can you provide additional information about your system ?

Either way, adding an underscore to the tensoboard variable names should solve the problem.

@dribnet
Copy link
Contributor

dribnet commented Jan 1, 2021

I am getting the same behaviour on this and other training attempts (eg: cartpole, connect4) and am using Tensorboard 2.4.0

dribnet added a commit to dribnet/muzero-general that referenced this issue Jan 2, 2021
Updated strings passted to SummaryWriter.add_scalar removing
spaces and replacing with underscores. Fixes werner-duvaud#92.
werner-duvaud pushed a commit that referenced this issue Jan 7, 2021
Updated strings passted to SummaryWriter.add_scalar removing
spaces and replacing with underscores. Fixes #92.
egafni pushed a commit to egafni/muzero-general that referenced this issue Apr 15, 2021
…ner-duvaud#104)

Updated strings passted to SummaryWriter.add_scalar removing
spaces and replacing with underscores. Fixes werner-duvaud#92.
EpicLiem pushed a commit to EpicLiem/muzero-general-chess-archive that referenced this issue Feb 4, 2023
…ner-duvaud#104)

Updated strings passted to SummaryWriter.add_scalar removing
spaces and replacing with underscores. Fixes werner-duvaud#92.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants