muzero-super-mario-bros

Evaluating MuZero's performance using Super Mario Bros (OpenAI Gym)

This project evaluates MuZero using Super Mario Bros and compares it's performance to a custom implemented Deep-Q-Network with Double-Q-Learning (DDQN).

MuZero in Action

Some clips of the agent trained using MuZero in action are shown below.

System Architecture

How does it hold up against DDQN?

The algorithms were evaluated on a selected overworld level. The number of training epochs were limited by the available computational power. For more details, such as hyper-parameter tuning, please refer to the project report

Full Project Report

The full project report can be found here.

Citation

Please use the following citation when referring to any results from the repository or the report:

Udayashankar, S., 2022. Evaluating MuZero on Super Mario Bros. [online] GitHub.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
Evaluating_MuZero_Super_Mario_Bros.pdf		Evaluating_MuZero_Super_Mario_Bros.pdf
LICENSE		LICENSE
MuZero_Architecture.jpg		MuZero_Architecture.jpg
MuZero_vs_DDQN.png		MuZero_vs_DDQN.png
README.md		README.md
agent01_19_359(1).gif		agent01_19_359(1).gif
agent01_356_1420(1).gif		agent01_356_1420(1).gif
agent01_623_3196(1).gif		agent01_623_3196(1).gif
agent01_754_3191(1).gif		agent01_754_3191(1).gif

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

muzero-super-mario-bros

MuZero in Action

System Architecture

How does it hold up against DDQN?

Full Project Report

Citation

About

Releases

Packages

License

sreeharshau/muzero-super-mario-bros

Folders and files

Latest commit

History

Repository files navigation

muzero-super-mario-bros

MuZero in Action

System Architecture

How does it hold up against DDQN?

Full Project Report

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages