Skip to content

Evaluating MuZero's performance using Super Mario Bros (OpenAI Gym)

License

Notifications You must be signed in to change notification settings

sreeharshau/muzero-super-mario-bros

Repository files navigation

muzero-super-mario-bros

Evaluating MuZero's performance using Super Mario Bros (OpenAI Gym)

This project evaluates MuZero using Super Mario Bros and compares it's performance to a custom implemented Deep-Q-Network with Double-Q-Learning (DDQN).

MuZero in Action

Some clips of the agent trained using MuZero in action are shown below.

Muzero_Mario_GIF_1 MuZero_Mario_GIF_2 MuZero_Mario_GIF_3 MuZero_Mario_GIF_4

System Architecture

MuZero_Architecture

How does it hold up against DDQN?

The algorithms were evaluated on a selected overworld level. The number of training epochs were limited by the available computational power. For more details, such as hyper-parameter tuning, please refer to the project report

MuZero_vs_DDQN

Full Project Report

The full project report can be found here.

Citation

Please use the following citation when referring to any results from the repository or the report:

Udayashankar, S., 2022. Evaluating MuZero on Super Mario Bros. [online] GitHub.

About

Evaluating MuZero's performance using Super Mario Bros (OpenAI Gym)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published