The following reinforcement learning experiment investigates optimal hyperparameter values for deep Q-learning (DQN) on the Lunar Lander problem provided by OpenAI Gym. LunarLander-v2 is an environment with uncertainty, and this investigation searches for hyperparameters that maximize the mean reward within 400 episodes. A deep neural network is designed for the agent, and the simulation is carried out with various reinforcement learning parameters. Using a neural network with two hidden layers, the agent converged to a mean reward of 200 in a little over 250 episodes with epsilon = 0.9, epsilon decay = 0.995, alpha (learning rate) = 0.001, and gamma (discount factor) = 0.99. A comparative analysis of the different parameter settings is also performed, and the results and model architecture from this experiment are compared to other similar experiments that apply the DQN method to the Lunar Lander problem.
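The exploration schedule implied by the reported hyperparameters (epsilon = 0.9, decayed by a factor of 0.995 each episode) can be sketched as follows. This is a minimal illustration, not the repository's actual code; the floor value `epsilon_min = 0.01` is an assumption, since the description does not state one.

```python
def epsilon_schedule(start=0.9, decay=0.995, floor=0.01, episodes=400):
    """Yield the epsilon-greedy exploration rate for each episode.

    start:  initial epsilon (0.9, as reported)
    decay:  multiplicative per-episode decay (0.995, as reported)
    floor:  minimum epsilon (assumed; not given in the description)
    """
    eps = start
    for _ in range(episodes):
        yield eps
        eps = max(floor, eps * decay)

schedule = list(epsilon_schedule())
print(f"epsilon at episode 0:   {schedule[0]:.3f}")
print(f"epsilon at episode 250: {schedule[250]:.3f}")
```

With these values, epsilon is still well above the assumed floor around episode 250, where the agent is reported to have converged, so the agent retains some exploration throughout training.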
shilpakancharla/deep-rl-lunar-lander
About
Using a deep Q-learning network and searching for optimal hyperparameters in order to solve the lunar lander problem provided by OpenAI Gym.