Pendulum_PPO

Solving OpenAI Pendulum-v0 using Proximal Policy Optimization Algorithms

Quickstart

Run this command to use the pretrained model to play the game

>python pendulum.py play

Or run this command to train the model

>python pendulum.py anything-(not-play)
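The two commands above imply that the script branches on its first command-line argument. A minimal sketch of that dispatch follows, assuming the check is a plain string comparison; the helper name select_mode is hypothetical and not taken from pendulum.py, and treating a missing argument as training is also an assumption.

```python
import sys


def select_mode(argv):
    """Return "play" if the first CLI argument is "play", otherwise "train"."""
    # Assumption: any argument other than "play" (or no argument) means train.
    if len(argv) > 1 and argv[1] == "play":
        return "play"
    return "train"


if __name__ == "__main__":
    # Hypothetical entry point: the real script would load the pretrained
    # model or start training here instead of printing the mode.
    print(select_mode(sys.argv))
```

With this sketch, python pendulum.py play selects play mode and any other argument selects training.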

Notes

The model in pendulum.py was able to solve Pendulum-v0 after about 110 episodes

Figure: Total rewards in 140 steps of training
You're free to edit the model hyperparameters and some constants to make it better
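
One hyperparameter that PPO implementations commonly expose is the clipping range. The sketch below shows the clipped surrogate objective for a single sample in plain Python, assuming the clipped variant of PPO is in use; the function name, the parameter name clip_eps, and its default of 0.2 are illustrative and may differ from what pendulum.py uses.

```python
def clipped_surrogate(ratio, advantage, clip_eps=0.2):
    """PPO clipped surrogate objective for one sample.

    ratio:     new_policy_prob / old_policy_prob for the taken action
    advantage: estimated advantage of that action
    clip_eps:  clipping range (hypothetical name; a tunable hyperparameter)
    """
    # Keep the probability ratio inside [1 - clip_eps, 1 + clip_eps].
    clipped_ratio = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
    # Take the more pessimistic of the unclipped and clipped terms.
    return min(ratio * advantage, clipped_ratio * advantage)
```

A smaller clip_eps keeps each policy update closer to the previous policy, while a larger one allows bigger steps at the cost of stability.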

Contributions

Special thanks to Morvan Zhou for the explanation of PPO