强化学习：原理与Python实现

世界上第一本配套 TensorFlow 2 代码的强化学习教程书

中国第一本配套 TensorFlow 2 代码的纸质算法书

本书介绍强化学习理论及其 Python 实现。

理论完备：全书用一套完整的数学体系，严谨地讲授强化学习的理论基础，主要定理均给出证明过程。各章内容循序渐进，覆盖了所有主流强化学习算法，包括资格迹等非深度强化学习算法和柔性执行者/评论者等深度强化学习算法。
案例丰富：在您最爱的操作系统（包括 Windows、macOS、Linux）上，基于最新的 Python 3.7、Gym 0.17 和 TensorFlow 2.1（兼容 TensorFlow 1.15），实现强化学习算法。全书实现统一规范，体积小、重量轻。第 1～9 章给出了算法的配套实现，环境部分只依赖于 Gym 的最小安装，在没有 GPU 的计算机上也可运行；第 10～12 章介绍了多个热门综合案例，涵盖 Gym 的完整安装和自定义扩展，在有普通 GPU 的计算机上即可运行。

Reinforcement Learning: Theory and Python Implementation

The First Reinforcement Learning Tutorial Book with TensorFlow 2 Implementation

This is a tutorial book on reinforcement learning, with explanation of theory and Python implementation.

Theory: Starting from a uniform mathematical framework, this book derives the theory and algorithms of reinforcement learning, including all major algorithms such as eligibility traces and soft actor-critic algorithms.
Practice: Every chapter is accompanied by high quality implementation based on Python 3.7, Gym 0.17, and TensorFlow 2.1.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
chapter01_intro		chapter01_intro
chapter02_mdp		chapter02_mdp
chapter03_dp		chapter03_dp
chapter04_mc		chapter04_mc
chapter05_td		chapter05_td
chapter06_approx		chapter06_approx
chapter07_pg		chapter07_pg
chapter08_ac		chapter08_ac
chapter09_dpg		chapter09_dpg
chapter10_atari		chapter10_atari
chapter11_alphazero		chapter11_alphazero
chapter12_drive		chapter12_drive
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
notations.pdf		notations.pdf