Gym implementation of Least-Squares Policy Iteration (Lagoudakis & Parr, 2003)

qdevpsi3/rl-lspi

Least-Squares Policy Iteration

Description

This repository implements the model-free algorithms described in the following paper:

  • Paper: Least-Squares Policy Iteration
  • Authors: Lagoudakis and Parr
  • Date: 2003

Setup

To install, clone this repository and run the following commands:

$ cd rl-lspi
$ pip install -r requirements.txt
$ pip install -e .

Details

The available policy evaluation methods are:

  • LSTDQ (iterative or batch)
  • LSTDQ-OPT (iterative, based on the Sherman-Morrison formula)
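
To illustrate how the two evaluation methods relate, here is a minimal NumPy sketch. It is not taken from this repository: the function names `lstdq_batch` and `lstdq_opt`, the argument layout, and the regularisation parameter `delta` are assumptions made for the example. The batch version accumulates the matrix A and solves a linear system, while the optimised version keeps a running inverse of A and updates it with the Sherman-Morrison formula after each sample.

```python
import numpy as np


def lstdq_batch(phi, phi_next, rewards, gamma, delta=1e-3):
    """Batch LSTDQ sketch.

    phi:      (n, k) features of the sampled (s, a) pairs
    phi_next: (n, k) features of (s', pi(s')) under the evaluated policy
    rewards:  (n,) sampled rewards
    delta:    assumed ridge term, equivalent to starting from A = delta * I
    """
    k = phi.shape[1]
    A = delta * np.eye(k) + phi.T @ (phi - gamma * phi_next)
    b = phi.T @ rewards
    return np.linalg.solve(A, b)


def lstdq_opt(phi, phi_next, rewards, gamma, delta=1e-3):
    """Iterative LSTDQ sketch that maintains B = A^{-1} via Sherman-Morrison."""
    k = phi.shape[1]
    B = np.eye(k) / delta          # inverse of the initial A = delta * I
    b = np.zeros(k)
    for f, f_next, r in zip(phi, phi_next, rewards):
        d = f - gamma * f_next     # A grows by the rank-one term f d^T
        Bf = B @ f                 # A^{-1} f
        dB = d @ B                 # d^T A^{-1}
        B -= np.outer(Bf, dB) / (1.0 + dB @ f)
        b += f * r
    return B @ b
```

With the same `delta`, both functions should return the same weight vector up to floating-point error. The iterative form avoids solving a k-by-k system from scratch and costs O(k²) per sample.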

The available features are:

  • Polynomial functions
  • Radial basis functions
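
As a rough illustration of these two feature families, the sketch below builds polynomial and Gaussian radial basis features for a state, then copies them into the block belonging to the chosen action, a common way to build state-action features for LSPI. The function names, the constant bias term in the RBF features, and the `sigma` width parameter are assumptions for this example and may differ from the repository's own code.

```python
import numpy as np


def polynomial_basis(state, degree):
    """Return [1, s, s^2, ..., s^degree] for a scalar state s."""
    s = float(state)
    return np.array([s ** p for p in range(degree + 1)])


def rbf_basis(state, centers, sigma):
    """Return a bias term followed by one Gaussian RBF per center.

    centers is assumed to have shape (m, d) for d-dimensional states.
    """
    state = np.atleast_1d(np.asarray(state, dtype=float))
    centers = np.asarray(centers, dtype=float)
    sq_dist = np.sum((centers - state) ** 2, axis=1)
    return np.concatenate(([1.0], np.exp(-sq_dist / (2.0 * sigma ** 2))))


def state_action_features(state_features, action, n_actions):
    """Place the state features in the block of the given action, zeros elsewhere."""
    k = state_features.shape[0]
    phi = np.zeros(k * n_actions)
    phi[action * k:(action + 1) * k] = state_features
    return phi
```

For example, `state_action_features(polynomial_basis(2, 2), action=1, n_actions=2)` yields a vector of length 6 whose last three entries are 1, 2 and 4.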

The experiments in the paper are reproduced for the following environments:

  • Experiment 1: Chain Walk environment
  • Experiment 2: Inverted Pendulum environment
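
To give a feel for the first experiment, here is a self-contained sketch of LSPI on a small chain walk. It does not use this repository's API or Gym. The chain settings (4 states, actions succeeding with probability 0.9, a reward of +1 for landing on one of the two middle states, discount 0.9) are stated here as assumptions, and tabular one-hot features are used only to keep the example short.

```python
import numpy as np

N_STATES, N_ACTIONS, GAMMA = 4, 2, 0.9  # actions: 0 = left, 1 = right


def step(state, action, rng, p_success=0.9):
    """One chain-walk transition; the move is reversed with probability 1 - p_success."""
    if rng.random() > p_success:
        action = 1 - action
    move = 1 if action == 1 else -1
    next_state = min(max(state + move, 0), N_STATES - 1)
    reward = 1.0 if next_state in (1, 2) else 0.0  # assumed reward structure
    return next_state, reward


def features(state, action):
    """Tabular (one-hot) state-action features, chosen here for simplicity."""
    phi = np.zeros(N_STATES * N_ACTIONS)
    phi[action * N_STATES + state] = 1.0
    return phi


def greedy(weights, state):
    """Action with the highest approximate Q-value in the given state."""
    return int(np.argmax([features(state, a) @ weights for a in range(N_ACTIONS)]))


def lspi(samples, n_iter=20, tol=1e-6):
    """Alternate LSTDQ evaluation and greedy improvement until the weights settle."""
    weights = np.zeros(N_STATES * N_ACTIONS)
    for _ in range(n_iter):
        A = 1e-6 * np.eye(weights.size)  # small ridge term keeps A invertible
        b = np.zeros(weights.size)
        for s, a, r, s_next in samples:
            phi = features(s, a)
            phi_next = features(s_next, greedy(weights, s_next))
            A += np.outer(phi, phi - GAMMA * phi_next)
            b += phi * r
        new_weights = np.linalg.solve(A, b)
        if np.linalg.norm(new_weights - weights) < tol:
            return new_weights
        weights = new_weights
    return weights


# Collect samples with uniformly random states and actions, then run LSPI.
rng = np.random.default_rng(0)
samples = []
for _ in range(5000):
    s, a = int(rng.integers(N_STATES)), int(rng.integers(N_ACTIONS))
    s_next, r = step(s, a, rng)
    samples.append((s, a, r, s_next))

weights = lspi(samples)
policy = [greedy(weights, s) for s in range(N_STATES)]
```

Under these assumptions the greedy policy should move right in the two left states and left in the two right states, which keeps the agent near the rewarding middle of the chain.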
