Least-Squares Policy Iteration

Description

This repository implements the model-free approaches from the following paper :

  • Paper : Least-Squares Policy Iteration
  • Authors : Lagoudakis and Parr
  • Date : 2003

Setup

To install, clone this repository and execute the following commands :

$ cd rl-lspi
$ pip install -r requirements.txt
$ pip install -e .

Details

The available policy evaluation methods are :

  • LSTDQ (iterative or batch)
  • LSTDQ-OPT (iterative, based on the Sherman-Morrison formula)
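
To illustrate the Sherman-Morrison variant listed above, here is a minimal, dependency-free sketch. It is not the code in this repository: the function name `lstdq_opt`, the sample layout `(phi, reward, phi_next)` and the regularization constant `delta` are assumptions made for this example. Each sample adds the rank-one term `phi * (phi - gamma * phi_next)^T` to the LSTDQ matrix, so its inverse can be updated directly instead of being recomputed.

```python
def lstdq_opt(samples, k, gamma=0.9, delta=1e-3):
    """Hypothetical LSTDQ sketch using Sherman-Morrison updates.

    samples: iterable of (phi, reward, phi_next), where phi = phi(s, a) and
    phi_next = phi(s', pi(s')) are feature vectors (lists) of length k.
    Returns the weight vector w of the approximate Q-function.
    """
    # B tracks the inverse of (delta * I + A); start from (delta * I)^-1.
    B = [[(1.0 / delta if i == j else 0.0) for j in range(k)] for i in range(k)]
    b = [0.0] * k
    for phi, reward, phi_next in samples:
        # d = phi - gamma * phi_next
        d = [p - gamma * pn for p, pn in zip(phi, phi_next)]
        # Bu = B @ phi and vB = d^T @ B
        Bu = [sum(B[i][j] * phi[j] for j in range(k)) for i in range(k)]
        vB = [sum(d[i] * B[i][j] for i in range(k)) for j in range(k)]
        denom = 1.0 + sum(d[i] * Bu[i] for i in range(k))
        # Sherman-Morrison: B <- B - (B phi)(d^T B) / (1 + d^T B phi)
        for i in range(k):
            for j in range(k):
                B[i][j] -= Bu[i] * vB[j] / denom
        # b <- b + phi * reward
        for i in range(k):
            b[i] += phi[i] * reward
    # w = B @ b
    return [sum(B[i][j] * b[j] for j in range(k)) for i in range(k)]
```

With one-hot features, for example `lstdq_opt([([1.0, 0.0], 0.0, [0.0, 1.0]), ([0.0, 1.0], 1.0, [0.0, 1.0])], k=2, gamma=0.5)`, the weights come out close to the exact fixed point `[1, 2]`, up to the small bias introduced by `delta`.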

The available features are :

  • Polynomial functions
  • Radial basis functions
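
As a rough sketch of what these two feature families can look like, the snippet below builds polynomial and Gaussian radial basis features for a state, then copies them into one block per action to obtain state-action features. All function names, the constant bias term and the per-action block layout are assumptions for illustration, not necessarily how this repository defines its features.

```python
import math


def polynomial_features(s, degree):
    """Powers [1, s, s^2, ..., s^degree] of a scalar state s."""
    return [s ** p for p in range(degree + 1)]


def rbf_features(s, centers, sigma):
    """Constant bias term followed by one Gaussian bump per center.

    s and each center are sequences of the same length.
    """
    feats = [1.0]
    for c in centers:
        sq_dist = sum((si - ci) ** 2 for si, ci in zip(s, c))
        feats.append(math.exp(-sq_dist / (2.0 * sigma ** 2)))
    return feats


def state_action_features(state_feats, action, n_actions):
    """Place the state features in the block of the chosen action.

    All other blocks stay at zero, so each action gets its own weights.
    """
    k = len(state_feats)
    phi = [0.0] * (k * n_actions)
    phi[action * k:(action + 1) * k] = state_feats
    return phi
```

For instance, `state_action_features(polynomial_features(2.0, 2), 1, 2)` gives a vector of length 6 whose second half holds the powers of 2.0.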

The experiments in the paper are reproduced for the following environments :

  • Experiment 1 : Chain Walk environment
  • Experiment 2 : Inverted Pendulum environment