Reinforcement Learning: N-Step SARSA and λ-SARSA

This code implements the N-Step SARSA and λ-SARSA algorithms for reinforcement learning in the WindyGridworld environment.

Environment

The WindyGridworld class represents the Windy Gridworld environment: a grid with a start state, a goal state, and wind effects in certain columns. The agent takes actions to move through the grid, and its objective is to reach the goal state in as few steps as possible while avoiding obstacles.
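To make the wind mechanics concrete, here is a minimal sketch of such an environment. It is not the repository's WindyGridworld class. The 7x10 layout, the wind strengths, the start and goal cells, the reward of -1 per step, and the reset/step method names are all assumptions taken from the classic textbook version of the problem, and obstacles are left out for brevity.

```python
# Illustrative sketch only: layout, rewards, and method names are assumptions,
# not the repository's WindyGridworld class.
class SketchWindyGridworld:
    # Actions as (row change, column change): up, down, left, right.
    ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]

    def __init__(self):
        self.rows, self.cols = 7, 10
        # Upward push applied in each column (assumed values).
        self.wind = [0, 0, 0, 1, 1, 1, 2, 2, 1, 0]
        self.start, self.goal = (3, 0), (3, 7)
        self.state = self.start

    def reset(self):
        """Put the agent back on the start cell and return that state."""
        self.state = self.start
        return self.state

    def step(self, action):
        """Apply an action index (0-3) and return (next_state, reward, done)."""
        row, col = self.state
        d_row, d_col = self.ACTIONS[action]
        # The wind of the column being left pushes the agent up (smaller row).
        new_row = row + d_row - self.wind[col]
        new_col = col + d_col
        # Clip to the grid so the agent never leaves the board.
        new_row = min(max(new_row, 0), self.rows - 1)
        new_col = min(max(new_col, 0), self.cols - 1)
        self.state = (new_row, new_col)
        done = self.state == self.goal
        return self.state, -1, done
```

In this sketch, moving right four times from the start leaves the agent one row higher than it began, because the fourth move departs from a windy column.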

Algorithms

The code implements the following algorithms:

N-Step SARSA

The n_step_sarsa function implements the N-Step SARSA algorithm. It takes the following parameters:

env: The environment object representing the Windy Gridworld.
n: The number of steps to look ahead for updates.
alpha: The learning rate.
gamma: The discount factor.
epsilon: The exploration rate.
num_episodes: The number of episodes to run the algorithm.

The function returns the learned Q-values, episode rewards, and episode lengths.
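The core of an n-step update is the n-step return: the discounted sum of the next n rewards plus a bootstrapped Q-value for the state-action pair reached after n steps. The sketch below shows that computation in isolation. The helper names n_step_target and n_step_update are hypothetical and are not taken from the repository's n_step_sarsa function.

```python
# Hypothetical helpers illustrating the n-step SARSA target; the names and
# structure are assumptions, not the repository's implementation.
def n_step_target(rewards, gamma, bootstrap_q=0.0):
    """Return R_1 + gamma*R_2 + ... + gamma^(n-1)*R_n + gamma^n * bootstrap_q.

    rewards     -- the n rewards observed after the state-action pair being updated
    bootstrap_q -- Q(S_n, A_n); use 0.0 if the episode ended within the n steps
    """
    target = 0.0
    for i, reward in enumerate(rewards):
        target += (gamma ** i) * reward
    target += (gamma ** len(rewards)) * bootstrap_q
    return target


def n_step_update(q_value, target, alpha):
    """Move the current Q-value a fraction alpha of the way toward the target."""
    return q_value + alpha * (target - q_value)
```

With n = 1 this reduces to the ordinary one-step SARSA target; larger n leans more on observed rewards and less on the bootstrapped estimate.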

λ-SARSA

The lambda_sarsa function implements the λ-SARSA algorithm. It takes parameters similar to those of n_step_sarsa, plus an additional parameter, lmbda, which is the eligibility trace decay rate.

The function returns the learned Q-values, episode rewards, and episode lengths.
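The role of the decay rate is easiest to see in a single update step. The sketch below assumes accumulating eligibility traces and dictionary-based tables keyed by (state, action). Both choices, and the helper name sarsa_lambda_step, are assumptions for illustration and may differ from what lambda_sarsa does internally.

```python
from collections import defaultdict


# Hypothetical single-step update with accumulating traces; not the
# repository's lambda_sarsa implementation.
def sarsa_lambda_step(Q, E, s, a, r, s2, a2, done, alpha, gamma, lmbda):
    """Apply one SARSA(lambda) update in place and return the TD error."""
    next_q = 0.0 if done else Q[(s2, a2)]
    delta = r + gamma * next_q - Q[(s, a)]
    E[(s, a)] += 1.0  # mark the pair just visited
    for key in list(E):
        # Every recently visited pair shares the TD error, weighted by its trace.
        Q[key] += alpha * delta * E[key]
        # Traces fade by gamma * lmbda after each step.
        E[key] *= gamma * lmbda
    return delta


# Usage: tables default to 0.0 for unseen state-action pairs.
Q = defaultdict(float)
E = defaultdict(float)
sarsa_lambda_step(Q, E, "s0", 0, -1.0, "s1", 1, False,
                  alpha=0.5, gamma=1.0, lmbda=0.5)
```

Setting lmbda to 0 makes every trace vanish after one step, which recovers one-step SARSA, while values near 1 spread each TD error further back along the trajectory.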

Running the Code

To run the code, follow these steps:

Create an instance of the WindyGridworld environment.
Set the algorithm parameters such as learning rate, discount factor, exploration rate, and the number of episodes.
Call the desired algorithm function (n_step_sarsa or lambda_sarsa) with the environment and parameters.
Plot the learning curves to visualize the algorithm's performance.
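For the plotting step, raw per-episode rewards and lengths tend to be noisy, so it can help to smooth them first. The helper below is a hypothetical addition, not part of the repository. It computes a trailing moving average over a list such as the episode lengths returned by either algorithm.

```python
# Hypothetical smoothing helper for learning curves; not part of the repository.
def moving_average(values, window):
    """Return the trailing mean of values over at most `window` entries."""
    if window <= 0:
        raise ValueError("window must be a positive integer")
    smoothed = []
    running = 0.0
    for i, value in enumerate(values):
        running += value
        if i >= window:
            # Drop the entry that just fell out of the window.
            running -= values[i - window]
        smoothed.append(running / min(i + 1, window))
    return smoothed
```

The result has the same length as the input, so it can be plotted against the episode index with any plotting library.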

You can modify the algorithm parameters and experiment with different settings to observe their impact on learning.

For detailed implementation and usage examples, refer to the code comments.

Support

Contact me at:

E-mail: farzanehkoohestani2000@gmail.com

Telegram ID: @farzaneh_koohestani
