Distributed-Cooperative-Multi-Agent-Reinforcement-Learning-in-Markov-Games

Team Q-Learning, Distributed Q-Learning, QD-Learning, and Networked Actor-Critic Algorithms for Cooperative Markov Games

This project studies distributed multi-agent reinforcement learning (MARL) in cooperative Markov games. As a starting point, we implement a simple single-agent Q-learning algorithm to solve a single-agent MDP. We then implement four MARL algorithms for multi-agent tasks modeled as cooperative Markov games. The first three are value-based: Distributed Q-Learning, Team Q-Learning, and QD-Learning. Distributed Q-Learning and Team Q-Learning do not require communication among agents, but they converge only in deterministic MDPs, whereas QD-Learning relies on communication. The fourth algorithm is a communication-based actor-critic algorithm for cooperative agents. Both QD-Learning and the actor-critic algorithm maximize the sum of the expected returns of all agents.
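To make the contrast between the value-based methods concrete, here is a minimal sketch of three tabular update rules in Python with NumPy. It is not taken from code.ipynb. The function names, table shapes, and the step sizes `alpha`, `beta`, and `gamma` are illustrative assumptions. The optimistic rule follows the commonly cited form of Distributed Q-Learning, which assumes non-negative rewards and zero-initialized tables. The consensus-plus-innovation rule is one plausible reading of QD-Learning written for reward maximization.

```python
import numpy as np


def q_learning_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """Standard single-agent tabular Q-learning step.

    Q has shape (n_states, n_actions) and is modified in place.
    """
    td_target = r + gamma * np.max(Q[s_next])
    Q[s, a] += alpha * (td_target - Q[s, a])
    return Q


def distributed_q_update(q_i, s, a_i, r, s_next, gamma=0.9):
    """Optimistic update for one agent (assumed form of Distributed Q-Learning).

    q_i has shape (n_states, n_own_actions): the agent indexes only its own
    action and never lowers an estimate, so no communication is needed.
    """
    candidate = r + gamma * np.max(q_i[s_next])
    q_i[s, a_i] = max(q_i[s, a_i], candidate)
    return q_i


def qd_update(Q_all, i, neighbors, s, a, r_i, s_next,
              alpha=0.1, beta=0.05, gamma=0.9):
    """Consensus + innovation step for agent i (assumed form of QD-Learning).

    Q_all has shape (n_agents, n_states, n_joint_actions). The consensus term
    pulls agent i toward its neighbors' estimates; the innovation term is a
    local temporal-difference error built from the agent's own reward r_i.
    Returns the new value of Q_all[i, s, a] without modifying Q_all.
    """
    consensus = sum(Q_all[i, s, a] - Q_all[j, s, a] for j in neighbors)
    innovation = r_i + gamma * np.max(Q_all[i, s_next]) - Q_all[i, s, a]
    return Q_all[i, s, a] - beta * consensus + alpha * innovation
```

The optimistic rule never decreases an entry, which is consistent with the restriction to deterministic MDPs stated above: under stochastic transitions or rewards, a single lucky sample would be kept permanently. The QD-style rule instead needs each agent to read its neighbors' tables, which is the communication requirement.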

Files: .gitignore, LICENSE, README.md, code.ipynb, report.pdf

hafezgh/Distributed-Cooperative-Multi-Agent-Reinforcement-Learning-in-Markov-Games
