PARL/examples/QMIX at develop · PaddlePaddle/PARL

History

Name		Name	Last commit message	Last commit date
parent directory ..
images		images
README.md		README.md
env_wrapper.py		env_wrapper.py
qmix_agent.py		qmix_agent.py
qmix_config.py		qmix_config.py
qmixer_model.py		qmixer_model.py
replay_buffer.py		replay_buffer.py
requirements.txt		requirements.txt
rnn_model.py		rnn_model.py
train.py		train.py
utils.py		utils.py

README.md

QMIX based on PARL and PaddlePaddle2.0

We reproduce the QMIX based on PARL and PaddlePaddle>=2.0.0, reaching the same level of indicators as the paper in StarCraft2 benchmarks.

QMIX

QMIX is a value-based multi-agent reinforcement learning algorithm.
Learn more about QMIX from: QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

StarCraft2 Environment

Paper: The StarCraft Multi-Agent Challenge
Github Repositories: smac

Benchmark Results

We trained our model in 5 different scenarios: "3m", "8m", "2s_3z", "3s_5z" and "1c_3s_5z".
The difficulty in all scenarios are set to be "7" (very difficult).
We trained our model 3 times for each scenario.

How to Use

Dependencies

python3.6+

parl>=2.0.0

Start Training

Modify the config in qmix_config.py.
Start training:
```
python train.py
```
View the training process with tensorboard:
```
tensorboard --logdir ./
```

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

QMIX

QMIX

README.md

QMIX based on PARL and PaddlePaddle2.0

QMIX

StarCraft2 Environment

Benchmark Results

How to Use

Dependencies

Start Training

Files

QMIX

Directory actions

More options

Directory actions

More options

Latest commit

History

QMIX

Folders and files

parent directory

README.md

QMIX based on PARL and PaddlePaddle2.0

QMIX

StarCraft2 Environment

Benchmark Results

How to Use

Dependencies

Start Training