Feature/fix sampler madqn #477

EdanToledo · 2022-04-12T14:41:39Z

What?

Fixes MADQN so that it supports multiple trainers and different agents with different architectures.

Why?

So that Mava can be used for hierarchical reinforcement learning.

How?

Implements @DriesSmit's code in MADQN.


mava/systems/tf/madqn/system.py

DriesSmit

Thanks @EdanToledo 🙂 See my few comments. Also, I see the checks are failing?

...tf/debugging/simple_spread/feedforward/decentralised/run_maddpg_scale_trainers_fixed_nets.py

mava/systems/tf/maddpg/builder.py

sash-a · 2022-04-12T15:35:02Z

I see I made my comments on the DDPG changes, which seem like they shouldn't be in this PR, but the same comments apply to DQN.

Other than the minor variable/method name changes, it all looks good.


DriesSmit

Looks good to me 🙂

sash-a

looks good

EdanToledo added 2 commits April 12, 2022 15:15

feat: add multiple trainer, multiple agent, multiple agent architecture support for madqn

5774a57

fix: allow net_spec keys to be passed into create_default_networks

1a84f2f

EdanToledo requested review from arnupretorius, KaleabTessera, DriesSmit, mmorris44, AsadJeewa, RuanJohn and jcformanek as code owners April 12, 2022 14:41

pull-request-size bot added the size/L label Apr 12, 2022

DriesSmit reviewed Apr 12, 2022

mava/systems/tf/madqn/system.py (outdated, resolved)

DriesSmit reviewed Apr 12, 2022

...tf/debugging/simple_spread/feedforward/decentralised/run_maddpg_scale_trainers_fixed_nets.py (outdated, resolved)

fix: linting

18d420f

sash-a reviewed Apr 12, 2022

mava/systems/tf/maddpg/builder.py (outdated, resolved)

sash-a reviewed Apr 12, 2022

mava/systems/tf/maddpg/builder.py (outdated, resolved)

EdanToledo and others added 4 commits April 13, 2022 14:48

fix: add if statement to check if default net spec keys is given and refactor

10185f6

fix: linting issue

80cf51d

Merge branch 'develop' into feature/fix_sampler_madqn

c4c2509

fix: Small update variable naming.

5832921

pull-request-size bot added size/M and removed size/L labels Apr 22, 2022

DriesSmit previously approved these changes Apr 22, 2022


sash-a previously approved these changes Apr 22, 2022


fix: Small fix to PPO variable naming.

f494b83

DriesSmit dismissed stale reviews from sash-a and themself via f494b83 April 22, 2022 10:08

DriesSmit approved these changes Apr 22, 2022


sash-a approved these changes Apr 22, 2022


DriesSmit merged commit 1eef1eb into develop Apr 22, 2022

DriesSmit deleted the feature/fix_sampler_madqn branch April 22, 2022 10:19
