
fix: shared weights with agent type #428

Merged: 8 commits merged into develop on Mar 1, 2022
Conversation

@AsadJeewa (Contributor) commented Feb 23, 2022

What?

Updated agent network key assignment so that weights are only shared between agents of the same type (#427).

Why?

When shared_weights is set to true, a single network is created for all agents. It should not be possible to share weights across different agent types.

How?

Changed self._agent_net_keys in each system.py to assign the correct keys (for MADQN, MAPPO, and MADDPG).
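The per-type sharing logic described above can be sketched as follows. This is a hypothetical illustration, not Mava's actual implementation; the function name and the "<type>_<id>" agent-naming assumption are mine.

```python
# Hypothetical sketch of per-type network key assignment (not Mava's
# actual code). Assumes agents are named "<type>_<id>", e.g. "agent_0".
def make_agent_net_keys(agents, shared_weights):
    """Map each agent to a network key, sharing weights only within a type."""
    if shared_weights:
        # One network per agent *type*, shared by all agents of that type.
        return {agent: f"{agent.split('_')[0]}_network" for agent in agents}
    # No sharing: one network per individual agent.
    return {agent: f"{agent}_network" for agent in agents}
```

With shared_weights=True, ["speaker_0", "listener_0", "listener_1"] would map speaker_0 to "speaker_network" and both listeners to "listener_network", so the two listeners share weights but the speaker does not.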

Extra

If we are happy with this change, would we then need to benchmark?

@AsadJeewa AsadJeewa added the bug Something isn't working label Feb 23, 2022
@AsadJeewa AsadJeewa added this to the Mava stable systems release milestone Feb 23, 2022
@AsadJeewa AsadJeewa self-assigned this Feb 23, 2022
@AsadJeewa AsadJeewa linked an issue Feb 23, 2022 that may be closed by this pull request
3 tasks
@arnupretorius (Collaborator) left a comment

Thanks @AsadJeewa! 😄

Please see my few comments. Also, did we test this? At the very least we should do some runs on environments with and without different agent types.

Review threads (resolved):
  • mava/systems/tf/maddpg/system.py
  • mava/systems/tf/mappo/system.py
@AsadJeewa (Contributor, Author) commented:

I tested that everything works in multiple environments, both with different agent types (OpenSpiel Tic-Tac-Toe, PettingZoo Pong) and with a single agent type (debugging environments, PettingZoo Multiwalker, Flatland), to make sure that the networks were assigned correctly and that execution proceeds. Since the default value of shared_weights is True, I think we should run a small set of benchmarks @RuanJohn

@mmorris44 (Contributor) left a comment

  • 1 potential issue in 3 places
  • 1 debug print statement
  • 1 minor comment

Review threads (resolved):
  • mava/systems/tf/maddpg/system.py
  • mava/systems/tf/madqn/system.py (2 threads)
  • mava/systems/tf/mappo/system.py
@AsadJeewa (Contributor, Author) commented Feb 24, 2022

This line (https://github.com/instadeepai/Mava/blob/develop/mava/specs.py#L68) assumes that all environments follow the type_identifier naming convention, but this is not always the case. For example, in OpenSpiel Tic-Tac-Toe the agents are named player_0 and player_1, so Mava assumes they are the same type. I have created a separate issue for this (#434).
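The naming assumption can be illustrated with a small sketch. agent_type here is a hypothetical helper mirroring the split that specs.py performs, not Mava's actual function.

```python
# Hypothetical helper mirroring the "<type>_<identifier>" split in
# mava/specs.py (illustration only, not Mava's actual code).
def agent_type(agent_name: str) -> str:
    # Everything before the first underscore is treated as the agent type.
    return agent_name.split("_")[0]

# Environments that follow the convention group as intended:
#   "agent_0" and "agent_1" both map to type "agent".
# But OpenSpiel Tic-Tac-Toe names its agents "player_0" and "player_1",
# so both map to type "player" and are grouped as the same type,
# whether or not that is what the environment intends.
```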

@mmorris44 (Contributor) left a comment

PR looks good to go from my side.
Thanks @AsadJeewa!

@arnupretorius (Collaborator) left a comment

Thanks @AsadJeewa! 👍

@AsadJeewa AsadJeewa merged commit f83f125 into develop Mar 1, 2022
@AsadJeewa AsadJeewa deleted the bugfix/shared_weights branch March 1, 2022 08:22
Labels: bug (Something isn't working), size/M
Linked issue that may be closed by this pull request: [BUG] Shared weights across agent types
3 participants