Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feat/maddpg obs optim #459

Merged
merged 17 commits into from
Apr 22, 2022
Merged

Feat/maddpg obs optim #459

merged 17 commits into from
Apr 22, 2022

Conversation

AsadJeewa
Copy link
Contributor

What?

Temp branch for maddpg experiments

Why?

Benchmark to confirm updating observation network with only critic is best

How?

Benchmark with only critic gradients and previous (both actor and critic) and confirm no regression

@AsadJeewa AsadJeewa marked this pull request as draft March 22, 2022 21:33
@pull-request-size pull-request-size bot added size/M and removed size/L labels Mar 24, 2022
@pull-request-size pull-request-size bot added size/S and removed size/M labels Apr 20, 2022
@AsadJeewa AsadJeewa marked this pull request as ready for review April 20, 2022 08:55
@AsadJeewa
Copy link
Contributor Author

AsadJeewa commented Apr 20, 2022

By design, DDPG only uses the Critic loss to update the shared observation networks hence the results of previous investigations were inconclusive Neptune Results This has opened up 2 further investigations #481 & #482

arnupretorius
arnupretorius previously approved these changes Apr 20, 2022
Copy link
Collaborator

@arnupretorius arnupretorius left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me. Thanks @AsadJeewa 👍

DriesSmit
DriesSmit previously approved these changes Apr 21, 2022
Copy link
Contributor

@DriesSmit DriesSmit left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good, thanks @AsadJeewa 🔥

Copy link
Contributor

@KaleabTessera KaleabTessera left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @AsadJeewa !!! 🔥

Just requested a minor change.

mava/systems/tf/maddpg/networks.py Outdated Show resolved Hide resolved
@AsadJeewa AsadJeewa dismissed stale reviews from DriesSmit and arnupretorius via 7c04224 April 21, 2022 12:49
@AsadJeewa AsadJeewa merged commit dc7b93c into develop Apr 22, 2022
@AsadJeewa AsadJeewa deleted the feat/maddpg_obs_optim branch April 22, 2022 07:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants