Feature/fix sampler madqn #477
Thanks @EdanToledo 🙂 See my few comments. Also, I see the checks are failing?
...tf/debugging/simple_spread/feedforward/decentralised/run_maddpg_scale_trainers_fixed_nets.py
I see I made my comments on the DDPG changes, which it seems shouldn't be in this PR, but the same comments apply to DQN. Other than the minor variable/method name changes, all looks good.
Looks good to me 🙂
Looks good.
What?
Fixes the use of multiple trainers in MADQN, so that different agents can have different architectures.
Why?
So that Mava can be used for hierarchical reinforcement learning.
How?
Implements @DriesSmit's code in MADQN.
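
For readers unfamiliar with the multiple-trainer setup, here is a rough, library-independent sketch of the idea: each agent is tied to a network, each trainer owns a subset of the networks, and so each trainer ends up updating only the agents that use its networks. The names `assign_trainers`, `agent_net_keys`, and `trainer_networks`, and the example keys, are assumptions made for illustration. This is not the code in this PR and it does not call Mava.

```python
from collections import defaultdict


def assign_trainers(agent_net_keys, trainer_networks):
    """Group agents by the trainer that owns their network.

    agent_net_keys: dict mapping agent id -> network key (hypothetical names).
    trainer_networks: dict mapping trainer id -> list of network keys it trains.
    """
    # Invert the trainer -> networks mapping, rejecting networks that are
    # claimed by more than one trainer.
    net_to_trainer = {}
    for trainer, nets in trainer_networks.items():
        for net in nets:
            if net in net_to_trainer:
                raise ValueError(f"{net} is assigned to more than one trainer")
            net_to_trainer[net] = trainer

    # Collect, per trainer, the agents whose network that trainer owns.
    trainer_agents = defaultdict(list)
    for agent, net in agent_net_keys.items():
        trainer_agents[net_to_trainer[net]].append(agent)
    return dict(trainer_agents)


# Two agents share one architecture, a third uses a different one.
agent_net_keys = {
    "agent_0": "network_0",
    "agent_1": "network_0",
    "agent_2": "network_1",
}
trainer_networks = {
    "trainer_0": ["network_0"],
    "trainer_1": ["network_1"],
}
assignment = assign_trainers(agent_net_keys, trainer_networks)
```

In this sketch, `trainer_0` would handle `agent_0` and `agent_1`, while `trainer_1` would handle `agent_2`, which is the kind of split a hierarchical setup needs when levels of the hierarchy use different networks.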