Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dependency of actions #2

Open
Chenhait opened this issue May 29, 2022 · 2 comments
Open

Dependency of actions #2

Chenhait opened this issue May 29, 2022 · 2 comments

Comments

@Chenhait
Copy link

Hi, thank you so much for offering this code.

I read the code: hybrid_sac_platform.py

And, the paper said:
If the continuous action ac must depend on the discrete action chosen by the agent, then ad
can be used as input when computing µc and σc.

I think the continuous action depend on the discrete action, but I did not find the
where using ad to compute µc and σc.

Could you tell me where to show this dependency of actions.

Thank you very much for any advance!!!

@nisheeth-golakiya
Copy link
Owner

Hi @Chenhait, I'm glad you're interested in the code.

I agree that for the Platform environment, the continuous actions depend on discrete actions. Namely, there are 3 discrete actions (run, hop, leap), each associated with 1 continuous component ("how much" to run/hop/leap).

Consider the case where, in the policy network, we have just one common set of μc and σc for all discrete actions. In this case, we would definitely need to condition it on the chosen discrete action.

However, in the implementation here, we have separate μc and σc for each discrete action. Hence, we don't need to condition on the chosen discrete action.

I hope this clears things up.

@Chenhait
Copy link
Author

This is very helpful!!!
Thank you so much for your reply and wonderful job!!!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants