ValueError when using SAC with co-optimization #17

zxhuang97 · 2021-02-15T07:21:13Z

Thank you for sharing this wonderful repository. Co-optimization experiments run fine with PPO, but when I switch to SAC I get the following error:

ValueError: Have multiple policies {'human': <ray.rllib.policy.tf_policy_template.SACTFPolicy object at 0x7f8ec4436470>, 'robot': <ray.rllib.policy.tf_policy_template.SACTFPolicy object at 0x7f8ebc685ef0>}, but the env <NormalizeActionWrapper<FeedingSawyerHumanEnv instance>> is not a subclass of BaseEnv, MultiAgentEnv or ExternalMultiAgentEnv?

This seems to be related to this issue of RLlib.
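
Going only by the error message, the object handed to RLlib appears to be a NormalizeActionWrapper around FeedingSawyerHumanEnv, and it is the wrapper, not the inner env, that fails the multi-agent subclass check. Below is a minimal sketch of that mechanism. It uses stand-in classes and does not import ray; every class and function name in it is hypothetical, and it is not RLlib's actual code.

```python
class MultiAgentEnv:
    """Stand-in for a multi-agent base class (hypothetical, not RLlib's)."""

    def step(self, action_dict):
        raise NotImplementedError


class TwoAgentEnv(MultiAgentEnv):
    """Hypothetical env with 'human' and 'robot' agents."""

    def step(self, action_dict):
        # Echo each agent's action back as its observation.
        return {agent: action for agent, action in action_dict.items()}


class PlainWrapper:
    """Delegates to the inner env but does not inherit from MultiAgentEnv."""

    def __init__(self, env):
        self.env = env

    def step(self, action_dict):
        return self.env.step(action_dict)


class MultiAgentWrapper(PlainWrapper, MultiAgentEnv):
    """Same delegation, but also declares the multi-agent base class."""


def is_multi_agent(env):
    # The kind of type check the error message describes (assumed form).
    return isinstance(env, MultiAgentEnv)


inner = TwoAgentEnv()
print(is_multi_agent(inner))                     # True
print(is_multi_agent(PlainWrapper(inner)))       # False
print(is_multi_agent(MultiAgentWrapper(inner)))  # True
```

The sketch only shows why a wrapped env can fail a check that the bare env passes. Whether a wrapper of the second kind would resolve the SAC case in this repository is untested.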

Zackory · 2021-03-05T21:43:11Z

Thanks for pointing out this issue!
I have been looking into this recently, but haven't found the solution quite yet.
I'll keep you updated.
