
Entropy calculation not useful #8

Open
Myrkiriad-coder opened this issue Apr 18, 2021 · 1 comment

@Myrkiriad-coder

Describe the bug
In ppo_continous_tensorflow.py, entropy is calculated with:

dist_entropy = tf.math.reduce_mean(self.distributions.entropy(action_mean, self.std))

Since the entropy of a Gaussian distribution depends only on its std, and self.std is a static parameter, dist_entropy keeps exactly the same value throughout training. As a result, the entropy loss has no effect on learning.
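
For illustration, here is a minimal sketch using the closed-form Gaussian entropy (the helper name is illustrative, not from the repository) showing why the mean drops out entirely:

```python
import math

# Closed-form entropy of a univariate Gaussian:
#   H(N(mu, sigma^2)) = 0.5 * ln(2 * pi * e * sigma^2)
# Note that mu (the action mean) does not appear in the formula at all.
def gaussian_entropy(std):
    return 0.5 * math.log(2.0 * math.pi * math.e * std ** 2)

# With a static std, the entropy is identical for every state and batch,
# so the entropy bonus is a constant and contributes zero gradient.
print(gaussian_entropy(1.0))  # ~1.4189, regardless of action_mean
```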

To Reproduce
Launch any environment and set a breakpoint on dist_entropy. Observe that it has the same value for every batch at any point during training.

Expected behavior
The std should not be static; it should be learned so that it reflects the network's actual prediction confidence. A possible fix is sketched below.
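
One common remedy, sketched here under the assumption of a state-independent but trainable log-std (the variable names are hypothetical, not from the repository):

```python
import math
import tensorflow as tf

action_dim = 2  # hypothetical action dimensionality

# State-independent but trainable log-std: a common fix in PPO implementations.
log_std = tf.Variable(tf.zeros(action_dim), trainable=True)

def dist_entropy():
    # Diagonal-Gaussian entropy: sum over dims of 0.5 * ln(2*pi*e) + log_std.
    return tf.reduce_sum(0.5 * math.log(2.0 * math.pi * math.e) + log_std)

# Because log_std is trainable, the entropy term now produces a gradient,
# so the entropy coefficient in the PPO loss actually regularizes exploration.
```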

@wisnunugroho21
Owner

Sorry for replying so late, and thank you for the advice. If you use a static std, you can simply set the entropy coefficient to 0.

I really think using a neural network to compute the std is much better than using a static parameter. I forgot to do that in this repository.
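
A minimal sketch of that idea, assuming a Keras model with a softplus-activated std head (the class and layer names are illustrative, not the repository's actual architecture):

```python
import tensorflow as tf

# Hypothetical sketch: a policy head that predicts both mean and std from the
# state, so the entropy varies with the network's prediction confidence.
class GaussianPolicy(tf.keras.Model):
    def __init__(self, action_dim):
        super().__init__()
        self.hidden = tf.keras.layers.Dense(64, activation='relu')
        self.mean_head = tf.keras.layers.Dense(action_dim)
        # softplus keeps the predicted std strictly positive
        self.std_head = tf.keras.layers.Dense(action_dim, activation='softplus')

    def call(self, state):
        x = self.hidden(state)
        # small epsilon avoids a degenerate std of exactly zero
        return self.mean_head(x), self.std_head(x) + 1e-5
```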
