We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hi!
I was looking at the progress monitor, which the paper shows it's calculation as:
But this line in the code shows a slightly different equation (one closing parenthesis changed position):
regretful-agent/tasks/R2R-pano/models/policy_model.py
Line 133 in 5caf7b5
This would translate to (in the paper notation):
The difference is that in the first, the tanh is included within the sigmoid, and, in the second equation, it's outside the sigmoid.
tanh
Is there any major difference between using these two equations?
The text was updated successfully, but these errors were encountered:
No branches or pull requests
Hi!
I was looking at the progress monitor, which the paper shows it's calculation as:
But this line in the code shows a slightly different equation (one closing parenthesis changed position):
regretful-agent/tasks/R2R-pano/models/policy_model.py
Line 133 in 5caf7b5
This would translate to (in the paper notation):
The difference is that in the first, the
tanh
is included within the sigmoid, and, in the second equation, it's outside the sigmoid.Is there any major difference between using these two equations?
The text was updated successfully, but these errors were encountered: