Question to implement #3

seunghwan1228 · 2020-03-10T07:40:02Z

First, thank you for your work.

Following your description, I'm trying to implement each of the attention layers with TF 2.1.

I have a question: does line 221 need a "squeeze" applied to its input, like this?
attention_score = RepeatVector(source_hidden_states.shape[1])(tf.squeeze(attention_score))
If I understood the full code correctly, h_t already has an expanded dimension, so its attention score has shape (B, 1, H) before it goes into RepeatVector. However, when I feed the (B, 1, H) tensor to RepeatVector, it raises an error: [repeat_vector is incompatible with the layer: expected ndim=2, found ndim=3.]

Thank you

attention-mechanisms/layers.py

Line 221 in 37f131d

    
           attention_score = RepeatVector(source_hidden_states.shape[1])(attention_score)          # (B, S*, H)
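To make the shape question concrete, here is a minimal NumPy sketch of the reasoning only, not the repository's code. `repeat_vector` is a hypothetical stand-in that mimics the behaviour described in the error message (2-D input required, output of shape (B, n, H)), and the sizes `B`, `S`, `H` are made-up values. It also shows why squeezing with an explicit axis may be safer than a bare squeeze: with a batch of one, a bare squeeze also drops the batch dimension.

```python
import numpy as np


def repeat_vector(x, n):
    """Hypothetical NumPy stand-in for a repeat-vector layer: (B, H) -> (B, n, H)."""
    if x.ndim != 2:
        # Mirrors the kind of error quoted in the question
        raise ValueError(f"expected ndim=2, found ndim={x.ndim}")
    return np.repeat(x[:, np.newaxis, :], n, axis=1)


# Assumed sizes, chosen only for illustration
B, S, H = 4, 7, 16

attention_score = np.zeros((B, 1, H))            # (B, 1, H), as described in the question

# Passing the 3-D tensor directly fails the ndim check
try:
    repeat_vector(attention_score, S)
    direct_call_failed = False
except ValueError:
    direct_call_failed = True

# Squeezing only axis 1 gives the 2-D input the layer expects
squeezed = np.squeeze(attention_score, axis=1)   # (B, H)
repeated = repeat_vector(squeezed, S)            # (B, S, H)

# With a batch of one, a bare squeeze removes the batch axis as well
single = np.zeros((1, 1, H))
bare_shape = np.squeeze(single).shape            # batch axis is gone
axis_shape = np.squeeze(single, axis=1).shape    # batch axis is kept
```

In TensorFlow the corresponding call would be `tf.squeeze(attention_score, axis=1)`, which removes only the length-1 middle axis.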

seunghwan1228 closed this as completed Mar 10, 2020

seunghwan1228 reopened this Mar 10, 2020
