Image captioning assignment bug
As our eminent teaching assistant noted, there is an error in the tf.reduce_sum call in the UniformAttention class.
Incorrect line:
context_vector = tf.reduce_sum(tf.expand_dims(attention_weights, axis=-1) * feature_vectors, axis=-1)
Correct line:
context_vector = tf.reduce_sum(tf.expand_dims(attention_weights, axis=-1) * feature_vectors, axis=1)
We should sum over the 49 spatial locations (axis=1), not over the 256-dimensional feature axis (axis=-1), so that the context vector keeps all 256 features.
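To see the difference in shapes, here is a minimal sketch that uses NumPy as a stand-in for TensorFlow (np.sum and np.expand_dims take the same axis arguments as tf.reduce_sum and tf.expand_dims). The batch size of 2, the random features, and the variable names wrong and right are assumptions made for illustration; they are not taken from the assignment code.

```python
import numpy as np

# Hypothetical sizes: batch of 2, 49 spatial locations, 256 features per location.
batch_size, num_locations, num_features = 2, 49, 256

rng = np.random.default_rng(0)
feature_vectors = rng.normal(size=(batch_size, num_locations, num_features))

# Uniform attention: every spatial location gets the same weight, 1/49.
attention_weights = np.full((batch_size, num_locations), 1.0 / num_locations)

# Broadcast the weights over the feature axis: (2, 49, 1) * (2, 49, 256) -> (2, 49, 256).
weighted = np.expand_dims(attention_weights, axis=-1) * feature_vectors

wrong = np.sum(weighted, axis=-1)  # sums away the features, leaving shape (2, 49)
right = np.sum(weighted, axis=1)   # sums over locations, leaving shape (2, 256)

print(wrong.shape, right.shape)
```

With uniform weights, the axis=1 version is simply the mean of the feature vectors over the spatial locations, which gives a quick sanity check for the fix.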
Published Sep. 17, 2019 19:37
- Last modified Sep. 17, 2019 19:37