Why is `context` used to compute the Luong attention score? #5

Open
kwz219 opened this issue Jan 11, 2020 · 0 comments
Comments


kwz219 commented Jan 11, 2020

The paper says the score is computed from the target's current hidden state together with the full encoder_outputs, but the lesson-7 seq2seq code computes it like this:
context_in = self.linear_in(context.view(batch_size * input_len, -1)).view(batch_size, input_len, -1)
attn = torch.bmm(output, context_in.transpose(1, 2))
Shouldn't the `context` here be replaced with the hidden state at the current time step?
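
For reference, the "general" score in Luong et al. is score(h_t, h̄_s) = h_tᵀ W_a h̄_s, where h_t is the decoder hidden state and h̄_s an encoder output. Below is a minimal PyTorch sketch of that computation; the tensor names (decoder_hidden, encoder_outputs) are illustrative assumptions, not the course code's actual names. It shows that projecting the encoder side with a linear layer and then taking a batched matmul against the decoder states yields exactly this score, since h_tᵀ (W_a h̄_s) = h_tᵀ W_a h̄_s.

import torch
import torch.nn as nn

batch_size, tgt_len, src_len, hidden_size = 2, 3, 5, 8

# h_t for each target step and h_s for each source step (random stand-ins)
decoder_hidden = torch.randn(batch_size, tgt_len, hidden_size)
encoder_outputs = torch.randn(batch_size, src_len, hidden_size)

# W_a of the "general" score
linear_in = nn.Linear(hidden_size, hidden_size, bias=False)

proj = linear_in(encoder_outputs)                         # (B, src_len, H) = W_a h_s
scores = torch.bmm(decoder_hidden, proj.transpose(1, 2))  # (B, tgt_len, src_len) = h_t^T W_a h_s
attn_weights = torch.softmax(scores, dim=-1)              # normalize over source positions
context = torch.bmm(attn_weights, encoder_outputs)        # (B, tgt_len, H) weighted sum of h_s
print(context.shape)                                      # torch.Size([2, 3, 8])

So if the `context` variable in the course code actually holds the encoder outputs (and `output` holds the decoder hidden states), the quoted two lines coincide with the paper's general score and only the variable name is confusing; whether that is the case depends on how `context` is produced upstream in the lesson-7 code.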
