This repository has been archived by the owner on Aug 18, 2021. It is now read-only.

seq2seq-translation-batched: Bahdanau attention does not work #82

Open
juditacs opened this issue Dec 28, 2017 · 2 comments

Comments

@juditacs

`__init__.py` fails with an `AttributeError`: `max_length` does not exist. Fixing that leads to a concat error in the `Attn` class:

    45         elif self.method == 'concat':
---> 46             energy = self.attn(torch.cat((hidden, encoder_output), 1))
     47             energy = self.v.dot(energy)
     48             return energy

RuntimeError: dimension out of range (expected to be in range of [-1, 0], but got 1)

Replacing the dimension in line 46 with 0 results in this error:

    44 
     45         elif self.method == 'concat':
---> 46             energy = self.attn(torch.cat((hidden, encoder_output), 0))
     47             energy = self.v.dot(energy)
     48             return energy

RuntimeError: inconsistent tensor sizes at /opt/conda/conda-bld/pytorch_1512386481460/work/torch/lib/THC/generic/THCTensorMath.cu:157
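
For reference, here is a minimal sketch (not the notebook's actual code; the names and shapes are assumptions) of a `concat` score where both inputs are explicitly flattened to `(batch_size, hidden_size)` before the `torch.cat`, so the dimensions line up:

```python
import torch
import torch.nn as nn

# Illustrative stand-in for the notebook's Attn parameters (method == 'concat').
hidden_size = 8
attn = nn.Linear(hidden_size * 2, hidden_size)   # plays the role of self.attn
v = nn.Parameter(torch.rand(hidden_size))        # plays the role of self.v

def concat_score(hidden, encoder_output):
    # hidden:         current decoder state, anything that flattens to (batch, hidden_size)
    # encoder_output: a single encoder time step, same flattened shape
    h = hidden.view(-1, hidden_size)
    e = encoder_output.view(-1, hidden_size)
    energy = torch.tanh(attn(torch.cat((h, e), dim=1)))  # (batch, hidden_size)
    return energy @ v                                     # (batch,) one score per example

# e.g. a (1, 1, hidden) decoder state scored against a (1, hidden) encoder output
print(concat_score(torch.rand(1, 1, hidden_size), torch.rand(1, hidden_size)).shape)
```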
@aevilorz

aevilorz commented Jul 5, 2018

I think #107 has a solution that computes the attention in matrix form.
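
For anyone landing here, a rough sketch of that idea (assumed shapes, not the exact code from #107): score all encoder positions in one batched pass instead of looping over time steps.

```python
import torch
import torch.nn as nn

# Assumed shapes: hidden (batch, hidden_size), encoder_outputs (max_len, batch, hidden_size).
hidden_size = 8
attn = nn.Linear(hidden_size * 2, hidden_size)
v = nn.Parameter(torch.rand(hidden_size))

def concat_attention(hidden, encoder_outputs):
    max_len = encoder_outputs.size(0)
    # Broadcast the decoder state across all encoder time steps, then score them at once.
    h = hidden.unsqueeze(0).expand(max_len, -1, -1)                    # (max_len, batch, hidden)
    energy = torch.tanh(attn(torch.cat((h, encoder_outputs), dim=2)))  # (max_len, batch, hidden)
    scores = energy.matmul(v)                                          # (max_len, batch)
    return torch.softmax(scores, dim=0)                                # weights sum to 1 over time

weights = concat_attention(torch.rand(4, hidden_size), torch.rand(10, 4, hidden_size))
print(weights.shape)  # torch.Size([10, 4])
```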

@anantzoid

anantzoid commented Oct 30, 2018

#119 addresses some of the mentioned issues.
