Connections with transformers? #44

askerlee · 2022-09-13T08:19:03Z

I just came across your paper and found that the formulation of co-attention is quite similar to transformers.

In particular, a few (but not all) of the major ingredients, namely the Q and V projections and attention computed with a softmax over dot products, also appear in transformers.

Considering your work was earlier than the transformer paper, do you think that it may have inspired transformers? Thanks.
