Seq2seq Attention

http://d2l.ai/chapter_attention-mechanisms/seq2seq-attention.html


Does seq2seq with attention work better than attending directly over the input embeddings?

If, instead of using a transformer, we simply employ self-attention as a pooling mechanism over the input embeddings, would the performance differ markedly from seq2seq with attention? Or is this dataset-specific?
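
For concreteness, here is a minimal sketch (in PyTorch, which the linked chapter also uses) of what I mean by "self-attention as a pooling mechanism over the input embeddings". The class name and layer sizes are just illustrative, not from the chapter; the seq2seq-with-attention baseline would instead run an RNN encoder and let the decoder attend over its hidden states at every step.

```python
import torch
from torch import nn

class SelfAttentionPooling(nn.Module):
    """Illustrative sketch: scaled dot-product self-attention applied
    directly to raw token embeddings, with no recurrent encoder."""
    def __init__(self, embed_dim):
        super().__init__()
        # Learned projections for queries, keys, and values
        self.W_q = nn.Linear(embed_dim, embed_dim)
        self.W_k = nn.Linear(embed_dim, embed_dim)
        self.W_v = nn.Linear(embed_dim, embed_dim)

    def forward(self, X):
        # X: (batch_size, seq_len, embed_dim) -- input embeddings
        Q, K, V = self.W_q(X), self.W_k(X), self.W_v(X)
        # Scaled dot-product attention weights over the input positions
        scores = Q @ K.transpose(1, 2) / (X.shape[-1] ** 0.5)
        weights = torch.softmax(scores, dim=-1)
        # Each output position is a weighted pooling of the value vectors
        return weights @ V  # (batch_size, seq_len, embed_dim)

# Toy usage: 2 sequences, 5 tokens each, 32-dim embeddings
embeddings = torch.randn(2, 5, 32)
pooled = SelfAttentionPooling(32)(embeddings)
print(pooled.shape)  # torch.Size([2, 5, 32])
```

The question is whether replacing the recurrent encoder with this kind of pooling (before feeding a decoder) would noticeably change translation quality, or whether any gap would depend mostly on the dataset.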