- Vanilla Seq2seq
- Conditioning decoder on external context
- Dynamic Decoder (teacher forcing removed)
- Language Modeling with encoder
- Bidirectional encoder
- RNNSearch: Soft Alignment
- Multi-turn Conversation Modeling with Hierarchical Recurrent Encoder-Decoder (HRED)
- Memory augmentation
- Ed Grefenstette, Beyond Sequence to Sequence with Augmented RNNs (video, slides)
- Neural Machine Translation by Jointly Learning to Align and Translate
- End-to-End Memory Networks
- Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models
- Advanced Seq2seq in TensorFlow
- Beam Search
- Multi-task Learning in TensorFlow
- Enhanced Attention-based Encoder-Decoder model for NMT
- Thank you, @ematvey, for helping me understand raw_rnn