NLP-Transformer_XL

An implementation of Transformer-XL in TensorFlow 2.0. One minor difference from the paper: here the gradient is allowed to propagate across segments, whereas the paper applies a stop-gradient to the hidden states cached from the previous segment.
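To illustrate what that difference means for the gradient, here is a minimal toy sketch in plain Python. It is not the repository's code: the scalar "model", the function name `run_two_segments`, and its parameters are all hypothetical, and the derivatives are written out by hand rather than computed by TensorFlow. The first segment produces a hidden state that is cached as memory; the second segment reuses it. With a stop-gradient the memory is treated as a constant, otherwise its dependence on the weight contributes an extra term.

```python
def run_two_segments(w, x1, x2, stop_gradient):
    """Toy two-segment recurrence (hypothetical, for illustration only).

    Returns (output, d_output/d_w), with the derivative derived by hand.
    """
    # Segment 1: hidden state and its derivative with respect to w.
    h1 = w * x1
    dh1_dw = x1

    # The cached memory carries the same value forward in both variants.
    # Only its derivative differs: zero when the gradient is stopped.
    mem = h1
    dmem_dw = 0.0 if stop_gradient else dh1_dw

    # Segment 2: reuses the memory from segment 1.
    h2 = w * (x2 + mem)
    # Product rule: d/dw [w * (x2 + mem)] = (x2 + mem) + w * dmem/dw
    dh2_dw = (x2 + mem) + w * dmem_dw
    return h2, dh2_dw


# Same forward value in both cases, different gradient.
out_sg, grad_sg = run_two_segments(2.0, 3.0, 5.0, stop_gradient=True)
out_full, grad_full = run_two_segments(2.0, 3.0, 5.0, stop_gradient=False)
print(out_sg, grad_sg)      # 22.0 11.0
print(out_full, grad_full)  # 22.0 17.0
```

The forward pass is identical in both variants. Letting the gradient flow adds the term that passes through the memory, at the cost of keeping earlier segments in the backward pass.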

Please note that this repository is still a work in progress.

Training

To process the data, first run

python process_reddit_jokes_subword.py

followed by

python train_reddit_jokes_subword_tf_ver2_gpt_xl.py

to train the model. Run

python infer_reddit_jokes_subword_tf_ver2_gpt_xl.py

to perform inference.
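The three steps above can also be chained from a small driver script. The sketch below is an assumption-laden convenience, not part of the repository: the names `PIPELINE_SCRIPTS`, `build_commands`, and `run_pipeline` are hypothetical, it assumes the scripts sit in the current working directory, and it uses the current interpreter (`sys.executable`) in place of the bare `python` command.

```python
import subprocess
import sys

# Script names are taken from the steps above, in the order they are run:
# process the data, train the model, then perform inference.
PIPELINE_SCRIPTS = [
    "process_reddit_jokes_subword.py",
    "train_reddit_jokes_subword_tf_ver2_gpt_xl.py",
    "infer_reddit_jokes_subword_tf_ver2_gpt_xl.py",
]


def build_commands(python=sys.executable):
    """Return one argument list per script (hypothetical helper)."""
    return [[python, script] for script in PIPELINE_SCRIPTS]


def run_pipeline(runner=subprocess.run):
    """Run each step in order; check=True aborts on the first failure."""
    for command in build_commands():
        runner(command, check=True)
```

Calling `run_pipeline()` from the repository root would run the steps in sequence and raise `subprocess.CalledProcessError` if any step exits with a non-zero status, so training does not start on unprocessed data.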
