This repository includes pytorch-lightning implementations of "Attention is All You Need" (Vaswani et al., NIPS 2017) and "Weighted Transformer Network for Machine Translation" (Ahmed et al., arXiv 2017).
- python >= 3.5
- torch >= 1.3.0
- pytorch-lightning >= 0.9.0
- torchtext >= 0.4.0
- spacy >= 2.2.2
- dill
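To confirm that an environment satisfies the minimum versions listed above, a small check along these lines may help. It is only a sketch: the helper names (`version_tuple`, `meets_minimum`, `check_requirements`) are hypothetical and not part of this repository, and it assumes Python 3.8+ so that `importlib.metadata` is available in the standard library.

```python
import re
from importlib import metadata  # standard library on Python 3.8+

# Minimum versions copied from the requirements list above.
MINIMUMS = {
    "torch": "1.3.0",
    "pytorch-lightning": "0.9.0",
    "torchtext": "0.4.0",
    "spacy": "2.2.2",
}


def version_tuple(version):
    """Turn '1.3.0+cu92' into (1, 3, 0); text after the numeric prefix is ignored."""
    match = re.match(r"\d+(?:\.\d+)*", version)
    parts = [int(p) for p in match.group(0).split(".")] if match else []
    # Pad to three components so that '1.3' compares equal to '1.3.0'.
    return tuple(parts + [0] * (3 - len(parts)))


def meets_minimum(installed, minimum):
    """Compare numerically, so that '1.13.1' counts as newer than '1.3.0'."""
    return version_tuple(installed) >= version_tuple(minimum)


def check_requirements(minimums=MINIMUMS):
    """Return {package: (installed_version_or_None, ok)} for each requirement."""
    report = {}
    for name, minimum in minimums.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            report[name] = (None, False)
            continue
        report[name] = (installed, meets_minimum(installed, minimum))
    return report


if __name__ == "__main__":
    for name, (installed, ok) in check_requirements().items():
        status = "ok" if ok else "missing or too old"
        print(f"{name}: {installed} ({status})")
```

Versions are compared as integer tuples rather than strings because a plain string comparison would rank `1.13.1` below `1.3.0`.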
- Generate the m30k_deen_shr.pkl file by following the repo jadore801120/attention-is-all-you-need.
- Run the training script.
  python run configs/transformer.py
- Show the training log.
  tensorboard --logdir work_dirs/logs/Transformer/0.1.0/
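As a sanity check on the generated dataset file before training, the pickle can be opened and summarized. The sketch below assumes the file is a dill-pickled dict (dill is in the requirements list) and makes no assumption about its key names. The helpers `load_dataset` and `summarize` are hypothetical and not part of this repository. Unpickling will most likely also need torchtext to be importable, since the stored objects were created with it.

```python
import sys
from collections.abc import Sized


def load_dataset(path):
    """Load the pickled dataset with dill."""
    import dill  # imported lazily so summarize() works without dill installed

    with open(path, "rb") as handle:
        return dill.load(handle)


def summarize(data):
    """Map each top-level key to its length (if sized) or to its type name."""
    summary = {}
    for key, value in data.items():
        if isinstance(value, Sized) and not isinstance(value, str):
            summary[key] = len(value)
        else:
            summary[key] = type(value).__name__
    return summary


if __name__ == "__main__":
    # Usage: python inspect_pkl.py path/to/m30k_deen_shr.pkl
    print(summarize(load_dataset(sys.argv[1])))
```

If the summary shows empty splits or unexpected keys, the preprocessing step is the first thing to revisit.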
Paper
- Vaswani et al., "Attention is All You Need", NIPS 2017
- Ahmed et al., "Weighted Transformer Network for Machine Translation", arXiv 2017
Code