German to English translation with Seq2Seq Transformer [1] in JAX.
- Install the requirements.
- Set the checkpoint directory in the config file.
- Run train.py to train a model and save a checkpoint.
- Run translate.py to load the checkpoint for translation.
- The code is tested on Python 3.11.
- This is an educational repo. The dataset is small and the translation quality is not great.
- The encoder part loosely follows [2] but is not the same code and is not guaranteed to produce the same results.
[1] Vaswani, Ashish, et al. "Attention is all you need." NeurIPS, (2017).