We examine the problem of controlling divergences for latent space regularisation in variational autoencoders. Specifically, when aiming to reconstruct example via latent space , while balancing this against the need for generalisable latent representations. We present a regularisation mechanism based on the skew geometric-Jensen-Shannon divergence . We find a variation in , motivated by limiting cases, which leads to an intuitive interpolation between forward and reverse KL in the space of both distributions and divergences. We motivate its potential benefits for VAEs through low-dimensional examples, before presenting quantitative and qualitative results. Our experiments demonstrate that skewing our variant of , in the context of -VAEs, leads to better reconstruction and generation when compared to several baseline VAEs. Our approach is entirely unsupervised and utilises only one hyperparameter which can be easily interpreted in latent space.
Paper https://arxiv.org/abs/2006.10599
The main package requirments are given in requirements.txt
To generate any of the models given in the paper, use the command:
python .\main.py <experiment_name> --dataset <dataset> --epochs <n_epochs> --latent-dim <n_latents> --rec-dist <reconstruction_loss> --loss <divergence_loss>
and add any loss-specific flags (e.g. --GJS-A
sets the alpha value in skew-geometric Jensen-Shannon divergence). All flags and options can be found in main.py
.
The notebooks
folder consists of several notebooks that recreate the experiments presented in the paper. However, the folder relies on a folder /results
that we have omitted due to size limitations, so each result from Figure 3 onwards relies on an experimental run.
In particular:
gjs.ipynb
anddgjs.ipynb
- showcase the usage of GJS-VAEs and GJS*-VAEs on the four datasets, respectively (Figure 3 in the main text and Appendix B)reconstruction.ipynb
andreconstruction_grid.ipynb
- output the reconstruction images of each experimentalpha_change.ipynb
- showcases experiments that investigate the influence of \alphagaussian_example.ipynb
- provides the examples given in Figure1 in the main text.comparison.py
- provides the examples given in Figure2 in the main text.
For attribution in academic contexts, please cite this work as
@inproceedings{deasy2020_gjs,
title={Constraining Variational Inference with Geometric Jensen-Shannon Divergence}
author={Jacob Deasy and Nikola Simidjievski and Pietro Li{\`o}},
booktitle = {Advances in Neural Information Processing Systems},
year={2020}
}