Goal of this educational repository is to provide a self-contained, minimalistic implementation of diffusion models using Pytorch.
Many implementations of diffusion models can be a bit overwhelming. Here, superminddpm
: under 200 lines of code, fully self contained implementation of DDPM with Pytorch is a good starting point for anyone who wants to get started with Denoising Diffusion Models, without having to spend time on the details.
Simply:
$ python superminddpm.py
Above script is self-contained. (Of course, you need to have pytorch and torchvision installed. Latest version should suffice. We do not use any cutting edge features.)
If you want to use the bit more refactored code, that runs CIFAR10 dataset:
$ python train_cifar10.py
Above result took about 2 hours of training on single 3090 GPU. Top 8 images are generated, bottom 8 are ground truth.
Here is another example, trained on 100 epochs (about 1.5 hours)
Currently has:
- Tiny implementation of DDPM
- MNIST, CIFAR dataset.
- Simple unet structure. + Simple Time embeddings.
- CelebA dataset.
TODOS
- DDIM
- Classifier Guidance
- Multimodality
- Using more parameter yields better result for MNIST.
- More comments in superminddpm.py