This repository containes the code to reproduce the results and figures of the paper: Image-to-Image MLP-mixer for Image Reconstruction.
We used 3 data sets in the paper. The first one is a mini version of ImageNet and can be downloaded from here. The second one is the Smart Phone Image Denoising Dataset (SIDD). Specifically, we used the SIDD-Small Dataset with sRGB images only, which can be directly downloaded here.
The data needs to be preprocessed before using it for training. A notebook called "prepare_data.ipynb" exists in each of the folders "ImageNet" and "SIDD" and is responsible for processing the data such that it's easily used for training, and then saving the new processed data. The file "config.py" in both folders should be edited with the path to the original dataset and the path in which the processed data should be saved.
The third one is the fastMRI knee dataset and can be downloaded from the official website.
The code is written entirely in python and relies mainly on Pytorch. It has been tested with the following:
- Operating System: Ubuntu 18.04.4
- Cuda: 11.0
- Python: 3.8.3
- Pytorch: 1.7.1
- Tensorflow: 2.3.0
- Jupyter Notebook: 6.3.0
- Numpy: 1.19.2
- Matplotlib: 3.3.4
- PIL: 7.2.0
- Skimage: 0.18.1
- BM3D: 3.0.9
- Einops: 0.3.0
- timm: 0.3.2
- runstats: 2.0.0
- h5py: 2.10.0
- tqdm: 4.46.0
The folders "ImageNet" and "SIDD" contain the denoising experiments. In the file "config.py", the path to store the trained networks should be entered. If you wish to use tensorboard while training, you can enter the path to store its logs in "config.py". After running "prepare_data.ipynb", the processed dataset will be saved. The notebook "train.ipynb" can now be run for training. At the beginning of the notebook, the hyperparameters for the optimizer can be configured and the network to be trained can be chosen. Once training is over, the notebook "evaluate.ipynb" can be run to display the denoising performance of the trained network. All networks used in the paper are in the folder "networks". Finally, the notebook "BM3D.ipynb" can be used to evaluate the BM3D algorithm.
The code to replicate the figures from the inductive bias section is in the folder "untrained". The car image used in these experiments is included in the folder.
The code to replicate the results from the compressive sensing section is in the folder "compressed_sensing". Before running the notebook "training_and_validation.ipynb", please adjust the data path, training data size, and select the model.
- The U-Net implementation and the fastmri library are obtained from the fastMRI repository.
- Our implementation of patch merging and expanding is adapted from Swin-Transformer and Swin-Unet.
- A Pytorch implementation of the original MLP-Mixer paper can be found here.
- Our implementation of the Vision Transformer is adapted from ConViT
Thank you for making your code publicly accessible.
@misc{
mansour2022imagetoimage,
title={Image-to-Image MLP-mixer for Image Reconstruction},
author={Youssef Mansour and Kang Lin and Reinhard Heckel},
year={2022},
eprint={2202.02018},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
This repository is MIT licensed.