Skip to content

Latest commit

 

History

History
98 lines (70 loc) · 2.35 KB

README.md

File metadata and controls

98 lines (70 loc) · 2.35 KB

Deep Fusion Generative Adversarial Networks for Text-to-Image Synthesis

A PyTorch implementation of Deep Fusion GAN by Ming Tao, Hao Tang, Songsong Wu, Nicu Sebe, Xiaoyuan Jing, Fei Wu, Bingkun Bao.

Dependencies

Dependencies:

python~=3.7.9
torch~=1.8.0
numpy~=1.21.4
pandas~=1.2.2
torchvision~=0.9.0
Pillow~=7.2.0
matplotlib~=3.3.4
tqdm~=4.62.3

To install required packages use:

pip install -r requirements.txt

Experiments

Use train_example.ipynb, metrics_evaluation.ipynb and eval_example.ipynb to train, eval and generation.

Deep Fusion GAN architecture

Drawing

The architecture of the proposed DF-GAN for text-to-image synthesis. DF-GAN generates high-resolution images directly by one pair of generator and discriminator and fuses the text information and visual feature maps through multiple Deep text-image Fusion Blocks (DFBlock) in UPBlocks.

Losses per first 100 epoch

Drawing

Metrics per epochs

Drawing

Final metrics

Ours Paper
IS 4.43 5.10
FID 18.10 21.42

Examples of generation

Common sample

Drawing

Example of sixteen generated birds.

A small yellow bird with black wings and crown

Drawing

This bird has wings that are red and has an orange bill

Drawing

A small yellow bird with green wings

Drawing

A small red bird has grey wings

Drawing

A small red bird has grey long wings

Drawing