
# CycleGAN (ICCV'2017)

> [CycleGAN: Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks](https://openaccess.thecvf.com/content_iccv_2017/html/Zhu_Unpaired_Image-To-Image_Translation_ICCV_2017_paper.html)

> **Task**: Image2Image

## Abstract

Image-to-image translation is a class of vision and graphics problems where the goal is to learn the mapping between an input image and an output image using a training set of aligned image pairs. However, for many tasks, paired training data will not be available. We present an approach for learning to translate an image from a source domain $X$ to a target domain $Y$ in the absence of paired examples. Our goal is to learn a mapping $G: X \rightarrow Y$ such that the distribution of images from $G(X)$ is indistinguishable from the distribution $Y$ using an adversarial loss. Because this mapping is highly under-constrained, we couple it with an inverse mapping $F: Y \rightarrow X$ and introduce a cycle consistency loss to push $F(G(X)) \approx X$ (and vice versa). Qualitative results are presented on several tasks where paired training data does not exist, including collection style transfer, object transfiguration, season transfer, photo enhancement, etc. Quantitative comparisons against several prior methods demonstrate the superiority of our approach.
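The cycle-consistency idea translates directly into a loss term. Below is a minimal PyTorch sketch, not the repository's implementation: the one-layer generators `G` and `F` are toy stand-ins for the paper's ResNet-based networks, and `lambda_cyc = 10` mirrors the cycle-loss weight used in the paper.

```python
import torch
import torch.nn as nn

# Toy stand-ins for the two generators; any X->Y / Y->X mapping
# with matching shapes works for illustration.
G = nn.Conv2d(3, 3, kernel_size=3, padding=1)  # G: X -> Y
F = nn.Conv2d(3, 3, kernel_size=3, padding=1)  # F: Y -> X

l1 = nn.L1Loss()

def cycle_consistency_loss(real_x, real_y, lambda_cyc=10.0):
    """L1 cycle loss pushing F(G(x)) ~= x and G(F(y)) ~= y."""
    rec_x = F(G(real_x))  # x -> G(x) -> F(G(x))
    rec_y = G(F(real_y))  # y -> F(y) -> G(F(y))
    return lambda_cyc * (l1(rec_x, real_x) + l1(rec_y, real_y))

# Example usage with random 256x256 "images":
# loss = cycle_consistency_loss(torch.randn(1, 3, 256, 256),
#                               torch.randn(1, 3, 256, 256))
```

In the full objective this term is added to an adversarial loss on each of the two mappings.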

## Results and Models

*Results from CycleGAN trained by mmagic*

We use the FID and IS metrics to evaluate the generation performance of CycleGAN<sup>1</sup>; a minimal sketch of the FID computation follows the table below.

| Model | Dataset           | FID      | IS    | Download |
| :---: | :---------------: | :------: | :---: | :------: |
| Ours  | facades           | 124.8033 | 1.792 | [model](https://download.openmmlab.com/mmediting/cyclegan/refactor/cyclegan_lsgan_resnet_in_1x1_80k_facades_20210902_165905-5e2c0876.pth) \| log<sup>2</sup> |
| Ours  | facades-id0       | 125.1694 | 1.905 | model |
| Ours  | summer2winter     | 83.7177  | 2.771 | model |
| Ours  | summer2winter-id0 | 83.1418  | 2.720 | model |
| Ours  | winter2summer     | 72.8025  | 3.129 | model |
| Ours  | winter2summer-id0 | 73.5001  | 3.107 | model |
| Ours  | horse2zebra       | 64.5225  | 1.418 | model |
| Ours  | horse2zebra-id0   | 74.7770  | 1.542 | model |
| Ours  | zebra2horse       | 141.1517 | 3.154 | model |
| Ours  | zebra2horse-id0   | 134.3728 | 3.091 | model |
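For reference, FID measures the Fréchet distance between Gaussian fits of Inception features for real and generated images; lower is better. A minimal sketch of the standard computation (not mmagic's exact evaluation code; `feats_real` / `feats_fake` are assumed to be pre-extracted Inception-v3 feature arrays of shape `[N, D]`):

```python
import numpy as np
from scipy import linalg

def fid(feats_real, feats_fake):
    """Frechet Inception Distance between two feature sets of shape [N, D]."""
    mu_r, mu_f = feats_real.mean(axis=0), feats_fake.mean(axis=0)
    cov_r = np.cov(feats_real, rowvar=False)
    cov_f = np.cov(feats_fake, rowvar=False)
    # Matrix square root of the covariance product; tiny imaginary
    # components from numerical error are discarded.
    covmean, _ = linalg.sqrtm(cov_r @ cov_f, disp=False)
    if np.iscomplexobj(covmean):
        covmean = covmean.real
    return float(((mu_r - mu_f) ** 2).sum()
                 + np.trace(cov_r + cov_f - 2.0 * covmean))
```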

FID comparison with the official implementation:

| Dataset  | facades  | facades-id0 | summer2winter | summer2winter-id0 | winter2summer | winter2summer-id0 | horse2zebra | horse2zebra-id0 | zebra2horse | zebra2horse-id0 | average |
| :------: | :------: | :---------: | :-----------: | :---------------: | :-----------: | :---------------: | :---------: | :-------------: | :---------: | :-------------: | :-----: |
| official | 123.626  | 119.726     | 77.342        | 76.773            | 72.631        | 74.239            | 62.111      | 77.202          | 138.646     | 137.050         | 95.935  |
| ours     | 124.8033 | 125.1694    | 83.7177       | 83.1418           | 72.8025       | 73.5001           | 64.5225     | 74.7770         | 141.1517    | 134.3728        | 97.79   |

IS comparison with the official implementation:

| Dataset  | facades | facades-id0 | summer2winter | summer2winter-id0 | winter2summer | winter2summer-id0 | horse2zebra | horse2zebra-id0 | zebra2horse | zebra2horse-id0 | average |
| :------: | :-----: | :---------: | :-----------: | :---------------: | :-----------: | :---------------: | :---------: | :-------------: | :---------: | :-------------: | :-----: |
| official | 1.638   | 1.697       | 2.762         | 2.750             | 3.293         | 3.110             | 1.375       | 1.584           | 3.186       | 3.047           | 2.444   |
| ours     | 1.792   | 1.905       | 2.771         | 2.720             | 3.129         | 3.107             | 1.418       | 1.542           | 3.154       | 3.091           | 2.462   |
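Similarly, IS (Inception Score) exponentiates the average KL divergence between per-image class predictions and their marginal; higher is better. A minimal sketch over pre-computed softmax outputs (`probs` is a hypothetical input, not mmagic's implementation, which conventionally also averages over multiple splits):

```python
import numpy as np

def inception_score(probs, eps=1e-12):
    """IS from Inception softmax outputs probs of shape [N, C]."""
    p_y = probs.mean(axis=0, keepdims=True)              # marginal p(y)
    kl = (probs * (np.log(probs + eps) - np.log(p_y + eps))).sum(axis=1)
    return float(np.exp(kl.mean()))                      # exp(E_x KL(p(y|x) || p(y)))
```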

Note:

1. With a larger identity loss weight, the image-to-image translation becomes more conservative and makes fewer changes. The original authors did not state the best weight for the identity loss, so in addition to the default setting we also set the identity loss weight to 0 (denoted `id0`) for a more comprehensive comparison. A rough sketch of the identity term follows these notes.
2. This is the training log from before the refactoring. Updated logs will be released soon.
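The identity term from Note 1 can be sketched as below. This is a hypothetical helper, not the repository's code; the `lambda_idt = 0.5` default follows the official CycleGAN implementation, and `lambda_idt = 0` corresponds to the `id0` rows above.

```python
import torch
import torch.nn as nn

l1 = nn.L1Loss()

def identity_loss(G, F, real_x, real_y, lambda_idt=0.5, lambda_cyc=10.0):
    """Identity term encouraging G(y) ~= y and F(x) ~= x.

    lambda_idt = 0 disables the term (the `id0` setting above).
    """
    if lambda_idt == 0:
        return torch.zeros(())
    idt_y = G(real_y)  # G applied to a target-domain image should be a no-op
    idt_x = F(real_x)  # likewise for F on a source-domain image
    return lambda_cyc * lambda_idt * (l1(idt_y, real_y) + l1(idt_x, real_x))
```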

## Citation

```bibtex
@inproceedings{zhu2017unpaired,
  title={Unpaired image-to-image translation using cycle-consistent adversarial networks},
  author={Zhu, Jun-Yan and Park, Taesung and Isola, Phillip and Efros, Alexei A},
  booktitle={Proceedings of the IEEE international conference on computer vision},
  pages={2223--2232},
  year={2017},
  url={https://openaccess.thecvf.com/content_iccv_2017/html/Zhu_Unpaired_Image-To-Image_Translation_ICCV_2017_paper.html},
}
```