Multi-mapping Image-to-Image Translation via Learning Disentanglement

September 17, 2019 · Entered Twilight · 🏛 Neural Information Processing Systems

🌅 TWILIGHT: Old Age
Predates the code-sharing era - a pioneer of its time

"Last commit was 5.0 years ago (โ‰ฅ5 year threshold)"

Evidence collected by the PWNC Scanner

Repo contents: LICENSE, README.md, data, encoder, images, models, options, requirements.txt, scripts, test.py, train.py, util

Authors: Xiaoming Yu, Yuanqi Chen, Thomas Li, Shan Liu, Ge Li
arXiv ID: 1909.07877
Category: cs.CV (Computer Vision)
Citations: 109
Venue: Neural Information Processing Systems
Repository: https://github.com/Xiaoming-Yu/DMIT (⭐ 114)
Last checked: 2 months ago
Abstract
Recent advances in image-to-image translation approach one-to-many mapping from two directions: multi-modal translation and multi-domain translation. However, existing methods consider only one of the two perspectives, so each is unable to solve the other's problem. To address this, we propose a novel unified model that bridges the two objectives. First, we disentangle input images into latent representations using an encoder-decoder architecture with conditional adversarial training in the feature space. Then, we encourage the generator to learn multi-mappings via random cross-domain translation. As a result, different parts of the latent representation can be manipulated to perform multi-modal and multi-domain translation simultaneously. Experiments demonstrate that our method outperforms state-of-the-art methods.
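For readers skimming before cloning the repo, here is a minimal PyTorch sketch of the pipeline the abstract describes: an encoder disentangles an image into a spatial content map and a style code, and a generator decodes the content under a randomly sampled style and target-domain label (the random cross-domain translation step). All module and variable names here are hypothetical illustrations, not the DMIT repository's actual API, and the training losses (including the conditional adversarial loss in feature space) are omitted.

```python
import torch
import torch.nn as nn

# Hypothetical sizes for illustration; the real DMIT code defines its own.
IMG_CH, STYLE_DIM, N_DOMAINS = 3, 8, 3

class Encoder(nn.Module):
    """Disentangles an image into a spatial content map and a style code."""
    def __init__(self):
        super().__init__()
        self.content = nn.Sequential(
            nn.Conv2d(IMG_CH, 64, 4, 2, 1), nn.ReLU(),
            nn.Conv2d(64, 128, 4, 2, 1), nn.ReLU(),
        )
        self.style = nn.Sequential(
            nn.Conv2d(IMG_CH, 64, 4, 2, 1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(64, STYLE_DIM),
        )

    def forward(self, x):
        return self.content(x), self.style(x)

class Generator(nn.Module):
    """Decodes a content map conditioned on a style code and a domain label."""
    def __init__(self):
        super().__init__()
        self.fc = nn.Linear(STYLE_DIM + N_DOMAINS, 128)
        self.decode = nn.Sequential(
            nn.ConvTranspose2d(128, 64, 4, 2, 1), nn.ReLU(),
            nn.ConvTranspose2d(64, IMG_CH, 4, 2, 1), nn.Tanh(),
        )

    def forward(self, content, style, domain_onehot):
        cond = self.fc(torch.cat([style, domain_onehot], dim=1))
        # Modulate content features with the condition -- a crude stand-in
        # for the conditional normalization layers used in practice.
        return self.decode(content * cond[:, :, None, None])

enc, gen = Encoder(), Generator()
x = torch.randn(4, IMG_CH, 64, 64)   # a batch of source images
content, style = enc(x)              # disentangled latent representations

# Random cross-domain translation: sample target domains and fresh style
# codes, then decode the shared content under the sampled condition.
target = torch.eye(N_DOMAINS)[torch.randint(N_DOMAINS, (4,))]
random_style = torch.randn(4, STYLE_DIM)
fake = gen(content, random_style, target)
print(fake.shape)  # torch.Size([4, 3, 64, 64])
```

Resampling random_style varies the output within a domain (multi-modal), while changing target switches domains (multi-domain); because both enter only through the conditioning vector, a single generator covers both mappings, which is the multi-mapping behavior the abstract claims.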
