Image-to-image Translation Based on Improved Cycle-consistent Generative Adversarial Network
doi: 10.11999/JEIT190407
1. School of Electrical and Electronic Engineering, Tianjin University of Technology, Tianjin 300384, China
2. Tianjin Key Laboratory of Complex System Control Theory and Application, Tianjin 300384, China
Abstract:
Image-to-image translation is a class of methods for converting images between different domains. With the rapid development of Generative Adversarial Networks (GANs) in deep learning, their application to image-to-image translation has attracted increasing attention. Classical algorithms, however, suffer from two drawbacks: paired training data are hard to obtain, and the generated images are of poor quality. This paper proposes an improved cycle-consistent generative adversarial network (CycleGAN++). The new algorithm removes the cyclic network and, in the image generation stage, concatenates the prior information of the target and source domains with the corresponding images along the depth (channel) dimension. The loss function is also optimized, replacing the cycle-consistency loss with a classification loss, so that image-to-image translation no longer depends on a mapping between training data. Evaluations on the CelebA and Cityscapes datasets show that, under two classical criteria, Amazon Mechanical Turk perceptual studies (AMT perceptual studies) and the Fully Convolutional Network score (FCN score), the proposed algorithm achieves higher accuracy than classical algorithms such as CycleGAN, IcGAN, CoGAN, and DIAT.
Keywords:
- Image-to-image translation /
- Deep learning /
- Generative adversarial network /
- Loss function
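The two changes described in the abstract — conditioning the generator by depth-wise (channel) concatenation of domain prior information with the input image, and replacing the cycle-consistency loss with a classification loss — can be illustrated in a few lines. The PyTorch sketch below is a minimal illustration under stated assumptions, not the paper's implementation: it assumes the prior information is a one-hot domain code broadcast to constant spatial maps, and that a classifier head on the discriminator produces per-domain logits for generated images.

```python
import torch
import torch.nn as nn

def concat_domain_prior(image: torch.Tensor, domain_label: torch.Tensor) -> torch.Tensor:
    """Depth-wise (channel) concatenation of a domain prior with an image.

    image:        (N, 3, H, W) source-domain image
    domain_label: (N, C) one-hot code of the target domain (assumed encoding)
    returns:      (N, 3 + C, H, W) tensor fed to the generator
    """
    n, c = domain_label.shape
    h, w = image.shape[2], image.shape[3]
    # Broadcast the one-hot code into C constant spatial feature maps.
    maps = domain_label.view(n, c, 1, 1).expand(n, c, h, w)
    return torch.cat([image, maps], dim=1)

# Classification loss in place of cycle consistency: a domain classifier
# predicts which domain a generated image belongs to, and the generator is
# penalized when the prediction misses the intended target domain.
cls_criterion = nn.CrossEntropyLoss()

def generator_cls_loss(domain_logits: torch.Tensor, target_domain: torch.Tensor) -> torch.Tensor:
    """domain_logits: (N, C) logits from the (hypothetical) classifier head.
    target_domain:    (N,) integer index of the intended target domain."""
    return cls_criterion(domain_logits, target_domain)
```

Conditioning a single generator on a domain code in this way is what lets the architecture drop the second, "loop" generator: one network can map in either direction given the appropriate prior.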
Table 1  Comparison of AMT test results: CycleGAN+ vs. the original algorithm (%)
Method     male→female  female→male  photo→label  label→photo
CycleGAN   24.6±2.3     21.1±1.8     26.8±2.8     23.2±3.4
CycleGAN+  29.5±3.2     29.2±4.1     27.8±2.2     28.2±2.4
Table 2  Comparison of FCN scores: CycleGAN+ vs. the original algorithm
Method     Per-pixel accuracy  Per-class accuracy  Class IoU
CycleGAN   0.52                0.17                0.11
CycleGAN+  0.60                0.21                0.16
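For context, the three FCN-score columns follow the standard semantic-segmentation definitions: per-pixel accuracy is the fraction of correctly labeled pixels, per-class accuracy averages that fraction over classes, and class IoU averages intersection-over-union over classes. The NumPy sketch below is a generic implementation of these metrics from flattened integer label maps; the function name and arguments are illustrative, not the paper's evaluation script.

```python
import numpy as np

def fcn_score_metrics(pred: np.ndarray, gt: np.ndarray, num_classes: int):
    """Per-pixel accuracy, per-class accuracy, and class IoU from
    flattened predicted (pred) and ground-truth (gt) label arrays."""
    mask = (gt >= 0) & (gt < num_classes)
    # Confusion matrix: rows = ground truth, columns = prediction.
    cm = np.bincount(num_classes * gt[mask] + pred[mask],
                     minlength=num_classes ** 2).reshape(num_classes, num_classes)
    with np.errstate(invalid="ignore", divide="ignore"):
        per_pixel_acc = np.diag(cm).sum() / cm.sum()
        per_class_acc = np.nanmean(np.diag(cm) / cm.sum(axis=1))
        iou = np.diag(cm) / (cm.sum(axis=1) + cm.sum(axis=0) - np.diag(cm))
        class_iou = np.nanmean(iou)
    return per_pixel_acc, per_class_acc, class_iou
```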
Table 3  Comparison of AMT perceptual study results: CycleGAN++ vs. CycleGAN+ (%)
Method             male→female  female→male  photo→label  label→photo
CycleGAN+          29.5±3.2     29.2±4.1     27.8±2.2     28.2±2.4
CycleGAN++ (ours)  31.4±3.8     32.6±4.7     30.1±2.6     30.9±2.7
Table 4  Comparison of FCN scores: CycleGAN++ vs. CycleGAN+
Method             Per-pixel accuracy  Per-class accuracy  Class IoU
CycleGAN+          0.60                0.21                0.16
CycleGAN++ (ours)  0.69                0.27                0.23
References
[1] HERTZMANN A, JACOBS C E, OLIVER N, et al. Image analogies[C]. The 28th Annual Conference on Computer Graphics and Interactive Techniques, New York, USA, 2001: 327–340. doi: 10.1145/383259.383295.
[2] GOODFELLOW I J, POUGET-ABADIE J, MIRZA M, et al. Generative adversarial nets[C]. The 27th International Conference on Neural Information Processing Systems, Montreal, Canada, 2014: 2672–2680.
[3] RADFORD A, METZ L, and CHINTALA S. Unsupervised representation learning with deep convolutional generative adversarial networks[EB/OL]. https://arxiv.org/abs/1511.06434, 2015.
[4] ARJOVSKY M, CHINTALA S, and BOTTOU L. Wasserstein GAN[EB/OL]. https://arxiv.org/abs/1701.07875, 2017.
[5] GULRAJANI I, AHMED F, ARJOVSKY M, et al. Improved training of Wasserstein GANs[C]. The 31st International Conference on Neural Information Processing Systems, Red Hook, USA, 2017: 5769–5779.
[6] ISOLA P, ZHU Junyan, ZHOU Tinghui, et al. Image-to-image translation with conditional adversarial networks[C]. 2017 IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, USA, 2017: 5967–5976. doi: 10.1109/CVPR.2017.632.
[7] MIRZA M and OSINDERO S. Conditional generative adversarial nets[EB/OL]. https://arxiv.org/abs/1411.1784, 2014.
[8] ROSALES R, ACHAN K, and FREY B. Unsupervised image translation[C]. The 9th IEEE International Conference on Computer Vision, Nice, France, 2003: 472–478. doi: 10.1109/ICCV.2003.1238384.
[9] LIU Mingyu, BREUEL T, KAUTZ J, et al. Unsupervised image-to-image translation networks[C]. The 31st Conference on Neural Information Processing Systems, Long Beach, USA, 2017: 700–708.
[10] LIU Mingyu and TUZEL O. Coupled generative adversarial networks[C]. The 30th Conference on Neural Information Processing Systems, Barcelona, Spain, 2016: 469–477.
[11] KINGMA D P and WELLING M. Auto-encoding variational Bayes[EB/OL]. https://arxiv.org/abs/1312.6114, 2013.
[12] ZHU Junyan, PARK T, ISOLA P, et al. Unpaired image-to-image translation using cycle-consistent adversarial networks[C]. 2017 IEEE International Conference on Computer Vision, Venice, Italy, 2017: 2242–2251. doi: 10.1109/ICCV.2017.244.
[13] KIM T, CHA M, KIM H, et al. Learning to discover cross-domain relations with generative adversarial networks[C]. The 34th International Conference on Machine Learning, Sydney, Australia, 2017: 1857–1865.
[14] YI Zili, ZHANG Hao, TAN Ping, et al. DualGAN: Unsupervised dual learning for image-to-image translation[C]. 2017 IEEE International Conference on Computer Vision, Venice, Italy, 2017: 2868–2876. doi: 10.1109/ICCV.2017.310.
[15] BOUSMALIS K, SILBERMAN N, DOHAN D, et al. Unsupervised pixel-level domain adaptation with generative adversarial networks[C]. 2017 IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, USA, 2017: 95–104. doi: 10.1109/CVPR.2017.18.
[16] SHRIVASTAVA A, PFISTER T, TUZEL O, et al. Learning from simulated and unsupervised images through adversarial training[C]. 2017 IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, USA, 2017: 2242–2251. doi: 10.1109/CVPR.2017.241.
[17] TAIGMAN Y, POLYAK A, and WOLF L. Unsupervised cross-domain image generation[EB/OL]. https://arxiv.org/abs/1611.02200, 2016.
[18] LI Chuan and WAND M. Precomputed real-time texture synthesis with Markovian generative adversarial networks[C]. The 14th European Conference on Computer Vision, Amsterdam, The Netherlands, 2016: 702–716. doi: 10.1007/978-3-319-46487-9_43.
[19] LIU Ziwei, LUO Ping, WANG Xiaogang, et al. Deep learning face attributes in the wild[C]. 2015 IEEE International Conference on Computer Vision, Santiago, Chile, 2015: 3730–3738. doi: 10.1109/ICCV.2015.425.
[20] KINGMA D P and BA J. Adam: A method for stochastic optimization[EB/OL]. https://arxiv.org/abs/1412.6980, 2014.
[21] LI Mu, ZUO Wangmeng, and ZHANG D. Deep identity-aware transfer of facial attributes[EB/OL]. https://arxiv.org/abs/1610.05586, 2016.
[22] PERARNAU G, VAN DE WEIJER J, RADUCANU B, et al. Invertible conditional GANs for image editing[EB/OL]. https://arxiv.org/abs/1611.06355, 2016.