Multi-style image generation based on semantic image

Yue Yu; Ding Li; Benyuan Li; Nengli Li

doi:10.1007/s00371-023-03042-2

Yue Yu*, Ding Li, Benyuan Li, Nengli Li

*Corresponding author for this work

School of Computer Science and Technology

Beijing Institute of Technology

Research output: Contribution to journal › Article › peer-review

5 Citations (Scopus)

Abstract

Image generation has always been one of the important research directions in the field of computer vision. It has rich applications in virtual reality, image design, and video synthesis. Our experiments proved that the proposed multi-style image generative network can efficiently generate high-quality images with different artistic styles based on the semantic images. Compared with the current state-of-the-art methods, the result generation speed of our proposed method is the fastest. In this paper, we focus on implementing arbitrary style transfer based on semantic images with high resolution (512×1024). We propose a new multi-channel generative adversarial network which uses fewer parameters to generate multi-style images. The network framework consists of a content feature extraction network, a style feature extraction network, and a content-stylistic feature fusion network. Our qualitative experiments show that the proposed multi-style image generation network can efficiently generate semantic-based, high-quality images with multiple artistic styles and with greater clarity and richer details. We adopt a user preference study, and the results show that the results generated by our method are more popular. Our speed study shows that our proposed method has the fastest result generation speed compared to the current state-of-the-art methods. We publicly release the source code of our project, which can be accessed at https://github.com/JuanMaoHSQ/Multi-style-image-generation-based-on-semantic-image.

Original language	English
Pages (from-to)	3411-3426
Number of pages	16
Journal	Visual Computer
Volume	40
Issue number	5
DOIs	https://doi.org/10.1007/s00371-023-03042-2
Publication status	Published - May 2024

Keywords

Generative adversarial network
Image generation
Style transfer

Access to Document

https://doi.org/10.1007/s00371-023-03042-2

Cite this

Yu, Y., Li, D., Li, B., & Li, N. (2024). Multi-style image generation based on semantic image. Visual Computer, 40(5), 3411-3426. https://doi.org/10.1007/s00371-023-03042-2

@article{0aa0c1bdc3004341ae8b580388c44279,

title = "Multi-style image generation based on semantic image",

abstract = "Image generation has always been one of the important research directions in the field of computer vision. It has rich applications in virtual reality, image design, and video synthesis. Our experiments proved that the proposed multi-style image generative network can efficiently generate high-quality images with different artistic styles based on the semantic images. Compared with the current state-of-the-art methods, the result generation speed of our proposed method is the fastest. In this paper, we focus on implementing arbitrary style transfer based on semantic images with high resolution (512×1024). We propose a new multi-channel generative adversarial network which uses fewer parameters to generate multi-style images. The network framework consists of a content feature extraction network, a style feature extraction network, and a content-stylistic feature fusion network. Our qualitative experiments show that the proposed multi-style image generation network can efficiently generate semantic-based, high-quality images with multiple artistic styles and with greater clarity and richer details. We adopt a user preference study, and the results show that the results generated by our method are more popular. Our speed study shows that our proposed method has the fastest result generation speed compared to the current state-of-the-art methods. We publicly release the source code of our project, which can be accessed at https://github.com/JuanMaoHSQ/Multi-style-image-generation-based-on-semantic-image.",

keywords = "Generative adversarial network, Image generation, Style transfer",

author = "Yue Yu and Ding Li and Benyuan Li and Nengli Li",

note = "Publisher Copyright: {\textcopyright} The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2023.",

year = "2024",

month = may,

doi = "10.1007/s00371-023-03042-2",

language = "English",

volume = "40",

pages = "3411--3426",

journal = "Visual Computer",

issn = "0178-2789",

publisher = "Springer Verlag",

number = "5",

}

TY - JOUR

T1 - Multi-style image generation based on semantic image

AU - Yu, Yue

AU - Li, Ding

AU - Li, Benyuan

AU - Li, Nengli

N1 - Publisher Copyright: © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2023.

PY - 2024/5

Y1 - 2024/5

N2 - Image generation has always been one of the important research directions in the field of computer vision. It has rich applications in virtual reality, image design, and video synthesis. Our experiments proved that the proposed multi-style image generative network can efficiently generate high-quality images with different artistic styles based on the semantic images. Compared with the current state-of-the-art methods, the result generation speed of our proposed method is the fastest. In this paper, we focus on implementing arbitrary style transfer based on semantic images with high resolution (512×1024). We propose a new multi-channel generative adversarial network which uses fewer parameters to generate multi-style images. The network framework consists of a content feature extraction network, a style feature extraction network, and a content-stylistic feature fusion network. Our qualitative experiments show that the proposed multi-style image generation network can efficiently generate semantic-based, high-quality images with multiple artistic styles and with greater clarity and richer details. We adopt a user preference study, and the results show that the results generated by our method are more popular. Our speed study shows that our proposed method has the fastest result generation speed compared to the current state-of-the-art methods. We publicly release the source code of our project, which can be accessed at https://github.com/JuanMaoHSQ/Multi-style-image-generation-based-on-semantic-image.

AB - Image generation has always been one of the important research directions in the field of computer vision. It has rich applications in virtual reality, image design, and video synthesis. Our experiments proved that the proposed multi-style image generative network can efficiently generate high-quality images with different artistic styles based on the semantic images. Compared with the current state-of-the-art methods, the result generation speed of our proposed method is the fastest. In this paper, we focus on implementing arbitrary style transfer based on semantic images with high resolution (512×1024). We propose a new multi-channel generative adversarial network which uses fewer parameters to generate multi-style images. The network framework consists of a content feature extraction network, a style feature extraction network, and a content-stylistic feature fusion network. Our qualitative experiments show that the proposed multi-style image generation network can efficiently generate semantic-based, high-quality images with multiple artistic styles and with greater clarity and richer details. We adopt a user preference study, and the results show that the results generated by our method are more popular. Our speed study shows that our proposed method has the fastest result generation speed compared to the current state-of-the-art methods. We publicly release the source code of our project, which can be accessed at https://github.com/JuanMaoHSQ/Multi-style-image-generation-based-on-semantic-image.

KW - Generative adversarial network

KW - Image generation

KW - Style transfer

UR - http://www.scopus.com/inward/record.url?scp=85167513628&partnerID=8YFLogxK

U2 - 10.1007/s00371-023-03042-2

DO - 10.1007/s00371-023-03042-2

M3 - Article

AN - SCOPUS:85167513628

SN - 0178-2789

VL - 40

SP - 3411

EP - 3426

JO - Visual Computer

JF - Visual Computer

IS - 5

ER -
