TY - GEN
T1 - Image Realism Enhancement Method Based on Improved Neural Neighbor Style Transfer
AU - Lin, Jiayi
AU - Chen, Wenjie
AU - Long, Zhiqi
AU - Yuan, Yu
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Object detection technology is a very important technique that relies on a large amount of data for training. However, obtaining sufficient data for specific detection targets can be challenging. In cases where data are limited, it is necessary to increase the number of datasets using various methods. We propose an improved algorithm based on the Neural Neighbor Style Transfer (NNST) algorithm to better adapt to the task of realistic style transfer. Through comparative experiments, we find that image realism could be enhanced when the alpha value is 0.75 and content loss, color correction, and high pixel output are used. Additionally, we design a more effective feature extraction network called SE-VGG19. Compared to the original VGG16, SE-VGG19 can improve the network’s ability to perceive style and extract features, making the generated images match the target style better while preserving the original content features. Furthermore, we suggest using the center cosine distance instead of the original Euclidean distance for loss measurement. After comparison and verification, our method has been proven to improve image realism compared to the original algorithm greatly.
AB - Object detection technology is a very important technique that relies on a large amount of data for training. However, obtaining sufficient data for specific detection targets can be challenging. In cases where data are limited, it is necessary to increase the number of datasets using various methods. We propose an improved algorithm based on the Neural Neighbor Style Transfer (NNST) algorithm to better adapt to the task of realistic style transfer. Through comparative experiments, we find that image realism could be enhanced when the alpha value is 0.75 and content loss, color correction, and high pixel output are used. Additionally, we design a more effective feature extraction network called SE-VGG19. Compared to the original VGG16, SE-VGG19 can improve the network’s ability to perceive style and extract features, making the generated images match the target style better while preserving the original content features. Furthermore, we suggest using the center cosine distance instead of the original Euclidean distance for loss measurement. After comparison and verification, our method has been proven to improve image realism compared to the original algorithm greatly.
KW - Image Realism Enhancement
KW - Image Style Transfer
KW - Neural Neighbor Style Transfer
KW - SE-VGG19
UR - http://www.scopus.com/inward/record.url?scp=86000757123&partnerID=8YFLogxK
U2 - 10.1109/CAC63892.2024.10865483
DO - 10.1109/CAC63892.2024.10865483
M3 - Conference contribution
AN - SCOPUS:86000757123
T3 - Proceedings - 2024 China Automation Congress, CAC 2024
SP - 622
EP - 627
BT - Proceedings - 2024 China Automation Congress, CAC 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 China Automation Congress, CAC 2024
Y2 - 1 November 2024 through 3 November 2024
ER -