TY - JOUR
T1 - PCSGAN
T2 - A Perceptual Constrained Generative Model for Railway Defect Sample Expansion From a Single Image
AU - He, Sen
AU - Jian, Zehua
AU - Liu, Shaoli
AU - Liu, Jianhua
AU - Fang, Yue
AU - Hu, Jia
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - — Many deep learning based railway defect detection methods have been proposed in recent years. They have greatly improved the efficiency and accuracy of defects detection. However, detection of railway defects remains challenging because of the limited number and types of defect samples, thus, general deep learning methods cannot be applied. In this paper, we designed a “Perceptually Constrained Single Image Generative Adversarial Network” (PCSGAN) to expand the number of railway defect image samples. PCSGAN uses a pyramidal structure to learn the internal features of a single image. In addition, we proposed a masking and a perceptual reconstruction loss mechanism to impose specific positional and structural constraints on the images. We tested the method using railway defects images and compared it to other single image generation models. The experiment results show that the images generated by PCSGAN take into account railway prior knowledge, generate railway structure which satisfied the constraints imposed by railway infrastructure designs, and also provide new information. High image realism and lowest Single Image FID were obtained, and the effectiveness of PCSGAN in the defect detection task were also validated.
AB - — Many deep learning based railway defect detection methods have been proposed in recent years. They have greatly improved the efficiency and accuracy of defects detection. However, detection of railway defects remains challenging because of the limited number and types of defect samples, thus, general deep learning methods cannot be applied. In this paper, we designed a “Perceptually Constrained Single Image Generative Adversarial Network” (PCSGAN) to expand the number of railway defect image samples. PCSGAN uses a pyramidal structure to learn the internal features of a single image. In addition, we proposed a masking and a perceptual reconstruction loss mechanism to impose specific positional and structural constraints on the images. We tested the method using railway defects images and compared it to other single image generation models. The experiment results show that the images generated by PCSGAN take into account railway prior knowledge, generate railway structure which satisfied the constraints imposed by railway infrastructure designs, and also provide new information. High image realism and lowest Single Image FID were obtained, and the effectiveness of PCSGAN in the defect detection task were also validated.
KW - Deep learning
KW - generative adversarial network
KW - image generation
KW - railway defect detection
UR - http://www.scopus.com/inward/record.url?scp=85186963352&partnerID=8YFLogxK
U2 - 10.1109/TITS.2024.3368213
DO - 10.1109/TITS.2024.3368213
M3 - Article
AN - SCOPUS:85186963352
SN - 1524-9050
VL - 25
SP - 8796
EP - 8806
JO - IEEE Transactions on Intelligent Transportation Systems
JF - IEEE Transactions on Intelligent Transportation Systems
IS - 8
ER -