PCSGAN: A Perceptual Constrained Generative Model for Railway Defect Sample Expansion From a Single Image

Sen He; Zehua Jian; Shaoli Liu; Jianhua Liu; Yue Fang; Jia Hu

doi:10.1109/TITS.2024.3368213

PCSGAN: A Perceptual Constrained Generative Model for Railway Defect Sample Expansion From a Single Image

Sen He, Zehua Jian, Shaoli Liu, Jianhua Liu, Yue Fang, Jia Hu

School of Mechanical Engineering

Research output: Contribution to journal › Article › peer-review

2 Citations (Scopus)

Abstract

— Many deep learning based railway defect detection methods have been proposed in recent years. They have greatly improved the efficiency and accuracy of defects detection. However, detection of railway defects remains challenging because of the limited number and types of defect samples, thus, general deep learning methods cannot be applied. In this paper, we designed a “Perceptually Constrained Single Image Generative Adversarial Network” (PCSGAN) to expand the number of railway defect image samples. PCSGAN uses a pyramidal structure to learn the internal features of a single image. In addition, we proposed a masking and a perceptual reconstruction loss mechanism to impose specific positional and structural constraints on the images. We tested the method using railway defects images and compared it to other single image generation models. The experiment results show that the images generated by PCSGAN take into account railway prior knowledge, generate railway structure which satisfied the constraints imposed by railway infrastructure designs, and also provide new information. High image realism and lowest Single Image FID were obtained, and the effectiveness of PCSGAN in the defect detection task were also validated.

Original language	English
Pages (from-to)	8796-8806
Number of pages	11
Journal	IEEE Transactions on Intelligent Transportation Systems
Volume	25
Issue number	8
DOIs	https://doi.org/10.1109/TITS.2024.3368213
Publication status	Published - 2024

Keywords

Deep learning
generative adversarial network
image generation
railway defect detection

Access to Document

10.1109/TITS.2024.3368213

Cite this

@article{adca16a93c274fe384a37bbba629b8cb,

title = "PCSGAN: A Perceptual Constrained Generative Model for Railway Defect Sample Expansion From a Single Image",

abstract = "— Many deep learning based railway defect detection methods have been proposed in recent years. They have greatly improved the efficiency and accuracy of defects detection. However, detection of railway defects remains challenging because of the limited number and types of defect samples, thus, general deep learning methods cannot be applied. In this paper, we designed a “Perceptually Constrained Single Image Generative Adversarial Network” (PCSGAN) to expand the number of railway defect image samples. PCSGAN uses a pyramidal structure to learn the internal features of a single image. In addition, we proposed a masking and a perceptual reconstruction loss mechanism to impose specific positional and structural constraints on the images. We tested the method using railway defects images and compared it to other single image generation models. The experiment results show that the images generated by PCSGAN take into account railway prior knowledge, generate railway structure which satisfied the constraints imposed by railway infrastructure designs, and also provide new information. High image realism and lowest Single Image FID were obtained, and the effectiveness of PCSGAN in the defect detection task were also validated.",

keywords = "Deep learning, generative adversarial network, image generation, railway defect detection",

author = "Sen He and Zehua Jian and Shaoli Liu and Jianhua Liu and Yue Fang and Jia Hu",

note = "Publisher Copyright: {\textcopyright} 2024 IEEE.",

year = "2024",

doi = "10.1109/TITS.2024.3368213",

language = "English",

volume = "25",

pages = "8796--8806",

journal = "IEEE Transactions on Intelligent Transportation Systems",

issn = "1524-9050",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "8",

}

TY - JOUR

T1 - PCSGAN

T2 - A Perceptual Constrained Generative Model for Railway Defect Sample Expansion From a Single Image

AU - He, Sen

AU - Jian, Zehua

AU - Liu, Shaoli

AU - Liu, Jianhua

AU - Fang, Yue

AU - Hu, Jia

PY - 2024

Y1 - 2024

N2 - — Many deep learning based railway defect detection methods have been proposed in recent years. They have greatly improved the efficiency and accuracy of defects detection. However, detection of railway defects remains challenging because of the limited number and types of defect samples, thus, general deep learning methods cannot be applied. In this paper, we designed a “Perceptually Constrained Single Image Generative Adversarial Network” (PCSGAN) to expand the number of railway defect image samples. PCSGAN uses a pyramidal structure to learn the internal features of a single image. In addition, we proposed a masking and a perceptual reconstruction loss mechanism to impose specific positional and structural constraints on the images. We tested the method using railway defects images and compared it to other single image generation models. The experiment results show that the images generated by PCSGAN take into account railway prior knowledge, generate railway structure which satisfied the constraints imposed by railway infrastructure designs, and also provide new information. High image realism and lowest Single Image FID were obtained, and the effectiveness of PCSGAN in the defect detection task were also validated.

AB - — Many deep learning based railway defect detection methods have been proposed in recent years. They have greatly improved the efficiency and accuracy of defects detection. However, detection of railway defects remains challenging because of the limited number and types of defect samples, thus, general deep learning methods cannot be applied. In this paper, we designed a “Perceptually Constrained Single Image Generative Adversarial Network” (PCSGAN) to expand the number of railway defect image samples. PCSGAN uses a pyramidal structure to learn the internal features of a single image. In addition, we proposed a masking and a perceptual reconstruction loss mechanism to impose specific positional and structural constraints on the images. We tested the method using railway defects images and compared it to other single image generation models. The experiment results show that the images generated by PCSGAN take into account railway prior knowledge, generate railway structure which satisfied the constraints imposed by railway infrastructure designs, and also provide new information. High image realism and lowest Single Image FID were obtained, and the effectiveness of PCSGAN in the defect detection task were also validated.

KW - Deep learning

KW - generative adversarial network

KW - image generation

KW - railway defect detection

UR - http://www.scopus.com/inward/record.url?scp=85186963352&partnerID=8YFLogxK

U2 - 10.1109/TITS.2024.3368213

DO - 10.1109/TITS.2024.3368213

M3 - Article

AN - SCOPUS:85186963352

SN - 1524-9050

VL - 25

SP - 8796

EP - 8806

JO - IEEE Transactions on Intelligent Transportation Systems

JF - IEEE Transactions on Intelligent Transportation Systems

IS - 8

ER -

PCSGAN: A Perceptual Constrained Generative Model for Railway Defect Sample Expansion From a Single Image

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this