TY - GEN
T1 - Deep embedding GAN-based model for anomaly detection on high-dimensional sparse data
AU - Wang, Chaojun
AU - Dai, Yaping
AU - Dai, Wei
N1 - Publisher Copyright:
© 2019 Technical Committee on Control Theory, Chinese Association of Automation.
PY - 2019/7
Y1 - 2019/7
N2 - The use of Generative Adversarial Nets (GAN) for anomaly detection has been explored recently. However, in the case of high-dimensional sparse data, existing GAN-based anomaly detection models suffer from inefficient dimensionality reduction, computationally costly data reconstruction, and suboptimal performance limited by the training objective. In this paper, a deep embedding GAN-based model is developed for anomaly detection on high-dimensional sparse data. In the model, dimensionality reduction of input data is performed by embeddings efficiently. With the bidirectional Wasserstein GAN, data reconstruction is conducted in the input dense representation space at a low computational cost. The objective function defined by the Wasserstein distance and Lipschitz continuity constraints stabilizes training and improves model performance. Experimental results on public datasets show that, the developed model has comparable or superior performance over the competing techniques, and achieves up to 8.81% relative improvement based on the area under the Receiver Operating Characteristics curve (AUC).
AB - The use of Generative Adversarial Nets (GAN) for anomaly detection has been explored recently. However, in the case of high-dimensional sparse data, existing GAN-based anomaly detection models suffer from inefficient dimensionality reduction, computationally costly data reconstruction, and suboptimal performance limited by the training objective. In this paper, a deep embedding GAN-based model is developed for anomaly detection on high-dimensional sparse data. In the model, dimensionality reduction of input data is performed by embeddings efficiently. With the bidirectional Wasserstein GAN, data reconstruction is conducted in the input dense representation space at a low computational cost. The objective function defined by the Wasserstein distance and Lipschitz continuity constraints stabilizes training and improves model performance. Experimental results on public datasets show that, the developed model has comparable or superior performance over the competing techniques, and achieves up to 8.81% relative improvement based on the area under the Receiver Operating Characteristics curve (AUC).
KW - Anomaly Detection
KW - Generative Adversarial Nets
KW - High-dimensional Sparse Data
UR - http://www.scopus.com/inward/record.url?scp=85074404377&partnerID=8YFLogxK
U2 - 10.23919/ChiCC.2019.8866256
DO - 10.23919/ChiCC.2019.8866256
M3 - Conference contribution
AN - SCOPUS:85074404377
T3 - Chinese Control Conference, CCC
SP - 8718
EP - 8722
BT - Proceedings of the 38th Chinese Control Conference, CCC 2019
A2 - Fu, Minyue
A2 - Sun, Jian
PB - IEEE Computer Society
T2 - 38th Chinese Control Conference, CCC 2019
Y2 - 27 July 2019 through 30 July 2019
ER -