Abstract
As the scale of data shows rapid growth in various fields, big data's vast amount of information can facilitate scientific discovery or decision-making. Deep neural network prevails in modeling big data such as images and text in computer vision and natural language processing. However, there is currently no widespread deep neural network for high-dimensional tabular data (HTD), as HTD could increase the model's complexity and make estimating the parameters more difficult. Therefore, this paper proposes CLDNSR, a contrastive learning-enhanced deep neural network with serial regularization. This method combines relaxed Bernoulli distribution-based L0 regularization and adaptive L2 regularization for important feature selection and adaptive redundancy control to effectively handle high-dimensional input features. In addition, a tabular contrastive pre-training method is proposed to stabilize the supervised training process through better parameter initialization. Experiments on eleven real-world high-dimensional tabular datasets demonstrate that CLDNSR outperforms the baseline models designed for high-dimensional data.
| Original language | English |
|---|---|
| Article number | 120243 |
| Journal | Expert Systems with Applications |
| Volume | 228 |
| DOIs | |
| Publication status | Published - 15 Oct 2023 |
Keywords
- Contrastive learning
- Deep neural network
- High-dimensional tabular data
- Serial regularization
Fingerprint
Dive into the research topics of 'Contrastive learning enhanced deep neural network with serial regularization for high-dimensional tabular data'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver