Dual-phase feature selection using adaptive neighborhood rough sets and hybrid sine-cosine optimization for classification

  • Chengfeng Zheng
  • Mohd Shareduwan Mohd Kasihmuddin*
  • Zhizhong Yan
  • Mohd Asyraf Mansor
  • Yuan Gao
  • Ju Chen

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

Abstract

High-dimensional, multi-class, and imbalanced datasets present significant challenges in classification tasks across various industries, including healthcare, finance, and image processing. Existing feature selection methods, particularly those based on neighborhood rough sets, often struggle with handling both feature redundancy and noisy samples, making it difficult to capture the complex distribution of features and samples across different classes. To address this, we propose a dual-phase feature selection method that performs joint optimization in both horizontal (feature-level) and vertical (sample-level) dimensions. In the first phase, adaptive neighborhood rough set theory is used for horizontal feature selection. By adjusting the neighborhood radius (δ) and inclusion degree (λ) through cross-validation, the method selects relevant feature subsets tailored to the granularity of each dataset, thereby improving generalization. In the second phase, a hybrid sine cosine algorithm is employed for vertical processing to optimize sample selection. This algorithm iteratively removes noisy or misleading samples based on fitness evaluation, enhancing the model's robustness. Furthermore, the framework integrates an enhanced fuzzy k-nearest neighbor classifier that leverages feature subset weights for each class to better address class imbalance during classification. Extensive experiments on 21 public datasets, using three types of classifiers, show that the proposed method outperforms seven benchmark feature selection algorithms in terms of classification accuracy, weighted precision, weighted recall, and weighted F1-score. Statistical tests, including the Wilcoxon signed-rank test, confirm significant improvements. This dual-phase horizontal and vertical optimization approach offers a robust and effective solution for real-world classification tasks involving complex data distributions.
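The first phase can be illustrated with a small sketch. The abstract does not give the paper's exact definitions, so the code below assumes the common variable-precision form of neighborhood rough sets: a sample counts toward the positive region when at least a fraction λ of the samples within distance δ of it (in the selected feature subspace) share its class label. The function names `neighborhood_dependency` and `greedy_select`, the Euclidean distance, and the forward greedy search are all illustrative assumptions rather than the authors' implementation, and the cross-validated tuning of δ and λ is not shown.

```python
def neighborhood_dependency(X, y, features, delta, lam):
    """Fraction of samples whose delta-neighborhood is at least lam-pure
    in the sample's own class (assumed variable-precision definition)."""
    n = len(X)
    positive = 0
    for i in range(n):
        neighbors = []
        for j in range(n):
            # Euclidean distance restricted to the chosen feature subset.
            d = sum((X[i][f] - X[j][f]) ** 2 for f in features) ** 0.5
            if d <= delta:
                neighbors.append(j)  # always contains i itself
        same_class = sum(1 for j in neighbors if y[j] == y[i])
        if same_class / len(neighbors) >= lam:
            positive += 1
    return positive / n


def greedy_select(X, y, delta, lam):
    """Forward greedy search (an assumption): repeatedly add the feature
    that raises the dependency most, and stop when nothing improves."""
    remaining = list(range(len(X[0])))
    selected, best = [], 0.0
    while remaining:
        scored = [(neighborhood_dependency(X, y, selected + [f], delta, lam), f)
                  for f in remaining]
        score, f = max(scored, key=lambda s: s[0])
        if score <= best:
            break
        selected.append(f)
        remaining.remove(f)
        best = score
    return selected, best


# Toy data: feature 0 separates the two classes, feature 1 is noise.
X = [[0.0, 0.5], [0.1, 0.9], [0.2, 0.1],
     [1.0, 0.4], [1.1, 0.8], [1.2, 0.2]]
y = [0, 0, 0, 1, 1, 1]
selected, dependency = greedy_select(X, y, delta=0.3, lam=1.0)
```

On this toy data the search keeps only the informative feature. In a setting like the one the abstract describes, δ and λ would be chosen per dataset by cross-validation instead of being fixed by hand.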

Original language: English
Article number: 111899
Journal: Engineering Applications of Artificial Intelligence
Volume: 160
Publication status: Published - 23 Nov 2025
Externally published: Yes

Keywords

  • Adaptive rough set feature selection
  • Artificial intelligence applications
  • Fuzzy k-nearest neighbor
  • High-dimensional data
  • Multi-class classification
  • Sine cosine algorithm
