Research on the multi-source causal feature selection method based on multiple causal relevance

Ping Qiu, Zhendong Niu*, Chunxia Zhang

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

4 引用 (Scopus)

摘要

Multi-source causal feature selection captures causal relevance of the features with the class attribute in different datasets and are very important to improve the stability and reliability of prediction models. The Multi-source Causal Feature Selection (MCFS) is the most advanced method that can simultaneously select features on multiple datasets. However, it only considers the causal relevance between a single feature and class attributes, which ignores the causal relevance among multiple features. In addition, MCFS uses exhaustive method to obtain the optimal causal feature set on multiple datasets, which is time-consuming. Focusing on the two problems, firstly we propose the Multiple Causal Relevance, which can remove redundant information hidden in pairwise causal relevance. Secondly, we analyze the Markov blanket of multi-source class attributes, where the upper and lower bounds of optimal causal feature set are proven to reduce the search range of features and improve the efficiency of the algorithm. Finally, we propose a multi-source causal Feature Selection method based on Multiple Causal Relevance (MCRFS) and use synthetic datasets and binary and multiclassification real datasets with 2 feature selection methods, extensive experiments show that the accuracy and efficiency of MCRFS method on SVM and KNN classifiers are better than two comparison methods.

源语言英语
文章编号110334
期刊Knowledge-Based Systems
265
DOI
出版状态已出版 - 8 4月 2023

指纹

探究 'Research on the multi-source causal feature selection method based on multiple causal relevance' 的科研主题。它们共同构成独一无二的指纹。

引用此