TY - GEN
T1 - Disease gene identification by walking on multilayer heterogeneous networks
AU - DIng, Cangfeng
AU - Li, Kan
N1 - Publisher Copyright:
© 2018 Association for Computing Machinery.
PY - 2018/5/8
Y1 - 2018/5/8
N2 - Identifying disease genes from a set of candidate genes is one of the main objectives in bioinformatics. Most of existing random walk algorithms to identify disease genes preferentially visit highlyconnected genes. Moreover, these algorithms access only a single gene network or an aggregated network of various gene data, leading to bias and incompleteness. To address these issues, we propose a topologically biased random walk with restart (BRWR) algorithm applied to multilayer-heterogeneous networks for the identification of disease genes. The BRWR by tuning the biased parameters can explore different layers of functional and physical interactions between proteins and genes, and can be conducted on heterogeneous networks in which the walkers can traverse a network with different types of nodes and edges. Experimental results show that the BRWR algorithm to identify candidate disease genes outperforms existing ones. Finally, the BRWR algorithm on multilayer-heterogeneous networks is used to predict disease genes implicated in the undiagnosed neonatal progeroid syndrome. Overall, by a proper tuning of the biases in the walks, our algorithm on different interaction sources can effectively improve the performance of candidate disease gene identification.
AB - Identifying disease genes from a set of candidate genes is one of the main objectives in bioinformatics. Most of existing random walk algorithms to identify disease genes preferentially visit highlyconnected genes. Moreover, these algorithms access only a single gene network or an aggregated network of various gene data, leading to bias and incompleteness. To address these issues, we propose a topologically biased random walk with restart (BRWR) algorithm applied to multilayer-heterogeneous networks for the identification of disease genes. The BRWR by tuning the biased parameters can explore different layers of functional and physical interactions between proteins and genes, and can be conducted on heterogeneous networks in which the walkers can traverse a network with different types of nodes and edges. Experimental results show that the BRWR algorithm to identify candidate disease genes outperforms existing ones. Finally, the BRWR algorithm on multilayer-heterogeneous networks is used to predict disease genes implicated in the undiagnosed neonatal progeroid syndrome. Overall, by a proper tuning of the biases in the walks, our algorithm on different interaction sources can effectively improve the performance of candidate disease gene identification.
KW - Biased random walks
KW - Biological networks
KW - Heterogeneous networks
KW - Multilayer networks
UR - http://www.scopus.com/inward/record.url?scp=85052241938&partnerID=8YFLogxK
U2 - 10.1145/3203217.3203275
DO - 10.1145/3203217.3203275
M3 - Conference contribution
AN - SCOPUS:85052241938
SN - 9781450357616
T3 - 2018 ACM International Conference on Computing Frontiers, CF 2018 - Proceedings
SP - 11
EP - 18
BT - 2018 ACM International Conference on Computing Frontiers, CF 2018 - Proceedings
PB - Association for Computing Machinery, Inc
T2 - 15th ACM International Conference on Computing Frontiers, CF 2018
Y2 - 8 May 2018 through 10 May 2018
ER -