TY - JOUR
T1 - A comprehensive review and evaluation of computational methods for identifying protein complexes from protein-protein interaction networks
AU - Wu, Zhourun
AU - Liao, Qing
AU - Liu, Bin
N1 - Publisher Copyright:
© 2019 The Author(s) 2019. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
PY - 2020/9/1
Y1 - 2020/9/1
N2 - Protein complexes are the fundamental units for many cellular processes. Identifying protein complexes accurately is critical for understanding the functions and organizations of cells. With the increment of genome-scale protein-protein interaction (PPI) data for different species, various computational methods focus on identifying protein complexes from PPI networks. In this article, we give a comprehensive and updated review on the state-of-the-art computational methods in the field of protein complex identification, especially focusing on the newly developed approaches. The computational methods are organized into three categories, including cluster-quality-based methods, node-affinity-based methods and ensemble clustering methods. Furthermore, the advantages and disadvantages of different methods are discussed, and then, the performance of 17 state-of-the-art methods is evaluated on two widely used benchmark data sets. Finally, the bottleneck problems and their potential solutions in this important field are discussed.
AB - Protein complexes are the fundamental units for many cellular processes. Identifying protein complexes accurately is critical for understanding the functions and organizations of cells. With the increment of genome-scale protein-protein interaction (PPI) data for different species, various computational methods focus on identifying protein complexes from PPI networks. In this article, we give a comprehensive and updated review on the state-of-the-art computational methods in the field of protein complex identification, especially focusing on the newly developed approaches. The computational methods are organized into three categories, including cluster-quality-based methods, node-affinity-based methods and ensemble clustering methods. Furthermore, the advantages and disadvantages of different methods are discussed, and then, the performance of 17 state-of-the-art methods is evaluated on two widely used benchmark data sets. Finally, the bottleneck problems and their potential solutions in this important field are discussed.
KW - cluster-quality-based methods
KW - ensemble clustering methods
KW - node-affinity-based methods
KW - protein complexes
KW - protein-protein interaction networks
UR - http://www.scopus.com/inward/record.url?scp=85087146222&partnerID=8YFLogxK
U2 - 10.1093/bib/bbz085
DO - 10.1093/bib/bbz085
M3 - Review article
C2 - 31631226
AN - SCOPUS:85087146222
SN - 1467-5463
VL - 21
SP - 1531
EP - 1548
JO - Briefings in Bioinformatics
JF - Briefings in Bioinformatics
IS - 5
ER -