TY - JOUR
T1 - PreRBP-TL
T2 - prediction of species-specific RNA-binding proteins based on transfer learning
AU - Zhang, Jun
AU - Yan, Ke
AU - Chen, Qingcai
AU - Liu, Bin
N1 - Publisher Copyright:
© 2022 The Author(s) 2022. Published by Oxford University Press. All rights reserved.
PY - 2022/4/15
Y1 - 2022/4/15
N2 - Motivation: RNA-binding proteins (RBPs) play crucial roles in post-transcriptional regulation. Accurate identification of RBPs helps to understand gene expression, regulation, etc. In recent years, some computational methods were proposed to identify RBPs. However, these methods fail to accurately identify RBPs from some specific species with limited data, such as bacteria. Results: In this study, we introduce a computational method called PreRBP-TL for identifying species-specific RBPs based on transfer learning. The weights of the prediction model were initialized by pretraining with the large general RBP dataset and then fine-tuned with the small species-specific RPB dataset by using transfer learning. The experimental results show that the PreRBP-TL achieves better performance for identifying the species-specific RBPs from Human, Arabidopsis, Escherichia coli and Salmonella, outperforming eight state-of-the-art computational methods. It is anticipated PreRBP-TL will become a useful method for identifying RBPs.
AB - Motivation: RNA-binding proteins (RBPs) play crucial roles in post-transcriptional regulation. Accurate identification of RBPs helps to understand gene expression, regulation, etc. In recent years, some computational methods were proposed to identify RBPs. However, these methods fail to accurately identify RBPs from some specific species with limited data, such as bacteria. Results: In this study, we introduce a computational method called PreRBP-TL for identifying species-specific RBPs based on transfer learning. The weights of the prediction model were initialized by pretraining with the large general RBP dataset and then fine-tuned with the small species-specific RPB dataset by using transfer learning. The experimental results show that the PreRBP-TL achieves better performance for identifying the species-specific RBPs from Human, Arabidopsis, Escherichia coli and Salmonella, outperforming eight state-of-the-art computational methods. It is anticipated PreRBP-TL will become a useful method for identifying RBPs.
UR - http://www.scopus.com/inward/record.url?scp=85128718876&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btac106
DO - 10.1093/bioinformatics/btac106
M3 - Article
AN - SCOPUS:85128718876
SN - 1367-4803
VL - 38
SP - 2135
EP - 2143
JO - Bioinformatics
JF - Bioinformatics
IS - 8
ER -