MulStack: An ensemble learning prediction model of multilabel mRNA subcellular localization

Ziqi Liu, Tao Bai, Bin Liu, Liang Yu*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

3 Citations (Scopus)

Abstract

Subcellular localization of mRNA is related to protein synthesis, cell polarity, cell movement and other biological regulation mechanisms. The distribution of mRNAs in subcellulars is similar to that of proteins, and most mRNAs are distributed in multiple subcellulars. Recently, some computational methods have been designed to predict the subcellular localization of mRNA. However, these methods only employed a sin-gle level of mRNA features and did not employ the position encoding of nucleotides in mRNA. In this paper, an ensemble learning prediction model is proposed, named MulStack, which is based on random forest and deep learning for multilabel mRNA subcellular localization. The proposed method employs two levels of mRNA features, including sequence-level and residue-level features, and position encoding is employed for the first time in the field of subcellular localization of mRNA. Random forest is employed to learn mRNA sequence-level feature, deep learning is employed to learn mRNA sequence-level feature and mRNA residue-level combined with position encoding. And the outputs of random forest and deep learning model will be weighted sum as the prediction probability. Compared with existing methods, the results show that MulStack is the best in the localization of the nucleus, cytosol and exosome. In addition, position weight matrices (PWMs) are extracted by convolutional neural networks (CNNs) that can be matched with known RNA binding protein motifs. Gene ontology (GO) enrichment analysis shows biological processes, molecular functions and cellular components of mRNA genes. The prediction web server of MulStack is freely accessible at http://bliulab.net/MulStack.

Original languageEnglish
Article number108289
JournalComputers in Biology and Medicine
Volume175
DOIs
Publication statusPublished - Jun 2024

Keywords

  • Deep learning
  • Ensemble learning predictor
  • Multilabel mRNA subcellular localization
  • Position encoding
  • mRNA features at two levels

Fingerprint

Dive into the research topics of 'MulStack: An ensemble learning prediction model of multilabel mRNA subcellular localization'. Together they form a unique fingerprint.

Cite this