IMiRNA-PseDPC: MicroRNA precursor identification with a pseudo distance-pair composition approach

  • Bin Liu*
  • , Longyun Fang
  • , Fule Liu
  • , Xiaolong Wang
  • , Kuo Chen Chou
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

A microRNA (miRNA) is a small non-coding RNA molecule, functioning in transcriptional and post-transcriptional regulation of gene expression. The human genome may encode over 1000 miRNAs. Albeit poorly characterized, miRNAs are widely deemed as important regulators of biological processes. Aberrant expression of miRNAs has been observed in many cancers and other disease states, indicating that they are deeply implicated with these diseases, particularly in carcinogenesis. Therefore, it is important for both basic research and miRNA-based therapy to discriminate the real pre-miRNAs from the false ones (such as hairpin sequences with similar stem-loops). Particularly, with the avalanche of RNA sequences generated in the post-genomic age, it is highly desired to develop computational sequence-based methods for effectively identifying the human pre-miRNAs. Here, we propose a predictor called "iMiRNA-PseDPC", in which the RNA sequences are formulated by a novel feature vector called "pseudo distance-pair composition" (PseDPC) with 10 types of structure statuses. Rigorous cross-validations on a much larger and more stringent newly constructed benchmark data-set showed that our approach has remarkably outperformed the existing ones in either prediction accuracy or efficiency, indicating the new predictor is quite promising or at least may become a complementary tool to the existing predictors in this area. For the convenience of most experimental scientists, a user-friendly web server for the new predictor has been established at http://bioinformatics.hitsz.edu.cn/iMiRNA-PseDPC/, by which users can easily get their desired results without the need to go through the mathematical details. It is anticipated that the new predictor may become a useful high throughput tool for genome analysis particularly in dealing with large-scale data.

Original languageEnglish
Pages (from-to)220-232
Number of pages13
JournalJournal of Biomolecular Structure and Dynamics
Volume34
Issue number1
DOIs
Publication statusPublished - 2 Jan 2016
Externally publishedYes

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • Chou's PseAAC approach
  • Pre-miRNA
  • free energy
  • iMiRNA-PseDPC
  • large-scale analysis
  • local structure status

Fingerprint

Dive into the research topics of 'IMiRNA-PseDPC: MicroRNA precursor identification with a pseudo distance-pair composition approach'. Together they form a unique fingerprint.

Cite this