IDHS-EL: Identifying DNase i hypersensitive sites by fusing three different modes of pseudo nucleotide composition into an ensemble learning framework

  • Bin Liu*
  • , Ren Long
  • , Kuo Chen Chou
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

202 Citations (Scopus)

Abstract

Motivation: Regulatory DNA elements are associated with DNase I hypersensitive sites (DHSs). Accordingly, identification of DHSs will provide useful insights for in-depth investigation into the function of noncoding genomic regions. Results: In this study, using the strategy of ensemble learning framework, we proposed a new predictor called iDHS-EL for identifying the location of DHS in human genome. It was formed by fusing three individual Random Forest (RF) classifiers into an ensemble predictor. The three RF operators were respectively based on the three special modes of the general pseudo nucleotide composition (PseKNC): (i) kmer, (ii) reverse complement kmer and (iii) pseudo dinucleotide composition. It has been demonstrated that the new predictor remarkably outperforms the relevant state-of-the-art methods in both accuracy and stability.

Original languageEnglish
Pages (from-to)2411-2418
Number of pages8
JournalBioinformatics
Volume32
Issue number16
DOIs
Publication statusPublished - 15 Aug 2016
Externally publishedYes

Fingerprint

Dive into the research topics of 'IDHS-EL: Identifying DNase i hypersensitive sites by fusing three different modes of pseudo nucleotide composition into an ensemble learning framework'. Together they form a unique fingerprint.

Cite this