BioSeq-Analysis: A platform for DNA, RNA and protein sequence analysis based on machine learning approaches

Bin Liu*

*Corresponding author for this work

Research output: Contribution to journal, Article, peer-review

242 Citations (Scopus)

Abstract

With the avalanche of biological sequences generated in the post-genomic age, one of the most challenging problems is how to computationally analyze their structures and functions. Machine learning techniques play key roles in this field. Typically, building a predictor based on machine learning involves three main steps: feature extraction, predictor construction and performance evaluation. Although several Web servers and stand-alone tools have been developed to facilitate biological sequence analysis, each focuses on only one of these steps. To address this, the study proposes a Web server called BioSeq-Analysis (http://bioinformatics.hitsz.edu.cn/BioSeq-Analysis/) that automatically completes all three steps of constructing a predictor. The user only needs to upload the benchmark data set; BioSeq-Analysis then generates an optimized predictor based on that data set and reports its performance measures. For users' convenience, a stand-alone program was also released, which can be downloaded from http://bioinformatics.hitsz.edu.cn/BioSeq-Analysis/download/ and run directly on Windows, Linux and UNIX. When applied to three sequence analysis tasks, the predictors generated by BioSeq-Analysis outperformed some state-of-the-art methods. It is anticipated that BioSeq-Analysis will become a useful tool for biological sequence analysis.
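The three steps named in the abstract can be illustrated with a minimal, standard-library Python sketch. It is not BioSeq-Analysis code and does not reproduce its feature sets or classifiers: the k-mer frequency features, the nearest-centroid predictor, the leave-one-out evaluation, the toy sequences and all function names are assumptions chosen only to show how the steps fit together.

```python
from collections import Counter
from itertools import product


def kmer_features(seq, k=2, alphabet="ACGT"):
    """Step 1, feature extraction: normalized k-mer frequency vector."""
    kmers = ["".join(p) for p in product(alphabet, repeat=k)]
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = max(len(seq) - k + 1, 1)
    return [counts[m] / total for m in kmers]


def train_centroids(vectors, labels):
    """Step 2, predictor construction: one mean feature vector per class."""
    sums, sizes = {}, Counter(labels)
    for vec, lab in zip(vectors, labels):
        acc = sums.setdefault(lab, [0.0] * len(vec))
        for i, value in enumerate(vec):
            acc[i] += value
    return {lab: [v / sizes[lab] for v in acc] for lab, acc in sums.items()}


def predict(centroids, vec):
    """Assign the class whose centroid is closest in squared Euclidean distance."""
    def sqdist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(centroids, key=lambda lab: sqdist(centroids[lab], vec))


def leave_one_out_accuracy(seqs, labels, k=2):
    """Step 3, performance evaluation: leave-one-out cross-validated accuracy."""
    vectors = [kmer_features(s, k) for s in seqs]
    correct = 0
    for i in range(len(seqs)):
        model = train_centroids(vectors[:i] + vectors[i + 1:],
                                labels[:i] + labels[i + 1:])
        correct += predict(model, vectors[i]) == labels[i]
    return correct / len(seqs)


# Hypothetical toy benchmark data set (not from the paper).
seqs = ["ATATATATAT", "TATATATTAA", "AATTATATTA",
        "GCGCGCGGCC", "CCGGCGCGCG", "GGCCGCGCCG"]
labels = ["AT-rich"] * 3 + ["GC-rich"] * 3

print(leave_one_out_accuracy(seqs, labels))
```

A real benchmark would need far more sequences, a held-out test set, and measures beyond accuracy (for example sensitivity and specificity); the sketch only shows the shape of the pipeline.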

Original language: English
Pages (from-to): 1280-1294
Number of pages: 15
Journal: Briefings in Bioinformatics
Volume: 20
Issue number: 4
Publication status: Published - 27 Mar 2018
Externally published: Yes

Keywords

  • biological sequence analysis
  • feature extraction
  • machine learning
  • performance evaluation
  • predictor construction
