Identification of DNA-binding proteins by auto-cross covariance transformation

Qiwen Dong, Shanyi Wang, Kai Wang, Xuan Liu, Bin Liu*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

36 Citations (Scopus)

Abstract

DNA-binding proteins play a pivotal role in various intra- and extra-cellular activities ranging from DNA replication to gene expression control. With the rapid development of next generation of sequencing technique, the number of protein sequences are unprecedentedly increasing. Thus it is necessary to develop computational methods to identify the DNA-binding protein from the protein sequence information. In this study, a novel method is presented which combines the support vector machine and the auto-cross covariance transformation. The protein sequence represented in the form of amino acids or the physical-chemical properties of amino acids are converted into a series of fixed-length vectors by Kmer composition and the auto-cross covariance transformation. The sequence order effect can be effectively capture by this scheme. These vectors are then inputted to support vector machine to discriminate the DNA-binding proteins from the non DNA-binding ones. The proposed method achieves the overall accuracy of 75.23% and Matthew correlation coefficient of 0.5 by a rigorous jackknife test. The independent test shows that the proposed method outperforms most of the existing methods. These results demonstrate that the proposed method provides the state-of-the-art performance for the prediction of DNA-binding proteins.

Original languageEnglish
Title of host publicationProceedings - 2015 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015
Editorslng. Matthieu Schapranow, Jiayu Zhou, Xiaohua Tony Hu, Bin Ma, Sanguthevar Rajasekaran, Satoru Miyano, Illhoi Yoo, Brian Pierce, Amarda Shehu, Vijay K. Gombar, Brian Chen, Vinay Pai, Jun Huan
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages470-475
Number of pages6
ISBN (Electronic)9781467367981
DOIs
Publication statusPublished - 16 Dec 2015
Externally publishedYes
EventIEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015 - Washington, United States
Duration: 9 Nov 201512 Nov 2015

Publication series

NameProceedings - 2015 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015

Conference

ConferenceIEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015
Country/TerritoryUnited States
CityWashington
Period9/11/1512/11/15

Keywords

  • DNA-binding protein
  • auto-cross covariance transformation
  • support vector machine

Fingerprint

Dive into the research topics of 'Identification of DNA-binding proteins by auto-cross covariance transformation'. Together they form a unique fingerprint.

Cite this