The prediction of human genes in DNA based on a generalized hidden Markov model

Rui Guo*, Ke Yan, Wei He, Jian Zhang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

The Generalized Hidden Markov Model (GHMM) has been proved to be an excellently general probabilistic model of the gene structure of human genomic sequences. It can simultaneously incorporate different signal descriptions like splicing sites and content descriptions, for instance, compositional features of exons and introns. Enjoying its flexibility and convincing probabilistic underpinnings, we integrate some other modification of submodels and then implement a prediction program of Human Genes in DNA. The program has the capacity to predict multiple genes in a sequence, to deal with partial as well as complete genes, and to predict consistent sets of genes occurring on either or both DNA strands. More importantly, it also can perform well for longer sequences with an unknown number of genes in them. In the experiments, the results show that the proposed method has better performance in prediction accuracy than some existing methods, and over 70% of exons can be identified exactly.

Original languageEnglish
Title of host publicationBiometric Recognition - 11th Chinese Conference, CCBR 2016, Proceedings
EditorsShiguang Shan, Zhisheng You, Jie Zhou, Weishi Zheng, Yunhong Wang, Zhenan Sun, Jianjiang Feng, Qijun Zhao
PublisherSpringer Verlag
Pages747-755
Number of pages9
ISBN (Print)9783319466538
DOIs
Publication statusPublished - 2016
Externally publishedYes
Event11th Chinese Conference on Biometric Recognition, CCBR 2016 - Chengdu, China
Duration: 14 Oct 201616 Oct 2016

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9967 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference11th Chinese Conference on Biometric Recognition, CCBR 2016
Country/TerritoryChina
CityChengdu
Period14/10/1616/10/16

Keywords

  • GHMM
  • Gene prediction
  • IMM
  • The method based on similarity weighting of sequence patterns
  • The prefix sum arrays
  • WWAM

Fingerprint

Dive into the research topics of 'The prediction of human genes in DNA based on a generalized hidden Markov model'. Together they form a unique fingerprint.

Cite this