TY - GEN
T1 - Generating peptide sequence tags for peptide identification via tandem mass spectrometry
AU - Changyong, Yu
AU - Guoren, Wang
AU - Yuhai, Zhao
AU - Keming, Mao
AU - Junjie, Wu
AU - Wendan, Zhai
PY - 2009
Y1 - 2009
N2 - Large-scale, rapid and accurate protein identi-fication is the crucial basis for further protein analysis in computational proteomics. Searching protein database by use of the protein tandem mass spectra has been a standard solution for solving this problem. Though several algorithms have been proposed, more sensitive and accurate approaches are still needed. In this paper, an effective database search approach is proposed. Prior to searching sequence database, an approach based on a graph-theoretic model is proposed to infer the peptide sequence tag (PST) from the tandem mass spectra data which is the partial sequence of the peptide. Also, an index approach for the protein sequence database is proposed for speeding up the database search and filtering out the incorrect protein sequences. Then, a novel scoring method for evaluating the match between the peptide sequence tag and the protein sequence is proposed for improving the accuracy of the database search result. Finally, we develop an algorithm for solving the problem and implement it as a computer program PepCheck. All the results of PepCheck are compared with those of the famous algorithms. Experimental results demonstrate that PepCheck is as accurate as or more accurate than them with the test datasets.
AB - Large-scale, rapid and accurate protein identi-fication is the crucial basis for further protein analysis in computational proteomics. Searching protein database by use of the protein tandem mass spectra has been a standard solution for solving this problem. Though several algorithms have been proposed, more sensitive and accurate approaches are still needed. In this paper, an effective database search approach is proposed. Prior to searching sequence database, an approach based on a graph-theoretic model is proposed to infer the peptide sequence tag (PST) from the tandem mass spectra data which is the partial sequence of the peptide. Also, an index approach for the protein sequence database is proposed for speeding up the database search and filtering out the incorrect protein sequences. Then, a novel scoring method for evaluating the match between the peptide sequence tag and the protein sequence is proposed for improving the accuracy of the database search result. Finally, we develop an algorithm for solving the problem and implement it as a computer program PepCheck. All the results of PepCheck are compared with those of the famous algorithms. Experimental results demonstrate that PepCheck is as accurate as or more accurate than them with the test datasets.
UR - http://www.scopus.com/inward/record.url?scp=70449576120&partnerID=8YFLogxK
U2 - 10.1109/BIBE.2009.16
DO - 10.1109/BIBE.2009.16
M3 - Conference contribution
AN - SCOPUS:70449576120
SN - 9780769536569
T3 - Proceedings of the 2009 9th IEEE International Conference on Bioinformatics and BioEngineering, BIBE 2009
SP - 200
EP - 207
BT - Proceedings of the 2009 9th IEEE International Conference on Bioinformatics and BioEngineering, BIBE 2009
T2 - 2009 9th IEEE International Conference on Bioinformatics and BioEngineering, BIBE 2009
Y2 - 22 June 2009 through 24 June 2009
ER -