TY - CHAP
T1 - Applications of Natural Language Processing Techniques in Protein Structure and Function Prediction
AU - Liu, Bin
AU - Yan, Ke
AU - Pang, Yi He
AU - Zhang, Jun
AU - Shao, Jiang Yi
AU - Tang, Yi Jun
AU - Wang, Ning
N1 - Publisher Copyright:
© 2023 World Scientific Publishing Company.
PY - 2022/1/1
Y1 - 2022/1/1
N2 - Protein structure and function prediction are instrumental areas in the bioinformatics field. They are important for a number of applications in rational drug discovery, disease analysis, and many others. Protein sequences and natural languages share some similarities. Therefore, many techniques derived from natural language processing (NLP) have been applied to the protein structure and function prediction. In this chapter, we discuss sequence-based predictors of protein structure and function that utilize techniques derived from NLP field. We include methods that target protein sequence analysis, fold recognition, identification of intrinsically disordered regions/proteins, and prediction of protein-nucleic acids binding. The concepts and computational methods discussed in this chapter will be especially useful for the researchers who are working in the related field. We also aim to bring new computational NLP techniques into the protein structure and function prediction area.
AB - Protein structure and function prediction are instrumental areas in the bioinformatics field. They are important for a number of applications in rational drug discovery, disease analysis, and many others. Protein sequences and natural languages share some similarities. Therefore, many techniques derived from natural language processing (NLP) have been applied to the protein structure and function prediction. In this chapter, we discuss sequence-based predictors of protein structure and function that utilize techniques derived from NLP field. We include methods that target protein sequence analysis, fold recognition, identification of intrinsically disordered regions/proteins, and prediction of protein-nucleic acids binding. The concepts and computational methods discussed in this chapter will be especially useful for the researchers who are working in the related field. We also aim to bring new computational NLP techniques into the protein structure and function prediction area.
UR - http://www.scopus.com/inward/record.url?scp=85153654184&partnerID=8YFLogxK
U2 - 10.1142/9789811258589_0003
DO - 10.1142/9789811258589_0003
M3 - Chapter
AN - SCOPUS:85153654184
SN - 9789811258572
SP - 57
EP - 80
BT - Machine Learning in Bioinformatics of Protein Sequences
PB - World Scientific Publishing Co.
ER -