TY - GEN
T1 - A Review of Machine Learning Algorithms for Text Classification
AU - Li, Ruiguang
AU - Liu, Ming
AU - Xu, Dawei
AU - Gao, Jiaqi
AU - Wu, Fudong
AU - Zhu, Liehuang
N1 - Publisher Copyright: © 2022, The Author(s).
PY - 2022
Y1 - 2022
N2 - Text classification is a fundamental task in natural language processing and a core technology underlying information retrieval, question answering systems, emotion analysis, and other advanced tasks. It is one of the earliest applications of machine learning algorithms and has achieved good results. In this paper, we review traditional and state-of-the-art machine learning algorithms for text classification, including Naive Bayes, Support Vector Machine, Decision Tree, K-Nearest Neighbor, Random Forest, and neural networks. We then discuss the advantages and disadvantages of each kind of algorithm in depth. Finally, we conclude that neural networks and deep learning will become the main research topics in the future.
AB - Text classification is a fundamental task in natural language processing and a core technology underlying information retrieval, question answering systems, emotion analysis, and other advanced tasks. It is one of the earliest applications of machine learning algorithms and has achieved good results. In this paper, we review traditional and state-of-the-art machine learning algorithms for text classification, including Naive Bayes, Support Vector Machine, Decision Tree, K-Nearest Neighbor, Random Forest, and neural networks. We then discuss the advantages and disadvantages of each kind of algorithm in depth. Finally, we conclude that neural networks and deep learning will become the main research topics in the future.
KW - Machine learning
KW - Natural language processing
KW - Neural network
KW - Text classification
UR - http://www.scopus.com/inward/record.url?scp=85124658477&partnerID=8YFLogxK
U2 - 10.1007/978-981-16-9229-1_14
DO - 10.1007/978-981-16-9229-1_14
M3 - Conference contribution
AN - SCOPUS:85124658477
SN - 9789811692284
T3 - Communications in Computer and Information Science
SP - 226
EP - 234
BT - Cyber Security - 18th China Annual Conference, CNCERT 2021, Revised Selected Papers
A2 - Lu, Wei
A2 - Zhang, Yuqing
A2 - Wen, Weiping
A2 - Yan, Hanbing
A2 - Li, Chao
PB - Springer Science and Business Media Deutschland GmbH
T2 - 18th China Cyber Security Annual Conference, CNCERT 2021
Y2 - 20 July 2021 through 21 July 2021
ER -