TY - GEN
T1 - Financial named entity recognition based on conditional random fields and information entropy
AU - Wang, Shuwei
AU - Xu, Ruifeng
AU - Liu, Bin
AU - Gui, Lin
AU - Zhou, Yu
N1 - Publisher Copyright:
© 2014 IEEE.
PY - 2014/1/13
Y1 - 2014/1/13
N2 - Named entity recognition plays an important role in many natural language processing tasks, such as relation detection and information extraction. This paper presents a novel method to recognize named entities infinancial news texts in three steps. First, the domain dictionary is applied to recognize stock names. Second, the full form FNEs are identified by incorporating internal features in a classifier based on Conditional Random Fields. Third, the mutual information, boundary entropy and context features are employed to recognize the abbreviation FNE candidates. The experiments completed on a Chinese financial dataset show that the proposed approach achieves 91.02% precision and 92.77% recall.
AB - Named entity recognition plays an important role in many natural language processing tasks, such as relation detection and information extraction. This paper presents a novel method to recognize named entities infinancial news texts in three steps. First, the domain dictionary is applied to recognize stock names. Second, the full form FNEs are identified by incorporating internal features in a classifier based on Conditional Random Fields. Third, the mutual information, boundary entropy and context features are employed to recognize the abbreviation FNE candidates. The experiments completed on a Chinese financial dataset show that the proposed approach achieves 91.02% precision and 92.77% recall.
KW - Conditional Random Fields
KW - Financial named entity
KW - Information Entropy
KW - Named entities recognition
UR - http://www.scopus.com/inward/record.url?scp=84921502704&partnerID=8YFLogxK
U2 - 10.1109/ICMLC.2014.7009718
DO - 10.1109/ICMLC.2014.7009718
M3 - Conference contribution
AN - SCOPUS:84921502704
T3 - Proceedings - International Conference on Machine Learning and Cybernetics
SP - 838
EP - 843
BT - Proceedings of 2014 International Conference on Machine Learning and Cybernetics, ICMLC 2014
PB - IEEE Computer Society
T2 - 13th International Conference on Machine Learning and Cybernetics, ICMLC 2014
Y2 - 13 July 2014 through 16 July 2014
ER -