TY - JOUR
T1 - A bootstrapping based approach for open geo-entity relation extraction
AU - Yu, Li
AU - Lu, Feng
AU - Liu, Xiliang
N1 - Publisher Copyright:
© 2016, Surveying and Mapping Press. All right reserved.
PY - 2016/5/1
Y1 - 2016/5/1
N2 - Extracting spatial relations and semantic relations between two geo-entities from Web texts, asks robust and effective solutions. This paper puts forward a novel approach: firstly, the characteristics of terms (part-of-speech, position and distance) are analyzed by means of bootstrapping. Secondly, the weight of each term is calculated and the keyword is picked out as the clue of geo-entity relations. Thirdly, the geo-entity pairs and their keywords are organized into structured information. Finally, an experiment is conducted with Baidubaike and Stanford CoreNLP. The study shows that the presented method can automatically explore part of the lexical features and find additional relational terms which neither the domain expert knowledge nor large scale corpora need. Moreover, compared with three classical frequency statistics methods, namely Frequency, TF-IDF and PPMI, the precision and recall are improved about 5% and 23% respectively.
AB - Extracting spatial relations and semantic relations between two geo-entities from Web texts, asks robust and effective solutions. This paper puts forward a novel approach: firstly, the characteristics of terms (part-of-speech, position and distance) are analyzed by means of bootstrapping. Secondly, the weight of each term is calculated and the keyword is picked out as the clue of geo-entity relations. Thirdly, the geo-entity pairs and their keywords are organized into structured information. Finally, an experiment is conducted with Baidubaike and Stanford CoreNLP. The study shows that the presented method can automatically explore part of the lexical features and find additional relational terms which neither the domain expert knowledge nor large scale corpora need. Moreover, compared with three classical frequency statistics methods, namely Frequency, TF-IDF and PPMI, the precision and recall are improved about 5% and 23% respectively.
KW - Bootstrapping
KW - Geo-entities
KW - Quantitative evaluation
KW - Relation extraction
KW - Text mining
UR - http://www.scopus.com/inward/record.url?scp=84973460138&partnerID=8YFLogxK
U2 - 10.11947/j.AGCS.2016.20150181
DO - 10.11947/j.AGCS.2016.20150181
M3 - Article
AN - SCOPUS:84973460138
SN - 1001-1595
VL - 45
SP - 616
EP - 622
JO - Acta Geodaetica et Cartographica Sinica
JF - Acta Geodaetica et Cartographica Sinica
IS - 5
ER -