摘要
Knowledge Graphs (KGs) are crucial resources for supporting geographical knowledge services. Given the vast geographical knowledge in web text, extraction of geo-entity relations from web text has become the core technology for constructing geographical KGs. Furthermore, it directly affects the quality of geographical knowledge services. However, web text inevitably contains noise and geographical knowledge can be sparsely distributed, both greatly restricting the quality of geo-entity relationship extraction. Here, we proposed a method for filtering geo-entity relations based on existing Knowledge Bases (KBs). Specifically, ontology knowledge, fact knowledge, and synonym knowledge were integrated to generate geo-related knowledge. Then, the extracted geo-entity relationships and the geo-related knowledge were transferred into vectors, and the maximum similarity between vectors was the confidence value of one extracted geo-entity relationship triple. Our method takes full advantage of existing KBs to assess the quality of geographical information in web text, which helps improve the richness and freshness of geographical KGs. Compared with the Stanford OpenIE method, our method decreased the Mean Square Error (MSE) from 0.62 to 0.06 in the confidence interval [0.7, 1], and improved the area under the Receiver Operating Characteristic (ROC) Curve (AUC) from 0.51 to 0.89.
投稿的翻译标题 | A Knowledge-based Method for Filtering Geo-entity Relations |
---|---|
源语言 | 繁体中文 |
页(从-至) | 1392-1401 |
页数 | 10 |
期刊 | Journal of Geo-Information Science |
卷 | 21 |
期 | 9 |
DOI | |
出版状态 | 已出版 - 25 9月 2019 |
已对外发布 | 是 |
关键词
- Common knowledge bases
- Evaluation of geographic information quality
- Geo-KG building
- Geo-entity relations extraction
- Information filtering
- Open relation extraction
- Text data