TY - JOUR
T1 - Entity set expansion in knowledge graph
T2 - a heterogeneous information network perspective
AU - Shi, Chuan
AU - Ding, Jiayu
AU - Cao, Xiaohuan
AU - Hu, Linmei
AU - Wu, Bin
AU - Li, Xiaoli
N1 - Publisher Copyright:
© 2020, Higher Education Press.
PY - 2021/2/1
Y1 - 2021/2/1
N2 - Entity set expansion (ESE) aims to expand an entity seed set to obtain more entities which have common properties. ESE is important for many applications such as dictionary construction and query suggestion. Traditional ESE methods relied heavily on the text and Web information of entities. Recently, some ESE methods employed knowledge graphs (KGs) to extend entities. However, they failed to effectively and efficiently utilize the rich semantics contained in a KG and ignored the text information of entities in Wikipedia. In this paper, we model a KG as a heterogeneous information network (HIN) containing multiple types of objects and relations. Fine-grained multi-type meta paths are proposed to capture the hidden relation among seed entities in a KG and thus to retrieve candidate entities. Then we rank the entities according to the meta path based structural similarity. Furthermore, to utilize the text description of entities in Wikipedia, we propose an extended model CoMeSE++ which combines both structural information revealed by a KG and text information in Wikipedia for ESE. Extensive experiments on real-world datasets demonstrate that our model achieves better performance by combining structural and textual information of entities.
AB - Entity set expansion (ESE) aims to expand an entity seed set to obtain more entities which have common properties. ESE is important for many applications such as dictionary construction and query suggestion. Traditional ESE methods relied heavily on the text and Web information of entities. Recently, some ESE methods employed knowledge graphs (KGs) to extend entities. However, they failed to effectively and efficiently utilize the rich semantics contained in a KG and ignored the text information of entities in Wikipedia. In this paper, we model a KG as a heterogeneous information network (HIN) containing multiple types of objects and relations. Fine-grained multi-type meta paths are proposed to capture the hidden relation among seed entities in a KG and thus to retrieve candidate entities. Then we rank the entities according to the meta path based structural similarity. Furthermore, to utilize the text description of entities in Wikipedia, we propose an extended model CoMeSE++ which combines both structural information revealed by a KG and text information in Wikipedia for ESE. Extensive experiments on real-world datasets demonstrate that our model achieves better performance by combining structural and textual information of entities.
KW - entity set expansion
KW - heterogeneous information network
KW - knowledge graph
KW - multi-type meta path
UR - http://www.scopus.com/inward/record.url?scp=85091719729&partnerID=8YFLogxK
U2 - 10.1007/s11704-020-9240-8
DO - 10.1007/s11704-020-9240-8
M3 - Article
AN - SCOPUS:85091719729
SN - 2095-2228
VL - 15
JO - Frontiers of Computer Science
JF - Frontiers of Computer Science
IS - 1
M1 - 151307
ER -