What Affects the Performance of Models? Sensitivity Analysis of Knowledge Graph Embedding

Han Yang, Leilei Zhang, Fenglong Su, Jinhui Pang*

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Knowledge graph (KG) embedding aims to embed entities and relations into a low-dimensional vector space, which has been an active research topic for knowledge base completion (KGC). Recent researchers improve existing models in terms of knowledge representation space, scoring function, encoding method, etc., have achieved progressive improvements. However, the theoretical mechanism behind them has always been ignored. There are few works on sensitivity analysis of embedded models, which is extremely challenging. The diversity of KGE models makes it difficult to consider them uniformly and compare them fairly. In this paper, we first study the internal connections and mutual transformation methods of different KGE models from the generic group perspective, and further propose a unified KGE learning framework. Then, we conduct an in-depth sensitivity analysis on the factors that affect the objective of embedding learning. Specifically, in addition to the impact of the embedding algorithm itself, this article also considers the structural features of the dataset and the strategies of the training method. After a comprehensive experiment and analysis, we can conclude that the Head-to-Tail rate of datasets, the definition of model metric function, the number of negative samples and the selection of regularization methods have a greater impact on the final performance.

源语言英语
主期刊名Database Systems for Advanced Applications - 27th International Conference, DASFAA 2022, Proceedings
编辑Arnab Bhattacharya, Janice Lee Mong Li, Divyakant Agrawal, P. Krishna Reddy, Mukesh Mohania, Anirban Mondal, Vikram Goyal, Rage Uday Kiran
出版商Springer Science and Business Media Deutschland GmbH
698-713
页数16
ISBN(印刷版)9783031001222
DOI
出版状态已出版 - 2022
活动27th International Conference on Database Systems for Advanced Applications, DASFAA 2022 - Virtual, Online
期限: 11 4月 202214 4月 2022

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
13245 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议27th International Conference on Database Systems for Advanced Applications, DASFAA 2022
Virtual, Online
时期11/04/2214/04/22

指纹

探究 'What Affects the Performance of Models? Sensitivity Analysis of Knowledge Graph Embedding' 的科研主题。它们共同构成独一无二的指纹。

引用此