What Affects the Performance of Models? Sensitivity Analysis of Knowledge Graph Embedding

Han Yang, Leilei Zhang, Fenglong Su, Jinhui Pang*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Knowledge graph (KG) embedding aims to embed entities and relations into a low-dimensional vector space, which has been an active research topic for knowledge base completion (KGC). Recent researchers improve existing models in terms of knowledge representation space, scoring function, encoding method, etc., have achieved progressive improvements. However, the theoretical mechanism behind them has always been ignored. There are few works on sensitivity analysis of embedded models, which is extremely challenging. The diversity of KGE models makes it difficult to consider them uniformly and compare them fairly. In this paper, we first study the internal connections and mutual transformation methods of different KGE models from the generic group perspective, and further propose a unified KGE learning framework. Then, we conduct an in-depth sensitivity analysis on the factors that affect the objective of embedding learning. Specifically, in addition to the impact of the embedding algorithm itself, this article also considers the structural features of the dataset and the strategies of the training method. After a comprehensive experiment and analysis, we can conclude that the Head-to-Tail rate of datasets, the definition of model metric function, the number of negative samples and the selection of regularization methods have a greater impact on the final performance.

Original languageEnglish
Title of host publicationDatabase Systems for Advanced Applications - 27th International Conference, DASFAA 2022, Proceedings
EditorsArnab Bhattacharya, Janice Lee Mong Li, Divyakant Agrawal, P. Krishna Reddy, Mukesh Mohania, Anirban Mondal, Vikram Goyal, Rage Uday Kiran
PublisherSpringer Science and Business Media Deutschland GmbH
Pages698-713
Number of pages16
ISBN (Print)9783031001222
DOIs
Publication statusPublished - 2022
Event27th International Conference on Database Systems for Advanced Applications, DASFAA 2022 - Virtual, Online
Duration: 11 Apr 202214 Apr 2022

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13245 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference27th International Conference on Database Systems for Advanced Applications, DASFAA 2022
CityVirtual, Online
Period11/04/2214/04/22

Keywords

  • Group theory
  • Knowledge graph embedding
  • Sensitivity analysis

Fingerprint

Dive into the research topics of 'What Affects the Performance of Models? Sensitivity Analysis of Knowledge Graph Embedding'. Together they form a unique fingerprint.

Cite this

Yang, H., Zhang, L., Su, F., & Pang, J. (2022). What Affects the Performance of Models? Sensitivity Analysis of Knowledge Graph Embedding. In A. Bhattacharya, J. Lee Mong Li, D. Agrawal, P. K. Reddy, M. Mohania, A. Mondal, V. Goyal, & R. Uday Kiran (Eds.), Database Systems for Advanced Applications - 27th International Conference, DASFAA 2022, Proceedings (pp. 698-713). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 13245 LNCS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-00123-9_55