Unraveling Scientific Evolutionary Paths: An Embedding-Based Topic Analysis

Qianqian Jin, Hongshu Chen*, Yi Zhang, Xuefeng Wang, Donghua Zhu

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

Understanding the evolution of knowledge has been and will continue to be the key task of science, technology, and innovation management. Existing research on evolutionary path identification relies primarily on traditional co-occurrence analysis and bag-of-words (BOW)-based models for topic extraction. However, these approaches have limitations in effectively capturing the underlying semantics and linkages of the topics. In this article, we propose a novel embedding-based methodology for scientific evolution analysis, in which word embedding, document embedding, clustering, and network analysis are applied to extract topics, measure topical semantic similarities, and quantitatively distinguish topics' evolutionary states. We first perform benchmark experiments to demonstrate that doc2vec generally outperforms the BOW-based models in topic extraction before evolution analysis. We then consider topic consistency in vector spaces to identify evolutionary states including newborn, convergence, inheritance, and extinction. Scientific evolutionary paths are finally unraveled based on topic similarity matrixes and evolutionary states. We conduct a case study on object detection research to validate the effectiveness of our methodology. The empirical results, validated by domain experts, demonstrate that the proposed methodology is capable of effectively revealing patterns of knowledge inheritance and integration. Consequently, this methodology can be used to improve decision-making processes in future innovation management.

Original languageEnglish
Pages (from-to)8964-8978
Number of pages15
JournalIEEE Transactions on Engineering Management
Volume71
DOIs
Publication statusPublished - 2024

Keywords

  • Doc2vec
  • embedding
  • evolution analysis
  • evolutionary paths
  • topic extraction
  • word2vec

Fingerprint

Dive into the research topics of 'Unraveling Scientific Evolutionary Paths: An Embedding-Based Topic Analysis'. Together they form a unique fingerprint.

Cite this