Improved Chinese sentence semantic similarity calculation method based on multi-feature fusion

Liqi Liu, Qinglin Wang, Yuan Li*

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

6 Citations (Scopus)

Abstract

In this paper, an improved long short-term memory (LSTM)-based deep neural network structure is proposed for learning the semantic similarity of variable-length Chinese sentences. Siamese LSTM, a sequence-insensitive deep neural network model, has a limited ability to capture the semantics of natural language because it has difficulty accounting for semantic differences that arise from differences in syntactic structure or word order within a sentence. Therefore, the proposed model integrates the syntactic-component features of the words in a sentence into the word vector representation layer to express the syntactic structure of the sentence and the interdependence between words. Moreover, a relative position embedding layer is introduced into the model: the relative positions of the words in the sentence are mapped to a high-dimensional space to capture local position information. With this model, a parallel structure maps the two sentences into the same high-dimensional space to obtain fixed-length sentence vector representations. After aggregation, the sentence similarity is computed in the output layer. Experiments on Chinese sentences show that the model achieves good results in semantic similarity calculation.
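To make the described architecture concrete, the following is a minimal PyTorch sketch of a shared-weight (Siamese) LSTM encoder that fuses word, syntactic-component, and relative-position embeddings. All vocabulary sizes, embedding dimensions, and the Manhattan-distance similarity in the output layer (borrowed from the standard Siamese LSTM formulation) are illustrative assumptions; the paper's exact layer sizes and output function are not given in the abstract.

```python
# Hypothetical sketch of a multi-feature-fusion Siamese LSTM; all
# hyperparameters below are assumptions, not the paper's settings.
import torch
import torch.nn as nn

class MultiFeatureSiameseLSTM(nn.Module):
    def __init__(self, vocab_size=20000, syntax_tags=50, max_rel_pos=100,
                 word_dim=300, tag_dim=50, pos_dim=50, hidden_dim=128):
        super().__init__()
        # Word vector representation layer, fused with syntactic-component
        # features (e.g., POS/dependency tags) as described in the abstract.
        self.word_emb = nn.Embedding(vocab_size, word_dim)
        self.tag_emb = nn.Embedding(syntax_tags, tag_dim)
        # Relative position embedding layer: relative word positions are
        # mapped into a higher-dimensional space to capture local position
        # information.
        self.pos_emb = nn.Embedding(max_rel_pos, pos_dim)
        self.lstm = nn.LSTM(word_dim + tag_dim + pos_dim, hidden_dim,
                            batch_first=True)

    def encode(self, words, tags, rel_pos):
        # Concatenate word, syntactic, and relative-position features,
        # then map the sentence to a fixed-length vector via the LSTM.
        x = torch.cat([self.word_emb(words), self.tag_emb(tags),
                       self.pos_emb(rel_pos)], dim=-1)
        _, (h, _) = self.lstm(x)   # h: (1, batch, hidden_dim)
        return h.squeeze(0)        # fixed-length sentence vector

    def forward(self, s1, s2):
        # Parallel structure: both sentences pass through the same encoder,
        # and their similarity is computed in the output layer.
        h1 = self.encode(*s1)
        h2 = self.encode(*s2)
        # Manhattan-distance similarity in (0, 1], an assumed output choice.
        return torch.exp(-torch.norm(h1 - h2, p=1, dim=-1))
```

Each sentence here is a tuple of word-ID, syntax-tag-ID, and relative-position-ID tensors of shape (batch, seq_len); sharing one encoder across both inputs is what makes the structure "parallel" in the abstract's sense.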

Original language: English
Pages (from-to): 442-449
Number of pages: 8
Journal: Journal of Advanced Computational Intelligence and Intelligent Informatics
Volume: 25
Issue number: 4
DOIs
Publication status: Published - Jul 2021

Keywords

  • LSTM
  • Relative position embedding
  • Semantic similarity
  • Syntactic component
