Which Words Pillar the Semantic Expression of a Sentence?

Cheng Zhang, Jingxu Cao, Dongmei Yan*, Dawei Song, Jinxin Lv

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)
Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 1
  • Captures
    • Readers: 5
see details

Abstract

In the realm of machine learning, a profound understanding of sentence semantics holds paramount importance for various applications, notably text classification. Traditionally, this comprehension has been entrusted to deep learning models, despite their computationally intensive nature, particularly when dealing with lengthy sequences. The nuanced impact of individual words within a sentence on semantic expression necessitates a strategic removal of less pertinent words to alleviate the computational burden of the model. Presently, prevailing approaches for word removal predominantly employ methods such as truncation, stop-word elimination and attention mechanisms. Regrettably, these techniques often lack a robust theoretical foundation concerning semantics and interpretability. To bridge this conceptual gap, our study introduces the concept of 'Semantic Pillar Words' (SPW) within a sentence, anchored in a Semantic Euclidean space. Here, the semantics of a word are represented as a constellation of semantic points, with a text sequence encapsulating the convex hull of these semantic points of words. We propose a novel method for Semantic Pillar Word extraction, known as 'SPW-Conv', which dynamically and interpretably prunes text segments, striving to preserve the semantic pillars inherent in the original text. Our extensive experimentation encompasses three diverse text classification datasets, revealing that SPW-Conv outperforms existing methods. Remarkably, it becomes evident that retaining less than 80% of the words within a sentence suffices to capture its semantics adequately, all while achieving classification accuracy levels comparable to those obtained using the entire original text.

Original languageEnglish
Title of host publicationProceedings - 2023 IEEE 35th International Conference on Tools with Artificial Intelligence, ICTAI 2023
PublisherIEEE Computer Society
Pages791-798
Number of pages8
ISBN (Electronic)9798350342734
DOIs
Publication statusPublished - 2023
Externally publishedYes
Event35th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2023 - Atlanta, United States
Duration: 6 Nov 20238 Nov 2023

Publication series

NameProceedings - International Conference on Tools with Artificial Intelligence, ICTAI
ISSN (Print)1082-3409

Conference

Conference35th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2023
Country/TerritoryUnited States
CityAtlanta
Period6/11/238/11/23

Keywords

  • Convex hull
  • Natural Language Processing
  • Semantic Pillar Words

Fingerprint

Dive into the research topics of 'Which Words Pillar the Semantic Expression of a Sentence?'. Together they form a unique fingerprint.

Cite this

Zhang, C., Cao, J., Yan, D., Song, D., & Lv, J. (2023). Which Words Pillar the Semantic Expression of a Sentence? In Proceedings - 2023 IEEE 35th International Conference on Tools with Artificial Intelligence, ICTAI 2023 (pp. 791-798). (Proceedings - International Conference on Tools with Artificial Intelligence, ICTAI). IEEE Computer Society. https://doi.org/10.1109/ICTAI59109.2023.00121