Which Words Pillar the Semantic Expression of a Sentence?

Cheng Zhang, Jingxu Cao, Dongmei Yan*, Dawei Song, Jinxin Lv

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

1 引用 (Scopus)
Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 1
  • Captures
    • Readers: 5
see details

摘要

In the realm of machine learning, a profound understanding of sentence semantics holds paramount importance for various applications, notably text classification. Traditionally, this comprehension has been entrusted to deep learning models, despite their computationally intensive nature, particularly when dealing with lengthy sequences. The nuanced impact of individual words within a sentence on semantic expression necessitates a strategic removal of less pertinent words to alleviate the computational burden of the model. Presently, prevailing approaches for word removal predominantly employ methods such as truncation, stop-word elimination and attention mechanisms. Regrettably, these techniques often lack a robust theoretical foundation concerning semantics and interpretability. To bridge this conceptual gap, our study introduces the concept of 'Semantic Pillar Words' (SPW) within a sentence, anchored in a Semantic Euclidean space. Here, the semantics of a word are represented as a constellation of semantic points, with a text sequence encapsulating the convex hull of these semantic points of words. We propose a novel method for Semantic Pillar Word extraction, known as 'SPW-Conv', which dynamically and interpretably prunes text segments, striving to preserve the semantic pillars inherent in the original text. Our extensive experimentation encompasses three diverse text classification datasets, revealing that SPW-Conv outperforms existing methods. Remarkably, it becomes evident that retaining less than 80% of the words within a sentence suffices to capture its semantics adequately, all while achieving classification accuracy levels comparable to those obtained using the entire original text.

源语言英语
主期刊名Proceedings - 2023 IEEE 35th International Conference on Tools with Artificial Intelligence, ICTAI 2023
出版商IEEE Computer Society
791-798
页数8
ISBN(电子版)9798350342734
DOI
出版状态已出版 - 2023
已对外发布
活动35th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2023 - Atlanta, 美国
期限: 6 11月 20238 11月 2023

出版系列

姓名Proceedings - International Conference on Tools with Artificial Intelligence, ICTAI
ISSN(印刷版)1082-3409

会议

会议35th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2023
国家/地区美国
Atlanta
时期6/11/238/11/23

指纹

探究 'Which Words Pillar the Semantic Expression of a Sentence?' 的科研主题。它们共同构成独一无二的指纹。

引用此

Zhang, C., Cao, J., Yan, D., Song, D., & Lv, J. (2023). Which Words Pillar the Semantic Expression of a Sentence?Proceedings - 2023 IEEE 35th International Conference on Tools with Artificial Intelligence, ICTAI 2023 (页码 791-798). (Proceedings - International Conference on Tools with Artificial Intelligence, ICTAI). IEEE Computer Society. https://doi.org/10.1109/ICTAI59109.2023.00121