Leveraging Conceptualization for Short-Text Embedding

Heyan Huang, Yashen Wang, Chong Feng*, Zhirun Liu, Qiang Zhou

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

36 Citations (Scopus)
Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 35
  • Captures
    • Readers: 22
see details

Abstract

Most short-text embedding models typically represent each short-text only using the literal meanings of the words, which makes these models indiscriminative for the ubiquitous polysemy. In order to enhance the semantic representation capability of the short-texts, we (i) propose a novel short-text conceptualization algorithm to assign the associated concepts for each short-text, and then (ii) introduce the conceptualization results into learning the conceptual short-text embeddings. Hence, this semantic representation is more expressive than some widely-used text representation models such as the latent topic model. Wherein, the short-text conceptualization algorithm used here is based on a novel co-ranking framework, enabling the signals (i.e., the words and the concepts) to fully interplay to derive the solid conceptualization for the short-texts. Afterwards, we further extend the conceptual short-text embedding models by utilizing an attention-based model that selects the relevant words within the context to make more efficient prediction. The experiments on the real-world datasets demonstrate that the proposed conceptual short-text embedding model and short-text conceptualization algorithm are more effective than the state-of-the-art methods.

Original languageEnglish
Pages (from-to)1282-1295
Number of pages14
JournalIEEE Transactions on Knowledge and Data Engineering
Volume30
Issue number7
DOIs
Publication statusPublished - 1 Jul 2018

Keywords

  • Short-text conceptualization
  • co-ranking framework
  • conceptual short-text embedding
  • semantic network

Fingerprint

Dive into the research topics of 'Leveraging Conceptualization for Short-Text Embedding'. Together they form a unique fingerprint.

Cite this

Huang, H., Wang, Y., Feng, C., Liu, Z., & Zhou, Q. (2018). Leveraging Conceptualization for Short-Text Embedding. IEEE Transactions on Knowledge and Data Engineering, 30(7), 1282-1295. https://doi.org/10.1109/TKDE.2017.2787709