Abstract
This paper proposes a novel similarity measure for automatic text summarization. The topic space model is built through the Latent Dirichlet Allocation. The word, sentence, document and corpus are represented as vectors in the same topic space. LMMR and LSD algorithm are introduced to create the summary. An experiment is illustrated on DUC data and the results prove the proposed measure and algorithm effective and well performed.
Original language | English |
---|---|
Pages (from-to) | 2944-2949 |
Number of pages | 6 |
Journal | Procedia Engineering |
Volume | 29 |
DOIs | |
Publication status | Published - 2012 |
Event | 2012 International Workshop on Information and Electronics Engineering, IWIEE 2012 - Harbin, China Duration: 10 Mar 2012 → 11 Mar 2012 |
Keywords
- Automatic text summarization
- Latent Dirichlet Allocation
- Similarity measure