摘要
This paper proposes a novel similarity measure for automatic text summarization. The topic space model is built through the Latent Dirichlet Allocation. The word, sentence, document and corpus are represented as vectors in the same topic space. LMMR and LSD algorithm are introduced to create the summary. An experiment is illustrated on DUC data and the results prove the proposed measure and algorithm effective and well performed.
源语言 | 英语 |
---|---|
页(从-至) | 2944-2949 |
页数 | 6 |
期刊 | Procedia Engineering |
卷 | 29 |
DOI | |
出版状态 | 已出版 - 2012 |
活动 | 2012 International Workshop on Information and Electronics Engineering, IWIEE 2012 - Harbin, 中国 期限: 10 3月 2012 → 11 3月 2012 |