BIT at SemEval-2016 task 1: Sentence similarity based on alignments and vector with the weight of information content

Hao Wu, Heyan Huang, Wenpeng Lu

科研成果: 书/报告/会议事项章节会议稿件同行评审

3 引用 (Scopus)

摘要

This paper describes three unsupervised systems for determining the semantic similarity between two short texts or sentences submitted to the SemEval 2016 Task 1, all of which make use of only off-the-shelf software and data making them easy to replicate. Two systems achieved a similar Pearson correlation coefficient (0.64661 by simple vector, 0.65319 by word alignments). We include experiments on using our alignment based system on evaluation data from the 2014 and 2015 STS shared task. The results suggest that beyond the core similarity algorithm, other factors such as data preprocessing and use of domain-specific knowledge are also important to similarity prediction performance.

源语言英语
主期刊名SemEval 2016 - 10th International Workshop on Semantic Evaluation, Proceedings
出版商Association for Computational Linguistics (ACL)
686-690
页数5
ISBN(电子版)9781941643952
DOI
出版状态已出版 - 2016
活动10th International Workshop on Semantic Evaluation, SemEval 2016 - San Diego, 美国
期限: 16 6月 201617 6月 2016

出版系列

姓名SemEval 2016 - 10th International Workshop on Semantic Evaluation, Proceedings

会议

会议10th International Workshop on Semantic Evaluation, SemEval 2016
国家/地区美国
San Diego
时期16/06/1617/06/16

指纹

探究 'BIT at SemEval-2016 task 1: Sentence similarity based on alignments and vector with the weight of information content' 的科研主题。它们共同构成独一无二的指纹。

引用此