Automated Software Entity Matching Between Successive Versions

Bo Liu, Hui Liu*, Nan Niu, Yuxia Zhang, Guangjie Li, Yanjie Jiang

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

1 引用 (Scopus)

摘要

Version control systems are widely used to manage the evolution of software applications. However, such version control systems take source code as lines of plain text, and thus they cannot present the evolution of software entities embedded in the source code. To this end, a few approaches have been proposed to match software entities before and after a given commit, known as software entity matching algorithms. However, the accuracy of such algorithms requires further improvement. In this paper, we propose an automated iterative algorithm (called ReMapper) to match software entities between two successive versions. The key insight of ReMapper is that the qualified name, the implementation, and the references of a software entity together can distinguish it from others. It matches software entities iteratively because the mapping depends on the reference-based similarity whereas the reference-based similarity depends on the mapping of entities as well. We evaluated ReMapper on a benchmark consisting of 215 commits from 21 real-world projects. Our evaluation results suggest that ReMapper substantially outperformed the state of the art, reducing the number of mistakes (false positives plus false negatives) substantially by 85.8%. We also evaluated to what extent it may improve the automated refactoring discovery (mining) that relies heavily on automated entity matching. Our evaluation results suggest that it substantially improved the state of the art in refactoring discovery, improving recall by 6.9% and reducing the number of false positives by 72.6%.

源语言英语
主期刊名Proceedings - 2023 38th IEEE/ACM International Conference on Automated Software Engineering, ASE 2023
出版商Institute of Electrical and Electronics Engineers Inc.
1615-1627
页数13
ISBN(电子版)9798350329964
DOI
出版状态已出版 - 2023
活动38th IEEE/ACM International Conference on Automated Software Engineering, ASE 2023 - Echternach, 卢森堡
期限: 11 9月 202315 9月 2023

出版系列

姓名Proceedings - 2023 38th IEEE/ACM International Conference on Automated Software Engineering, ASE 2023

会议

会议38th IEEE/ACM International Conference on Automated Software Engineering, ASE 2023
国家/地区卢森堡
Echternach
时期11/09/2315/09/23

指纹

探究 'Automated Software Entity Matching Between Successive Versions' 的科研主题。它们共同构成独一无二的指纹。

引用此