COLARE: Commit Classification via Fine-grained Context-aware Representation of Code Changes

Qunhong Zeng, Yuxia Zhang*, Zeyu Sun, Yujie Guo, Hui Liu

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Commit classification for maintenance activities is of critical importance for both industry and academia. State-of-the-art approaches either treat code changes as plain text or rely on manually identified features. Directly applying the most advanced model of code change representation into commit classification faces two limitations: (1) coarse-grained diff comparison neglects the distance of modified code lines; (2) missing key context information of hunk modification and file categories. This study proposes a novel classification model, COLARE, which compares code changes at the hunk level, takes fine-grained features based on categories of changed files, and aggregates with the representation of commit messages. The evaluation results show that our model outperforms state-of-the-art techniques by 7.24% and 7.35% in accuracy and macro F1 score, respectively. We also manually labeled a multi-language dataset and evaluated our approach, The results further confirm that our approach achieves the best performance over three baselines, including ChatGPT (3.5). The evaluation of the ablation study demonstrates the effectiveness of the major components in our technique.

源语言英语
主期刊名Proceedings - 2024 IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2024
出版商Institute of Electrical and Electronics Engineers Inc.
752-763
页数12
ISBN(电子版)9798350330663
DOI
出版状态已出版 - 2024
活动31st IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2024 - Rovaniemi, 芬兰
期限: 12 3月 202415 3月 2024

出版系列

姓名Proceedings - 2024 IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2024

会议

会议31st IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2024
国家/地区芬兰
Rovaniemi
时期12/03/2415/03/24

指纹

探究 'COLARE: Commit Classification via Fine-grained Context-aware Representation of Code Changes' 的科研主题。它们共同构成独一无二的指纹。

引用此