Exploiting multi-aspect interactions for god class detection with dataset fine-tuning

Shaojun Ren, Chongyang Shi*, Shuxin Zhao

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

4 Citations (Scopus)

Abstract

A god class is a class that takes on too many responsibilities, handling tasks that would more appropriately be split across multiple classes. The existence of god classes seriously harms the maintainability and understandability of software. To eliminate god classes, we first need to identify them. Researchers have proposed traditional methods that use code metrics, and deep learning methods that use both code metrics and text information, to detect god classes. However, the relationships among metrics and text information are often ignored; moreover, deep learning methods require large, reliable datasets, while authentic god class datasets are scarce. To solve these problems, we propose a novel god class detection method based on multi-aspect interactions and dataset fine-tuning. First, we use the proposed model to extract multi-aspect interaction information, which has three parts: (i) the interactions among code metrics; (ii) the interactions among texts; (iii) the interactions between texts and code metrics. In this way, we not only make use of code metrics and text information, but also fully exploit the multi-aspect interaction information. Second, we train on large-scale synthetic datasets to obtain a pre-trained model, then fine-tune the pre-trained model parameters on high-quality authentic datasets. This pre-training and fine-tuning scheme addresses both the low reliability of synthetic datasets and the scarcity of authentic ones. Finally, evaluation results on open-source applications suggest that the proposed approach improves on the state of the art.
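The two-stage training described in the abstract (pre-train on large but noisy synthetic data, then fine-tune on a small clean set) can be illustrated with a minimal sketch. The abstract does not specify the model architecture, so this sketch swaps in plain logistic regression as a stand-in and does not model the multi-aspect interactions. The feature names (`size`, `coupling`), the labeling rule, the noise rate, and all hyperparameters are assumptions made up for illustration, and the data is randomly generated toy data, not a real god class dataset.

```python
import math
import random


def sigmoid(z):
    # Clamp z so math.exp cannot overflow for large |z|.
    z = max(-30.0, min(30.0, z))
    return 1.0 / (1.0 + math.exp(-z))


def train(weights, samples, lr, epochs):
    """Run plain SGD for logistic regression, starting from `weights`."""
    w = list(weights)  # copy, so the caller's weights are not mutated
    for _ in range(epochs):
        for features, label in samples:
            pred = sigmoid(sum(wi * xi for wi, xi in zip(w, features)))
            err = pred - label
            w = [wi - lr * err * xi for wi, xi in zip(w, features)]
    return w


def make_samples(n, noise, rng):
    """Toy data: two hypothetical metrics plus a bias term; label 1 = god class."""
    samples = []
    for _ in range(n):
        size, coupling = rng.random(), rng.random()
        label = 1 if size + coupling > 1.0 else 0  # assumed toy labeling rule
        if rng.random() < noise:
            label = 1 - label  # flip the label to mimic unreliable synthetic labels
        samples.append(([size, coupling, 1.0], label))
    return samples


def accuracy(weights, samples):
    hits = sum(
        (sigmoid(sum(w * x for w, x in zip(weights, f))) >= 0.5) == (y == 1)
        for f, y in samples
    )
    return hits / len(samples)


rng = random.Random(0)
synthetic = make_samples(2000, noise=0.2, rng=rng)  # large, noisy (stage 1)
authentic = make_samples(50, noise=0.0, rng=rng)    # small, clean (stage 2)
held_out = make_samples(500, noise=0.0, rng=rng)

# Stage 1: pre-train from scratch on the synthetic set.
pretrained = train([0.0, 0.0, 0.0], synthetic, lr=0.1, epochs=5)
# Stage 2: continue from the pre-trained weights with a smaller learning rate.
fine_tuned = train(pretrained, authentic, lr=0.02, epochs=20)

print("pre-trained accuracy:", accuracy(pretrained, held_out))
print("fine-tuned accuracy:", accuracy(fine_tuned, held_out))
```

The only point of the sketch is the hand-off: the second call to `train` starts from `pretrained` rather than from zeros, which is what distinguishes fine-tuning from training on the small set alone. Whether fine-tuning helps on this toy data depends on the assumed noise rate and learning rates.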

Original language: English
Title of host publication: Proceedings - 2021 IEEE 45th Annual Computers, Software, and Applications Conference, COMPSAC 2021
Editors: W. K. Chan, Bill Claycomb, Hiroki Takakura, Ji-Jiang Yang, Yuuichi Teranishi, Dave Towey, Sergio Segura, Hossain Shahriar, Sorel Reisman, Sheikh Iqbal Ahamed
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 864-873
Number of pages: 10
ISBN (Electronic): 9781665424639
DOI
Publication status: Published - Jul 2021
Event: 45th IEEE Annual Computers, Software, and Applications Conference, COMPSAC 2021 - Virtual, Online, Spain
Duration: 12 Jul 2021 to 16 Jul 2021

Publication series

Name: Proceedings - 2021 IEEE 45th Annual Computers, Software, and Applications Conference, COMPSAC 2021

Conference

Conference: 45th IEEE Annual Computers, Software, and Applications Conference, COMPSAC 2021
Country/Territory: Spain
City: Virtual, Online
Period: 12/07/21 to 16/07/21
