Deep learning based feature envy detection

Hui Liu; Zhifeng Xu; Yanzhen Zou

doi:10.1145/3238147.3238166

Hui Liu*, Zhifeng Xu, Yanzhen Zou

*Corresponding author for this work

School of Computer Science and Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

86 Citations (Scopus)

Abstract

Software refactoring is widely employed to improve software quality. A key step in software refactoring is to identify which part of the software should be refactored. To facilitate the identification, a number of approaches have been proposed to identify certain structures in the code (called code smells) that suggest the possibility of refactoring. Most of such approaches rely on manually designed heuristics to map manually selected source code metrics to predictions. However, it is challenging to manually select the best features, especially textual features. It is also difficult to manually construct the optimal heuristics. To this end, in this paper we propose a deep learning based novel approach to detecting feature envy, one of the most common code smells. The key insight is that deep neural networks and advanced deep learning techniques could automatically select features (especially textual features) of source code for feature envy detection, and could automatically build the complex mapping between such features and predictions. We also propose an automatic approach to generating labeled training data for the neural network based classifier, which does not require any human intervention. Evaluation results on open-source applications suggest that the proposed approach significantly improves the state-of-the-art in both detecting feature envy smells and recommending destinations for identified smelly methods.

Original language	English
Title of host publication	ASE 2018 - Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering
Editors	Christian Kastner, Marianne Huchard, Gordon Fraser
Publisher	Association for Computing Machinery, Inc
Pages	385-396
Number of pages	12
ISBN (Electronic)	9781450359375
DOIs	https://doi.org/10.1145/3238147.3238166
Publication status	Published - 3 Sept 2018
Event	33rd IEEE/ACM International Conference on Automated Software Engineering, ASE 2018 - Montpellier, France Duration: 3 Sept 2018 → 7 Sept 2018

Publication series

Name	ASE 2018 - Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering

Conference

Conference	33rd IEEE/ACM International Conference on Automated Software Engineering, ASE 2018
Country/Territory	France
City	Montpellier
Period	3/09/18 → 7/09/18

Keywords

Code smells
Deep learning
Feature envy
Software refactoring

Access to Document

10.1145/3238147.3238166

Cite this

Liu, H., Xu, Z., & Zou, Y. (2018). Deep learning based feature envy detection. In C. Kastner, M. Huchard, & G. Fraser (Eds.), ASE 2018 - Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering (pp. 385-396). (ASE 2018 - Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering). Association for Computing Machinery, Inc. https://doi.org/10.1145/3238147.3238166

Liu, Hui ; Xu, Zhifeng ; Zou, Yanzhen. / Deep learning based feature envy detection. ASE 2018 - Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering. editor / Christian Kastner ; Marianne Huchard ; Gordon Fraser. Association for Computing Machinery, Inc, 2018. pp. 385-396 (ASE 2018 - Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering).

@inproceedings{93bfad1adbbf490cb1fff70abd32e2d7,
  title = "Deep learning based feature envy detection",
  abstract = "Software refactoring is widely employed to improve software quality. A key step in software refactoring is to identify which part of the software should be refactored. To facilitate the identification, a number of approaches have been proposed to identify certain structures in the code (called code smells) that suggest the possibility of refactoring. Most of such approaches rely on manually designed heuristics to map manually selected source code metrics to predictions. However, it is challenging to manually select the best features, especially textual features. It is also difficult to manually construct the optimal heuristics. To this end, in this paper we propose a deep learning based novel approach to detecting feature envy, one of the most common code smells. The key insight is that deep neural networks and advanced deep learning techniques could automatically select features (especially textual features) of source code for feature envy detection, and could automatically build the complex mapping between such features and predictions. We also propose an automatic approach to generating labeled training data for the neural network based classifier, which does not require any human intervention. Evaluation results on open-source applications suggest that the proposed approach significantly improves the state-of-the-art in both detecting feature envy smells and recommending destinations for identified smelly methods.",
  keywords = "Code smells, Deep learning, Feature envy, Software refactoring",
  author = "Hui Liu and Zhifeng Xu and Yanzhen Zou",
  note = "Publisher Copyright: {\textcopyright} 2018 Copyright held by the owner/author(s).; 33rd IEEE/ACM International Conference on Automated Software Engineering, ASE 2018 ; Conference date: 03-09-2018 Through 07-09-2018",
  year = "2018",
  month = sep,
  day = "3",
  doi = "10.1145/3238147.3238166",
  language = "English",
  series = "ASE 2018 - Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering",
  publisher = "Association for Computing Machinery, Inc",
  pages = "385--396",
  editor = "Christian Kastner and Marianne Huchard and Gordon Fraser",
  booktitle = "ASE 2018 - Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering",
}

Liu, H, Xu, Z & Zou, Y 2018, Deep learning based feature envy detection. in C Kastner, M Huchard & G Fraser (eds), ASE 2018 - Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering. ASE 2018 - Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering, Association for Computing Machinery, Inc, pp. 385-396, 33rd IEEE/ACM International Conference on Automated Software Engineering, ASE 2018, Montpellier, France, 3/09/18. https://doi.org/10.1145/3238147.3238166

Deep learning based feature envy detection. / Liu, Hui; Xu, Zhifeng; Zou, Yanzhen.
ASE 2018 - Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering. ed. / Christian Kastner; Marianne Huchard; Gordon Fraser. Association for Computing Machinery, Inc, 2018. p. 385-396 (ASE 2018 - Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering).

TY - GEN
T1 - Deep learning based feature envy detection
AU - Liu, Hui
AU - Xu, Zhifeng
AU - Zou, Yanzhen
PY - 2018/9/3
Y1 - 2018/9/3
N2 - Software refactoring is widely employed to improve software quality. A key step in software refactoring is to identify which part of the software should be refactored. To facilitate the identification, a number of approaches have been proposed to identify certain structures in the code (called code smells) that suggest the possibility of refactoring. Most of such approaches rely on manually designed heuristics to map manually selected source code metrics to predictions. However, it is challenging to manually select the best features, especially textual features. It is also difficult to manually construct the optimal heuristics. To this end, in this paper we propose a deep learning based novel approach to detecting feature envy, one of the most common code smells. The key insight is that deep neural networks and advanced deep learning techniques could automatically select features (especially textual features) of source code for feature envy detection, and could automatically build the complex mapping between such features and predictions. We also propose an automatic approach to generating labeled training data for the neural network based classifier, which does not require any human intervention. Evaluation results on open-source applications suggest that the proposed approach significantly improves the state-of-the-art in both detecting feature envy smells and recommending destinations for identified smelly methods.

KW - Code smells
KW - Deep learning
KW - Feature envy
KW - Software refactoring
UR - http://www.scopus.com/inward/record.url?scp=85056520897&partnerID=8YFLogxK
U2 - 10.1145/3238147.3238166
DO - 10.1145/3238147.3238166
M3 - Conference contribution
AN - SCOPUS:85056520897
T3 - ASE 2018 - Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering
SP - 385
EP - 396
BT - ASE 2018 - Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering
A2 - Kastner, Christian
A2 - Huchard, Marianne
A2 - Fraser, Gordon
PB - Association for Computing Machinery, Inc
T2 - 33rd IEEE/ACM International Conference on Automated Software Engineering, ASE 2018
Y2 - 3 September 2018 through 7 September 2018
ER -

Liu H, Xu Z, Zou Y. Deep learning based feature envy detection. In Kastner C, Huchard M, Fraser G, editors, ASE 2018 - Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering. Association for Computing Machinery, Inc. 2018. p. 385-396. (ASE 2018 - Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering). doi: 10.1145/3238147.3238166
