Bug Localization with Semantic and Structural Features using Convolutional Neural Network and Cascade Forest

Authors:

Yan Xiao,

Jacky Keung,

Qing Mi,

Kwabena E. Bennin

EASE '18: Proceedings of the 22nd International Conference on Evaluation and Assessment in Software Engineering 2018

Pages 101 - 111

https://doi.org/10.1145/3210459.3210469

Published: 28 June 2018

Abstract

Background: Localizing the buggy files for a bug report is a crucial task, and exploiting the semantic and structural information in bug reports and source files can substantially improve the accuracy of bug localization techniques. Aims: To empirically evaluate and demonstrate the effects of both semantic and structural information in bug reports and source files on bug localization performance, we propose CNN_Forest, which combines a convolutional neural network with an ensemble of random forests, both of which perform well in semantic parsing and structural information extraction. Method: We first employ a convolutional neural network with multiple filters and an ensemble of random forests with multi-grained scanning to extract semantic and structural features from the word vectors derived from bug reports and source files. A subsequent cascade forest (a cascade of ensembles of random forests) is then used to extract deeper features and observe the correlated relationships between bug reports and source files. CNN_Forest is empirically evaluated on 10,754 bug reports extracted from the AspectJ, Eclipse UI, JDT, SWT, and Tomcat projects. Results: The experiments demonstrate the significance of including semantic and structural information in bug localization, and further show that CNN_Forest achieves higher Mean Average Precision and Mean Reciprocal Rank than the best results of four current state-of-the-art approaches (NPCNN, LR+WE, DNNLOC, and BugLocator). Conclusion: CNN_Forest is capable of defining the correlated relationships between bug reports and source files, and we empirically show that semantic and structural information in bug reports and source files is crucial to improving bug localization.
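The two measures named in the results, Mean Average Precision (MAP) and Mean Reciprocal Rank (MRR), can be illustrated with a minimal sketch. It uses the standard information-retrieval definitions and is not taken from the paper. The function names, the file names, and the two toy bug reports are hypothetical, and the sketch assumes each ranking covers every candidate file, so that every buggy file appears somewhere in the list.

```python
from typing import Dict, List, Set, Tuple


def reciprocal_rank(ranked_files: List[str], buggy_files: Set[str]) -> float:
    """Return 1/rank of the first truly buggy file, or 0.0 if none is ranked."""
    for rank, path in enumerate(ranked_files, start=1):
        if path in buggy_files:
            return 1.0 / rank
    return 0.0


def average_precision(ranked_files: List[str], buggy_files: Set[str]) -> float:
    """Average the precision measured at the rank of each buggy file found."""
    hits = 0
    precision_sum = 0.0
    for rank, path in enumerate(ranked_files, start=1):
        if path in buggy_files:
            hits += 1
            precision_sum += hits / rank
    # Assumption: the ranking contains all buggy files, so dividing by the
    # number of hits equals dividing by the number of buggy files.
    return precision_sum / hits if hits else 0.0


def mean_metrics(
    reports: Dict[str, Tuple[List[str], Set[str]]]
) -> Tuple[float, float]:
    """Return (MAP, MRR) averaged over all bug reports."""
    n = len(reports)
    map_score = sum(average_precision(r, b) for r, b in reports.values()) / n
    mrr_score = sum(reciprocal_rank(r, b) for r, b in reports.values()) / n
    return map_score, mrr_score


# Hypothetical toy data: a ranked file list and the true buggy files per report.
toy_reports = {
    "bug-1": (["A.java", "B.java", "C.java"], {"B.java"}),
    "bug-2": (["D.java", "E.java", "F.java"], {"D.java", "F.java"}),
}
map_score, mrr_score = mean_metrics(toy_reports)
print(f"MAP={map_score:.3f} MRR={mrr_score:.3f}")
```

MRR rewards placing the first buggy file near the top of the ranking, while MAP also accounts for where the remaining buggy files appear, which matters when one report maps to several files.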
