skip to main content
10.1145/3183440.3195092acmconferencesArticle/Chapter ViewAbstractPublication PagesicseConference Proceedingsconference-collections
poster

DWEN: deep word embedding network for duplicate bug report detection in software repositories

Published: 27 May 2018 Publication History

Abstract

Bug report filing is a major part of software maintenance. Due to extensive number of bugs filed everyday in large software projects and the asynchronous nature of bug report filing ecosystem, duplicate bug reports are filed. Capturing and tagging duplicate bug reports is crucial in order to avoid assignment of the same bug to different developers. Efforts have been made in the past to detect duplicate bug reports by using topic modelling [2], discriminative methods [5], meta-attributes [6], etc. Recently, Yang et al. [8] proposed an approach to combine word embeddings, TF-IDF and meta-attributes to compute bug similarity between two bug reports.

References

[1]
David M Blei et al. 2003. Latent dirichlet allocation. Journal of Machine Learning Research (2003).
[2]
Nathan Klein et al. 2014. New features for duplicate bug detection. In Mining Software Repositories. ACM.
[3]
Alina Lazar et al. 2014. Generating duplicate bug datasets. In Mining Software Repositories. ACM.
[4]
Tomas Mikolov et al. 2013. Efficient estimation of word representations in vector space. arXiv preprint arXiv.1301.3781.
[5]
Chengnian Sun et al. 2010. A discriminative model approach for accurate duplicate bug report retrieval. In International Conference on Software Engineering. ACM.
[6]
Chengnian Sun et al. 2011. Towards more accurate retrieval of duplicate bug reports. In Automated Software Engineering. IEEE Press.
[7]
Ashish Sureka et al. 2010. Detecting duplicate bug report using character n-gram-based features. In Asia Pacific Software Engineering Conference. IEEE Press.
[8]
Xinli Yang et al. 2016. Combining word embedding with information retrieval to recommend similar bug reports. In International Symposium on Software Reliability Engineering. IEEE.
[9]
Hugo Zaragoza et al. 2004. Microsoft Cambridge at TREC 13. In TREC.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ICSE '18: Proceedings of the 40th International Conference on Software Engineering: Companion Proceeedings
May 2018
231 pages
ISBN:9781450356633
DOI:10.1145/3183440
  • Conference Chair:
  • Michel Chaudron,
  • General Chair:
  • Ivica Crnkovic,
  • Program Chairs:
  • Marsha Chechik,
  • Mark Harman
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 May 2018

Check for updates

Qualifiers

  • Poster

Conference

ICSE '18
Sponsor:

Acceptance Rates

Overall Acceptance Rate 276 of 1,856 submissions, 15%

Upcoming Conference

ICSE 2025

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)28
  • Downloads (Last 6 weeks)11
Reflects downloads up to 23 Dec 2024

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media