Method-Level Bug Severity Prediction using Source Code Metrics and LLMs

Mashhadi, Ehsan; Ahmadvand, Hossein; Hemmati, Hadi

doi:10.5281/zenodo.8267597

Computer Science > Software Engineering

arXiv:2309.03044 (cs)

[Submitted on 6 Sep 2023]

Title:Method-Level Bug Severity Prediction using Source Code Metrics and LLMs

Authors:Ehsan Mashhadi, Hossein Ahmadvand, Hadi Hemmati

View PDF

Abstract:In the past couple of decades, significant research efforts are devoted to the prediction of software bugs. However, most existing work in this domain treats all bugs the same, which is not the case in practice. It is important for a defect prediction method to estimate the severity of the identified bugs so that the higher-severity ones get immediate attention. In this study, we investigate source code metrics, source code representation using large language models (LLMs), and their combination in predicting bug severity labels of two prominent datasets. We leverage several source metrics at method-level granularity to train eight different machine-learning models. Our results suggest that Decision Tree and Random Forest models outperform other models regarding our several evaluation metrics. We then use the pre-trained CodeBERT LLM to study the source code representations' effectiveness in predicting bug severity. CodeBERT finetuning improves the bug severity prediction results significantly in the range of 29%-140% for several evaluation metrics, compared to the best classic prediction model on source code metric. Finally, we integrate source code metrics into CodeBERT as an additional input, using our two proposed architectures, which both enhance the CodeBERT model effectiveness.

Subjects:	Software Engineering (cs.SE)
Cite as:	arXiv:2309.03044 [cs.SE]
	(or arXiv:2309.03044v1 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2309.03044
Related DOI:	https://doi.org/10.5281/zenodo.8267597

Submission history

From: Ehsan Mashhadi [view email]
[v1] Wed, 6 Sep 2023 14:38:07 UTC (4,097 KB)

Computer Science > Software Engineering

Title:Method-Level Bug Severity Prediction using Source Code Metrics and LLMs

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:Method-Level Bug Severity Prediction using Source Code Metrics and LLMs

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators