Article

Analyzing Key Features of Open Source Software Survivability with Random Forest

by Sohee Park and Gihwon Kwon *,†
Department of SW Safety and Cyber Security, Kyonggi University, Suwon-si 16227, Gyeonggi-do, Republic of Korea
* Author to whom correspondence should be addressed.
† These authors contributed equally to this work.
Appl. Sci. 2025, 15(2), 946; https://doi.org/10.3390/app15020946 (registering DOI)
Submission received: 27 November 2024 / Revised: 28 December 2024 / Accepted: 6 January 2025 / Published: 18 January 2025

Abstract

Open source software (OSS) projects rely on voluntary contributions, so their long-term survivability depends on sustained community engagement and effective problem-solving. Survivability, which is critical for maintaining project quality and trustworthiness, is closely linked to issue activity: an accumulation of unresolved issues reflects a decline in maintenance capacity and problem-solving ability. Analyzing issue retention rates therefore provides valuable insight into a project's health. This study evaluates OSS survivability by identifying the features that influence issue activity and analyzing their relationships with survivability. Kaplan–Meier survival analysis is employed to quantify issue activity and visualize trends in unresolved issue rates, providing a measure of project maintenance dynamics. A random forest model is then used to examine the relationships between issue retention rates and project features such as popularity metrics, community engagement, code complexity, and project age. The results show that stars significantly reduce issue retention rates, with rates dropping from 0.62 to 0.52 as stars increase to 4000, while larger codebases, higher cyclomatic complexity, and older project age are associated with higher unresolved issue rates, which rise by up to 15%. Forks have a nonlinear impact, initially stabilizing retention rates but increasing unresolved issues once contributions become unmanageable. By identifying these critical factors and quantifying their impacts, this research offers actionable insights that OSS project managers can use to enhance project survivability and address key maintenance challenges.
Keywords: Kaplan–Meier survival function; open source software; random forest; software maintenance; survivability
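
As a rough illustration of the Kaplan–Meier step described in the abstract, the sketch below estimates the probability that an issue is still open after t days. It assumes that "issue retention rate" corresponds to the survival function S(t), that closing an issue is the event, and that issues still open at observation time are right-censored; the paper's exact formulation is not shown here. The function name `kaplan_meier` and the sample data are hypothetical.

```python
from collections import Counter


def kaplan_meier(durations, closed):
    """Estimate S(t), the share of issues still open after t days.

    durations: days each issue was observed.
    closed:    True if the issue was closed at that time,
               False if it was still open (right-censored).
    Returns a list of (t, S(t)) pairs, one per time with a closure.
    """
    closures = Counter(t for t, c in zip(durations, closed) if c)
    exits = Counter(durations)      # closures and censorings per time
    at_risk = len(durations)        # issues still under observation
    survival = 1.0
    curve = []
    for t in sorted(exits):
        d = closures.get(t, 0)
        if d:
            # Product-limit update: S(t) = S(t-) * (1 - d / n)
            survival *= 1.0 - d / at_risk
            curve.append((t, survival))
        # Closed and censored issues both leave the risk set.
        at_risk -= exits[t]
    return curve


# Hypothetical issue lifetimes in days; not data from the study.
durations = [2, 3, 3, 5, 8, 8, 10, 12]
closed = [True, True, False, True, True, False, False, True]

curve = kaplan_meier(durations, closed)
for t, s in curve:
    print(f"day {t:>2}: estimated share of issues still open = {s:.3f}")
```

A curve that stays high for a long time indicates that issues remain unresolved, which the abstract treats as a sign of declining maintenance capacity. Per-project summaries of such a curve could then serve as the target that a random forest relates to features such as stars, forks, code size, and project age.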
