skip to main content
research-article

Athletic signature: predicting the next game lineup in collegiate basketball

Published: 14 September 2024 Publication History

Abstract

The advances in machine learning (ML) tools and techniques have enabled the non-intrusive collection and rapid analysis of massive amounts of data involving athletes in competitive collegiate sports. It has facilitated the development of services that a coach can employ in analyzing these data into actionable insights in designing training schedules and effective strategies for maximizing an athlete’s performance, while minimizing injury risk. Collegiate sports utilize data to get a competitive advantage. While game statistics are publicly available, relying on more than one form of data can help reveal a pattern. We developed a framework that considers various modalities and creates an athletic signature to predict their future performance. Our research involves the study of 42 distinct features that quantify various internal/external stressors the athletes face to characterize and estimate their athletic readiness (in the form of reactive strength index modified—RSImod) using ML algorithms. Our study, conducted over 26 weeks with 17 collegiate women’s basketball athletes, developed a framework that first performed sensitivity analysis using a hybrid approach combining the strengths of various filter-based, wrapper-based, and embedded feature importance techniques to identify the features most significantly impacting athlete readiness. These features were then categorized into four moderating variables (MVs, i.e. factors): sleep, cardiac rhythm, training strain, and travel schedule. Further, we used factor analysis to enhance interpretability and reduce computational complexity. A hybrid boosted-decision-trees-based model designed based on athlete clusters predicted future athletic readiness based on MVs with a mean squared error (MSE) of 0.0102. Partial dependence plots (PDPs) helped qualitatively learn the relationship between the moderating variables and the RSImod score. Athletic signatures, uniquely defining athlete-specific MV patterns, account for intra-individual variability, offering a better statistical basis for predicting game lineup (green/yellow/red card assignment) in combination with model predictions. SHAP (SHapley Additive exPlanations) values suggest the causative MV in order of significance for each prediction, enabling coaches to make informed decisions about training adjustments and athlete well-being, leading to performance improvement. Using the fingerprint mechanism, we created green (within 1 Standard Deviation (SD)), yellow (> 1SD and < 2SD), and red card (> 2SD) zones for athlete readiness assessment. While, this study was conducted on D-I women’s basketball, the modalities apply to several sports, such as soccer, volleyball, football, and ice hockey. This framework allows coaches to understand their athlete dynamics from a strictly data perspective, which helps them strategize their next moves, combined with their personal experience and interactions with the team.

References

[1]
Halson SL Monitoring training load to understand fatigue in athletes Sport Med 2014 44 139-147
[2]
Heishman A, Brown B, Daub B, Miller R, Freitas E, and Bemben M The influence of countermovement jump protocol on reactive strength index modified and flight time: contraction time in collegiate basketball players Sports 2019 7 2 37
[3]
Oved N, Feder A, and Reichart R Predicting in-game actions from interviews of NBA players Comput Linguist 2020 46 3 667-712
[4]
Talukder H, Vincent T, Foster G, Hu C, Huerta J, Kumar A, Malazarte M, Saldana D, Simpson S (2016) Preventing in-game injuries for nba players. In Proc. MIT Sloan Sports Analytics Conference, Boston, 2016, p. 11–12
[5]
Ghada S, Ahmed M, Seif E, et al (2017) Predicting all star player in the national basketball association using random forest. In: 2017 Intelligent Systems Conference (IntelliSys). IEEE. 2017, p. 706–713
[6]
Mikołajec K, Maszczyk A, and Zajac T Game indicators determining sports performance in the NBA J Human Kinet 2013 37 145
[7]
Nguyen NH et al. The application of machine learning and deep learning in sport: predicting NBA players’ performance and popularity” J Inf Telecommun 2021 6 217-235
[8]
Senbel S, Sharma S, Raval MS, Taber C, Nolan J, Artan NS, and Kaya T Impact of sleep and training on game performance and injury in division-1 women’s basketball Amidst the Pandemic IEEE Access 2022 10 15516-15527
[9]
Sharma SU, Divakaran S, Kaya T, Raval M (2022) A hybrid approach for interpretable game performance prediction in basketball. In 2022 International Joint Conference on Neural Networks (IJCNN). IEEE. p. 01–08
[10]
Sarlis V and Tjortjis C Sports analytics: data mining to uncover NBA player position, age, and injury impact on performance and economics Information 2024 15 4 242
[11]
Su F and Chen M Basketball players’ score prediction using artificial intelligence technology via the Internet of Things J Supercomput 2022 78 17 19138-19166
[12]
Zhu Q, Miao J, Liu J, Huang L (2024) Predict the college sports scores using a weighted BP-SVR model
[13]
Cronin J and Hansen K Strength and power predictors of sport speed J Strength Cond 2005 19 2 349-357
[14]
Kipp K, Kiely MT, Giordanelli MD, Malloy PJ, and Geiser CF Biomechanical determinants of the reactive strength index during drop jumps Int J Sports Physiol Perform 2018 13 1 44-49
[15]
Jarvis P, Turner A, Read P, and Bishop C Reactive strength index and its associations with measures of physical and sports performance: a systematic review with meta-analysis Sports Med 2022 52 2 301-330
[16]
Barker LA, Harry JR, and Mercer JA Relationships between countermovement jump ground reaction forces and jump height, reactive strength index, and jump time J Strength Cond Res 2018 32 1 248-254
[17]
Taber CB, Sharma S, Raval MS, Senbel S, Keefe A, Shah J, Patterson E, Nolan J, SertacArtan N, and Kaya T A holistic approach to performance prediction in collegiate athletics: player, team, and conference perspectives Sci Rep 2024 14 1 1162
[18]
Wu PP-Y, Sterkenburg N, Everett K, Chapman DW, White N, and Mengersen K Predicting fatigue using countermovement jump force-time signatures: PCA can distinguish neuromuscular versus metabolic fatigue PLoS ONE 2019 14 7 e0219295
[19]
Roethlingshoefer J and McConnell D Intent: a practical approach to applied sport science for athletic development 2018 Arvada Freeze Time Media
[20]
de Freitas Cruz I, Pereira LA, Kobal R, Kitamura K, Cedra C, Loturco I, and Abad CCC Perceived training load and jumping responses following nine weeks of a competitive period in young female basketball players PeerJ 2018 6
[21]
Harms N (2018) The impact of WHOOP technology on sleep, recovery, and performance in naia baseball players
[22]
Blanchard N, Skinner K, Kemp A, Scheirer W, Flynn P (2019) Keep Me In, Coach!: a computer vision perspective on assessing ACL injury risk in female athletes. In 2019 IEEE Winter conference on applications of computer vision (WACV) p. 1366–1374. IEEE
[23]
Hall MA (2000) Correlation-based feature selection of discrete and numeric class machine learning
[24]
Saarela M and Jauhiainen S Comparison of feature importance measures as explanations for classification models SN Appl Sci 2021 3 1-12
[25]
Drobnič F, Kos A, and Pustišek M On the interpretability of machine learning models and experimental feature selection in case of multicollinear data Electronics 2020 9 5 761
[26]
Kroese DP, Botev Z, Taimre T, and Vaisman R Data science and machine learning: mathematical and statistical methods 2019 Boca Raton CRC Press
[27]
BabaeeKhobdeh S, Yamaghani MR, and KhodaparastSareshkeh S Clustering of basketball players using self-organizing map neural networks J Appl Res Ind Eng 2021 8 4 412-428
[28]
Shahapure KR, Nicholas C (2020) Cluster quality analysis using silhouette score. In 2020 IEEE 7th international conference on data science and advanced analytics (DSAA) p. 747–748. IEEE
[29]
Avanijaa J Prediction of house price using xgboost regression algorithm Turk J Comput Math Educ (TURCOMAT) 2021 12 2 2151-2155
[30]
Lalwani A, Saraiya A, Singh A, Jain A, Dash T (2022) Machine learning in sports: a case study on using explainable models for predicting outcomes of volleyball matches. arXiv preprint arXiv:2206.09258
[31]
Silva AF, Oliveira R, Akyildiz Z, Yıldız M, Ocak Y, Günay M, and Clemente FM Sleep quality and training intensity in soccer players: exploring weekly variations and relationships Appl Sci 2022 12 6 2791

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Neural Computing and Applications
Neural Computing and Applications  Volume 36, Issue 34
Dec 2024
692 pages

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 14 September 2024
Accepted: 28 August 2024
Received: 26 February 2024

Author Tags

  1. Athletic signature
  2. Athlete clustering
  3. Collegiate basketball
  4. Feature sensitivity analysis
  5. Game lineup prediction
  6. Sports data analytics

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 0
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 23 Jan 2025

Other Metrics

Citations

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media