DOI: 10.1145/3379597.3387479
research-article

The Scent of Deep Learning Code: An Empirical Study

Published: 18 September 2020

Abstract

Deep learning practitioners often focus on improving model accuracy rather than the interpretability of their models. As a result, deep learning applications are inherently complex in structure, and they must continuously evolve through code changes and model updates. Given these confounding factors, developers are likely to violate recommended programming practices in their deep learning applications; in particular, code quality may suffer in the drive for higher model performance. Unfortunately, the code quality of deep learning applications has rarely been studied to date. In this paper, we conduct an empirical study of the distribution of code smells in deep learning applications. To this end, we perform a comparative analysis between deep learning and traditional open-source applications collected from GitHub. We report several major findings. First, the long lambda expression, long ternary conditional expression, and complex container comprehension smells occur frequently in deep learning projects; that is, deep learning code involves more complex or longer expressions than traditional code does. Second, the number of code smells increases across the releases of deep learning applications. Third, code smells and software bugs co-occur in the studied deep learning code, which supports our conjecture that the code quality of deep learning applications is degraded.
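The three smells named above can be illustrated with short, hypothetical Python snippets. The names, values, and thresholds here are our own illustrations, not examples drawn from the studied projects; each smelly form is paired with an equivalent smell-free refactoring.

```python
# 1. Long lambda expression: substantial logic packed into one anonymous function.
normalize = lambda x, lo, hi: (x - lo) / (hi - lo) if hi > lo else 0.0

# Clearer: a named function with the same behavior.
def normalize_clean(x, lo, hi):
    """Scale x into [0, 1]; fall back to 0.0 on a degenerate range."""
    if hi <= lo:
        return 0.0
    return (x - lo) / (hi - lo)

# 2. Long ternary conditional expression: nested conditionals on a single line,
# a pattern common in ad-hoc hyperparameter schedules.
def learning_rate(epoch):
    return 0.1 if epoch < 10 else (0.01 if epoch < 50 else 0.001)

# Clearer: an explicit conditional chain.
def learning_rate_clean(epoch):
    if epoch < 10:
        return 0.1
    if epoch < 50:
        return 0.01
    return 0.001

# 3. Complex container comprehension: nested loops and filters in one expression.
def double_positives_smelly(rows):
    return [[x * 2 for x in row if x > 0] for row in rows]

# Clearer: an explicit outer loop building the same structure.
def double_positives_clean(rows):
    out = []
    for row in rows:
        out.append([x * 2 for x in row if x > 0])
    return out
```

In each pair, the smelly and clean versions compute the same result; the refactorings trade one-line density for named, testable units, which is the kind of readability cost the study measures.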



Published In

MSR '20: Proceedings of the 17th International Conference on Mining Software Repositories
June 2020
675 pages
ISBN: 9781450375177
DOI: 10.1145/3379597

Publisher

Association for Computing Machinery, New York, NY, United States


Author Tags

  1. Deep learning
  2. code quality
  3. code smells

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

MSR '20

