research-article

Open access

Improved Program Repair Methods using Refactoring with GPT Models

Authors:

Ryosuke Ishizue,

Kazunori Sakamoto,

Hironori Washizaki,

Yoshiaki FukazawaAuthors Info & Claims

SIGCSE 2024: Proceedings of the 55th ACM Technical Symposium on Computer Science Education V. 1

Pages 569 - 575

https://doi.org/10.1145/3626252.3630875

Published: 07 March 2024 Publication History

Abstract

Teachers often utilize automatic program repair methods to provide feedback on submitted student code using model answer code. A state-of-the-art tool is Refactory, which achieves a high repair success rate and small patch size (less code repair) by refactoring code to expand the variety of correct code samples that can be referenced. However, Refactory has two major limitations. First, it cannot fix code with syntax errors. Second, it has difficulty fixing code when there are few correct submissions. Herein we propose a new method that combines Refactory and OpenAI's GPT models to address these issues and conduct a performance measurement experiment. The experiment uses a dataset consisting of 5 programming assignment problems and almost 1,800 real-life incorrect Python program submissions from 361 students for an introductory programming course at a large public university. The proposed method improves the repair success rate by 1-21% when the set of correct code samples is sufficient and the patch size is smaller than Refactory alone in 16-45% of the cases. When there was no set of correct code samples at all (only the model answer code was used as a reference for repair), method improves the repair success rate by 1-43% and the patch size is smaller than Refactory alone in 42-68% of the cases.

References

[1]

Sahil Bhatia, Pushmeet Kohli, and Rishabh Singh. 2018. Neuro-symbolic program corrector for introductory programming assignments. In Proceedings of the 40th International Conference on Software Engineering. 60--70.

Digital Library

[2]

Anderson Pinheiro Cavalcanti, Arthur Barbosa, Ruan Carvalho, Fred Freitas, Yi-Shan Tsai, and Rafael Ferreira Mello. 2021. Automatic feedback in online learning environments: A systematic literature review. Computers and Education: Artificial Intelligence, Vol. 2 (2021), 100027. https://doi.org/10.1016/j.caeai.2021.100027

[3]

Yihe Deng, Weitong Zhang, Zixiang Chen, and Quanquan Gu. 2023. Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves. (2023). arxiv: cs.CL/2311.04205

[4]

Loris D'Antoni, Roopsha Samanta, and Rishabh Singh. 2016. Qlose: Program repair with quantitative objectives. In International Conference on Computer Aided Verification. Springer, 383--401.

[5]

Claire Le Goues, Michael Pradel, and Abhik Roychoudhury. 2019. Automated Program Repair. Commun. ACM, Vol. 62, 12 (nov 2019), 56--65. https://doi.org/10.1145/3318162

Digital Library

[6]

Sumit Gulwani, Ivan Radivcek, and Florian Zuleger. 2018. Automated Clustering and Program Repair for Introductory Programming Assignments. In Proceedings of the 39th ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI 2018). Association for Computing Machinery, New York, NY, USA, 465--480. https://doi.org/10.1145/3192366.3192387

Digital Library

[7]

Yang Hu, Umair Z. Ahmed, Sergey Mechtaev, Ben Leong, and Abhik Roychoudhury. 2019. Re-Factoring Based Program Repair Applied to Programming Assignments. In 2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE). 388--398. https://doi.org/10.1109/ASE.2019.00044

Digital Library

[8]

Harshit Joshi, José Cambronero Sanchez, Sumit Gulwani, Vu Le, Gust Verbruggen, and Ivan Radivc ek. 2023. Repair is nearly generation: Multilingual program repair with llms. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 37. 5131--5140.

Digital Library

[9]

Dongsun Kim, Jaechang Nam, Jaewoo Song, and Sunghun Kim. 2013. Automatic patch generation learned from human-written patches. In 2013 35th International Conference on Software Engineering (ICSE). IEEE, 802--811.

[10]

Leping Li, Hui Liu, Kejun Li, Yanjie Jiang, and Rui Sun. 2023. Generating Concise Patches for Newly Released Programming Assignments. IEEE Transactions on Software Engineering, Vol. 49, 1 (2023), 450--467. https://doi.org/10.1109/TSE.2022.3153522

[11]

Yi Li, Shaohua Wang, and Tien N. Nguyen. 2022. DEAR: A Novel Deep Learning-Based Approach for Automated Program Repair. In Proceedings of the 44th International Conference on Software Engineering (ICSE '22). Association for Computing Machinery, New York, NY, USA, 511--523. https://doi.org/10.1145/3510003.3510177

Digital Library

[12]

Hoang Duong Thien Nguyen, Dawei Qi, Abhik Roychoudhury, and Satish Chandra. 2013. Semfix: Program repair via semantic analysis. In 2013 35th International Conference on Software Engineering (ICSE). IEEE, 772--781.

[13]

Reudismam Rolim, Gustavo Soares, Loris D'Antoni, Oleksandr Polozov, Sumit Gulwani, Rohit Gheyi, Ryo Suzuki, and Björn Hartmann. 2017. Learning syntactic program transformations from examples. In 2017 IEEE/ACM 39th International Conference on Software Engineering (ICSE). IEEE, 404--415.

Digital Library

[14]

Dominik Sobania, Martin Briesch, Carol Hanna, and Justyna Petke. 2023. An Analysis of the Automatic Bug Fixing Performance of ChatGPT. (2023). arxiv: cs.SE/2301.08653

[15]

Haoye Tian, Weiqi Lu, Tsz On Li, Xunzhu Tang, Shing-Chi Cheung, Jacques Klein, and Tegawendé F. Bissyandé. 2023. Is ChatGPT the Ultimate Programming Assistant -- How far is it? (2023). arxiv: cs.SE/2304.11938

[16]

Nikolai Tillmann, Jonathan De Halleux, Tao Xie, Sumit Gulwani, and Judith Bishop. 2013. Teaching and learning programming and software engineering via interactive gaming. In 2013 35th International Conference on Software Engineering (ICSE). IEEE, 1117--1126.

Digital Library

[17]

Szymon Wasik, Maciej Antczak, Jan Badura, Artur Laskowski, and Tomasz Sternal. 2018. A Survey on Online Judge Systems and Their Applications. ACM Comput. Surv., Vol. 51, 1, Article 3 (jan 2018), bibinfonumpages34 pages. https://doi.org/10.1145/3143560 io

Digital Library

Cited By

Lohr DKeuning HKiesler N(2025) You're (Not) My Type‐ Can LLMs Generate Feedback of Specific Types for Introductory Programming Tasks? Journal of Computer Assisted Learning10.1111/jcal.1310741:1Online publication date: 6-Jan-2025
https://doi.org/10.1111/jcal.13107
Humble N(2024)Risk management strategy for generative AI in computing education: how to handle the strengths, weaknesses, opportunities, and threats?International Journal of Educational Technology in Higher Education10.1186/s41239-024-00494-x21:1Online publication date: 11-Dec-2024
https://doi.org/10.1186/s41239-024-00494-x
Mišić MDodović M(2024)An assessment of large language models for OpenMP-based code parallelization: a user perspectiveJournal of Big Data10.1186/s40537-024-01019-z11:1Online publication date: 10-Nov-2024
https://doi.org/10.1186/s40537-024-01019-z
Show More Cited By

Index Terms

Improved Program Repair Methods using Refactoring with GPT Models
1. Applied computing
  1. Education
    1. Computer-assisted instruction
2. Social and professional topics
  1. Professional topics
    1. Computing education
      1. Computing education programs
        Computer science education

Recommendations

Automated program repair through the evolution of assembly code
ASE '10: Proceedings of the 25th IEEE/ACM International Conference on Automated Software Engineering

A method is described for automatically repairing legacy software at the assembly code level using evolutionary computation. The technique is demonstrated on Java byte code and x86 assembly programs, showing how to find program variations that correct ...
RepairNet: Contextual Sequence-to-Sequence Network for Automated Program Repair
Artificial Intelligence in Education
Abstract
Compile-time errors can wreak havoc for programmers – seasoned and novice. Often developers spend a lot of time debugging them. An automated system to repair such errors can be a useful aid to the developers for their productivity. In this work, ...
Constraint-Based Program Debugging Using Data Structure Repair
ICST '11: Proceedings of the 2011 Fourth IEEE International Conference on Software Testing, Verification and Validation

Developers have used data structure repair over the last few decades as an effective means to recover on-the-fly from errors in program state. Traditional repair techniques were based on dedicated repair routines, whereas more recent techniques have ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

SIGCSE 2024: Proceedings of the 55th ACM Technical Symposium on Computer Science Education V. 1

March 2024

1583 pages

ISBN:9798400704239

DOI:10.1145/3626252

General Chairs:
Ben Stephenson
University of Calgary, Canada6000230660002306
,
Jeffrey A. Stone
Penn State University6000143960001439
,
Program Chairs:
Lina Battestilli
North Carolina State University, USA6000492360004923
,
Samuel A. Rebelsky
Grinnell College60028806
,
Libby Shoop
Macalester College60028787

Copyright © 2024 Owner/Author.

This work is licensed under a Creative Commons Attribution International 4.0 License.

Sponsors

SIGCSE: ACM Special Interest Group on Computer Science Education

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 March 2024

Check for updates

Author Tags

Qualifiers

Research-article

Conference

SIGCSE 2024

Sponsor:

SIGCSE

SIGCSE 2024: The 55th ACM Technical Symposium on Computer Science Education

March 20 - 23, 2024

OR, Portland, USA

Acceptance Rates

Overall Acceptance Rate 1,595 of 4,542 submissions, 35%

Upcoming Conference

SIGCSE TS 2025

Sponsor:
sigcse

The 56th ACM Technical Symposium on Computer Science Education

February 26 - March 1, 2025

Pittsburgh , PA , USA

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

6
Total Citations
View Citations
692
Total Downloads

Downloads (Last 12 months)692
Downloads (Last 6 weeks)67

Reflects downloads up to 24 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Lohr DKeuning HKiesler N(2025) You're (Not) My Type‐ Can LLMs Generate Feedback of Specific Types for Introductory Programming Tasks? Journal of Computer Assisted Learning10.1111/jcal.1310741:1Online publication date: 6-Jan-2025
https://doi.org/10.1111/jcal.13107
Humble N(2024)Risk management strategy for generative AI in computing education: how to handle the strengths, weaknesses, opportunities, and threats?International Journal of Educational Technology in Higher Education10.1186/s41239-024-00494-x21:1Online publication date: 11-Dec-2024
https://doi.org/10.1186/s41239-024-00494-x
Mišić MDodović M(2024)An assessment of large language models for OpenMP-based code parallelization: a user perspectiveJournal of Big Data10.1186/s40537-024-01019-z11:1Online publication date: 10-Nov-2024
https://doi.org/10.1186/s40537-024-01019-z
Van Praet LHoobergs JSchrijvers THermans FBohrer R(2024)ASSIST: Automated Feedback Generation for Syntax and Logical Errors in Programming ExercisesProceedings of the 2024 ACM SIGPLAN International Symposium on SPLASH-E10.1145/3689493.3689981(66-76)Online publication date: 17-Oct-2024
https://dl.acm.org/doi/10.1145/3689493.3689981
Washizaki HYoshioka N(2024)AI Engineering Continuum: Concept and Expected Foundation2024 15th International Conference on Information, Intelligence, Systems & Applications (IISA)10.1109/IISA62523.2024.10786645(1-4)Online publication date: 17-Jul-2024
https://doi.org/10.1109/IISA62523.2024.10786645
de-Fitero-Dominguez DGarcia-Lopez EGarcia-Cabot AMartinez-Herraiz J(2024)Enhanced automated code vulnerability repair using large language modelsEngineering Applications of Artificial Intelligence10.1016/j.engappai.2024.109291138:PAOnline publication date: 1-Dec-2024
https://dl.acm.org/doi/10.1016/j.engappai.2024.109291

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Figures

Tables

Media

View Table of Conten