Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- short-paperNovember 2023
Detecting Overfitting of Machine Learning Techniques for Automatic Vulnerability Detection
ESEC/FSE 2023: Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software EngineeringPages 2189–2191https://doi.org/10.1145/3611643.3617845Recent results of machine learning for automatic vulnerability detection have been very promising indeed: Given only the source code of a function f, models trained by machine learning techniques can decide if f contains a security flaw with up to 70% ...
- research-articleNovember 2023
DeepRover: A Query-Efficient Blackbox Attack for Deep Neural Networks
ESEC/FSE 2023: Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software EngineeringPages 1384–1394https://doi.org/10.1145/3611643.3616370Deep neural networks (DNNs) achieved a significant performance breakthrough over the past decade and have been widely adopted in various industrial domains. However, a fundamental problem regarding DNN robustness is still not adequately addressed, which ...
- research-articleNovember 2023
Efficient Text-to-Code Retrieval with Cascaded Fast and Slow Transformer Models
ESEC/FSE 2023: Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software EngineeringPages 388–400https://doi.org/10.1145/3611643.3616369The goal of semantic code search or text-to-code search is to retrieve a semantically relevant code snippet from an existing code database using a natural language query. When constructing a practical semantic code search system, existing approaches fail ...
- research-articleNovember 2023
DeMinify: Neural Variable Name Recovery and Type Inference
ESEC/FSE 2023: Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software EngineeringPages 758–770https://doi.org/10.1145/3611643.3616368To avoid the exposure of original source code, the variable names deployed in the wild are often replaced by short, meaningless names, thus making the code difficult to understand and be analyzed. We introduce DeMinify, a Deep-Learning (DL)-based ...
An Extensive Study on Adversarial Attack against Pre-trained Models of Code
ESEC/FSE 2023: Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software EngineeringPages 489–501https://doi.org/10.1145/3611643.3616356Transformer-based pre-trained models of code (PTMC) have been widely utilized and have achieved state-of-the-art performance in many mission-critical applications. However, they can be vulnerable to adversarial attacks through identifier substitution or ...
-
- research-articleNovember 2023
Log Parsing with Generalization Ability under New Log Types
ESEC/FSE 2023: Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software EngineeringPages 425–437https://doi.org/10.1145/3611643.3616355Log parsing, which converts semi-structured logs into structured logs, is the first step for automated log analysis. Existing parsers are still unsatisfactory in real-world systems due to new log types in new-coming logs. In practice, available logs ...
- research-articleNovember 2023
Multilingual Code Co-evolution using Large Language Models
ESEC/FSE 2023: Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software EngineeringPages 695–707https://doi.org/10.1145/3611643.3616350Many software projects implement APIs and algorithms in multiple programming languages. Maintaining such projects is tiresome, as developers have to ensure that any change (e.g., a bug fix or a new feature) is being propagated, timely and without errors, ...
BigDataflow: A Distributed Interprocedural Dataflow Analysis Framework
- Zewen Sun,
- Duanchen Xu,
- Yiyu Zhang,
- Yun Qi,
- Yueyang Wang,
- Zhiqiang Zuo,
- Zhaokang Wang,
- Yue Li,
- Xuandong Li,
- Qingda Lu,
- Wenwen Peng,
- Shengjian Guo
ESEC/FSE 2023: Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software EngineeringPages 1431–1443https://doi.org/10.1145/3611643.3616348Abstract: Apart from forming the backbone of compiler optimization, static dataflow analysis has been widely applied in a vast variety of applications, such as bug detection, privacy analysis, program comprehension, etc. Despite its importance, ...
Understanding the Bug Characteristics and Fix Strategies of Federated Learning Systems
ESEC/FSE 2023: Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software EngineeringPages 1358–1370https://doi.org/10.1145/3611643.3616347Federated learning (FL) is an emerging machine learning paradigm that aims to address the problem of isolated data islands. To preserve privacy, FL allows machine learning models and deep neural networks to be trained from decentralized data kept ...
- research-articleNovember 2023
Commit-Level, Neural Vulnerability Detection and Assessment
ESEC/FSE 2023: Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software EngineeringPages 1024–1036https://doi.org/10.1145/3611643.3616346Software Vulnerabilities (SVs) are security flaws that are exploitable in cyber-attacks. Delay in the detection and assessment of SVs might cause serious consequences due to the unknown impacts on the attacked systems. The state-of-the-art approaches ...
NeuRI: Diversifying DNN Generation via Inductive Rule Inference
ESEC/FSE 2023: Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software EngineeringPages 657–669https://doi.org/10.1145/3611643.3616337Deep Learning (DL) is prevalently used in various industries to improve decision-making and automate processes, driven by the ever-evolving DL libraries and compilers. The correctness of DL systems is crucial for trust in DL applications. As such, the ...
- research-articleNovember 2023
DecompoVision: Reliability Analysis of Machine Vision Components through Decomposition and Reuse
ESEC/FSE 2023: Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software EngineeringPages 541–552https://doi.org/10.1145/3611643.3616333Analyzing reliability of Machine Vision Components (MVC) against scene changes (such as rain or fog) in their operational environment is crucial for safety-critical applications. Safety analysis relies on the availability of precisely specified and, ...
A Generative and Mutational Approach for Synthesizing Bug-Exposing Test Cases to Guide Compiler Fuzzing
ESEC/FSE 2023: Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software EngineeringPages 1127–1139https://doi.org/10.1145/3611643.3616332Random test case generation, or fuzzing, is a viable means for uncovering compiler bugs. Unfortunately, compiler fuzzing can be time-consuming and inefficient with purely randomly generated test cases due to the complexity of modern compilers. We present ...
TransMap: Pinpointing Mistakes in Neural Code Translation
ESEC/FSE 2023: Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software EngineeringPages 999–1011https://doi.org/10.1145/3611643.3616322Automated code translation between programming languages can greatly reduce the human effort needed in learning new languages or in migrating code. Recent neural machine translation models, such as Codex, have been shown to be effective on many code ...
Pitfalls in Experiments with DNN4SE: An Analysis of the State of the Practice
ESEC/FSE 2023: Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software EngineeringPages 528–540https://doi.org/10.1145/3611643.3616320Software engineering (SE) techniques are increasingly relying on deep learning approaches to support many SE tasks, from bug triaging to code generation. To assess the efficacy of such techniques researchers typically perform controlled experiments. ...
- research-articleNovember 2023
Outage-Watch: Early Prediction of Outages using Extreme Event Regularizer
- Shubham Agarwal,
- Sarthak Chakraborty,
- Shaddy Garg,
- Sumit Bisht,
- Chahat Jain,
- Ashritha Gonuguntla,
- Shiv Saini
ESEC/FSE 2023: Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software EngineeringPages 682–694https://doi.org/10.1145/3611643.3616316Cloud services are omnipresent and critical cloud service failure is a fact of life. In order to retain customers and prevent revenue loss, it is important to provide high reliability guarantees for these services. One way to do this is by predicting ...
Revisiting Neural Program Smoothing for Fuzzing
ESEC/FSE 2023: Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software EngineeringPages 133–145https://doi.org/10.1145/3611643.3616308Testing with randomly generated inputs (fuzzing) has gained significant traction due to its capacity to expose program vulnerabilities automatically. Fuzz testing campaigns generate large amounts of data, making them ideal for the application of machine ...
A Practical Human Labeling Method for Online Just-in-Time Software Defect Prediction
ESEC/FSE 2023: Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software EngineeringPages 605–617https://doi.org/10.1145/3611643.3616307Just-in-Time Software Defect Prediction (JIT-SDP) can be seen as an online learning problem where additional software changes produced over time may be labeled and used to create training examples. These training examples form a data stream that can be ...
The EarlyBIRD Catches the Bug: On Exploiting Early Layers of Encoder Models for More Efficient Code Classification
ESEC/FSE 2023: Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software EngineeringPages 895–907https://doi.org/10.1145/3611643.3616304The use of modern Natural Language Processing (NLP) techniques has shown to be beneficial for software engineering tasks, such as vulnerability detection and type inference. However, training deep NLP models requires significant computational resources. ...
- research-articleNovember 2023
CodeMark: Imperceptible Watermarking for Code Datasets against Neural Code Completion Models
ESEC/FSE 2023: Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software EngineeringPages 1561–1572https://doi.org/10.1145/3611643.3616297Code datasets are of immense value for training neural-network-based code completion models, where companies or organizations have made substantial investments to establish and process these datasets. Unluckily, these datasets, either built for ...