Copiloting the Copilots: Fusing Large Language Models with Completion Engines for Automated Program Repair (ESEC/FSE 2023 - Research Papers)

Sun 3 - Sat 9 December 2023 San Francisco, California, United States

Who

Yuxiang Wei, Chunqiu Steven Xia, Lingming Zhang

Track

ESEC/FSE 2023 Research Papers

Time Zone

The program is currently displayed in (GMT-08:00) Pacific Time (US & Canada).

Use conference time zone: (GMT-08:00) Pacific Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Tue 5 Dec 2023 11:45 - 12:00 at Golden Gate C3 - Automated Repair I Chair(s): Shin Hwei Tan

Abstract

During Automated Program Repair (APR), it can be challenging to synthesize correct patches for real-world systems in general-purpose programming languages. Recent Large Language Models (LLMs) have been shown to be helpful “copilots” in assisting developers with various coding tasks, and have also been directly applied for patch synthesis. However, most LLMs treat programs as sequences of tokens, meaning that they are ignorant of the underlying semantics constraints of the target programming language. This results in plenty of statically invalid generated patches, impeding the practicality of the technique. Therefore, we propose Repilot, a framework to further copilot the AI “copilots” (i.e., LLMs) by synthesizing more valid patches during the repair process. Our key insight is that many LLMs produce outputs autoregressively (i.e., token by token), resembling human writing programs, which can be significantly boosted and guided through a Completion Engine. Repilot synergistically synthesizes a candidate patch through the interaction between an LLM and a Completion Engine, which 1) prunes away infeasible tokens suggested by the LLM and 2) proactively completes the token based on the suggestions provided by the Completion Engine. Our evaluation on a subset of the widely-used Defects4j 1.2 and 2.0 datasets shows that Repilot fixes 66 and 50 bugs, respectively, surpassing the best-performing baseline by 14 and 16 bugs fixed. More importantly, Repilot is capable of producing more valid and correct patches than the base LLM when given the same generation budget.

Link to Preprint

https://arxiv.org/pdf/2309.00608.pdf

Yuxiang Wei

University of Illinois at Urbana-Champaign

United States

Chunqiu Steven Xia

University of Illinois at Urbana-Champaign

United States

Lingming Zhang

University of Illinois at Urbana-Champaign

United States

Media

Time Zone

The program is currently displayed in (GMT-08:00) Pacific Time (US & Canada).

Use conference time zone: (GMT-08:00) Pacific Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Tue 5 Dec
Displayed time zone: Pacific Time (US & Canada) change

11:00 - 12:30	Automated Repair IResearch Papers / Industry Papers at Golden Gate C3 Chair(s): Shin Hwei Tan Concordia University

11:00 15m Talk		RAP-Gen: Retrieval-Augmented Patch Generation with CodeT5 for Automatic Program Repair Research Papers Weishi Wang Nanyang Technological University, Yue Wang Salesforce Research, Shafiq Joty Salesforce Research, Steven C.H. Hoi Salesforce Research Asia Media Attached
11:15 15m Talk		From Leaks to Fixes: Automated Repairs for Resource Leak Warnings Research Papers Akshay Utture Uber Technologies Inc., Jens Palsberg University of California, Los Angeles (UCLA) Pre-print Media Attached
11:30 15m Talk		InferFix: End-to-End Program Repair with LLMs Industry Papers Matthew Jin , Syed Shahriar University of California at Los Angeles, Michele Tufano Microsoft, Xin Shi Microsoft Corporation, Shuai Lu Microsoft Research, Neel Sundaresan Microsoft, Alexey Svyatkovskiy Microsoft DOI
11:45 15m Research paper		Copiloting the Copilots: Fusing Large Language Models with Completion Engines for Automated Program Repair Research Papers Yuxiang Wei University of Illinois at Urbana-Champaign, Chunqiu Steven Xia University of Illinois at Urbana-Champaign, Lingming Zhang University of Illinois at Urbana-Champaign Pre-print Media Attached
12:00 15m Talk		SmartFix: Fixing Vulnerable Smart Contracts by Accelerating Generate-and-Verify Repair using Statistical Models Research Papers Sunbeom So Korea University, Hakjoo Oh Korea University Media Attached
12:15 15m Talk		Automatically Resolving Dependency-Conflict Building Failures via Behavior-Consistent Loosening of Library Version Constraints Research Papers Huiyan Wang Nanjing University, Shuguan Liu Nanjing University, Lingyu Zhang Nanjing University, Chang Xu Nanjing University Media Attached