Last Diff Analyzer: Multi-language Automated Approver for Behavior-Preserving Code Revisions (ESEC/FSE 2023 - Industry Papers)

Sun 3 - Sat 9 December 2023 San Francisco, California, United States

Who

Yuxin Wang, Adam Welc, Lazaro Clapp, Lingchao Chen

Track

ESEC/FSE 2023 Industry Papers

Time Zone

The program is currently displayed in (GMT-08:00) Pacific Time (US & Canada).

Use conference time zone: (GMT-08:00) Pacific Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Tue 5 Dec 2023 14:15 - 14:30 at Golden Gate C2 - Software Evolution I Chair(s): Rangeet Pan

Abstract

Code review is a crucial step in ensuring the quality and maintainability of software systems. However, this process can be time-consuming and resource-intensive, especially in large-scale projects where a significant number of code changes are submitted every day. Fortunately, not all code changes require human reviews, as some may only contain syntactic modifications that do not alter the behavior of the system, such as format changes, variable / function renamings, and constant extractions.

In this paper, we propose a multi-language automated code approver — Last Diff Analyzer for Go and Java, which is able to detect if a reviewable incremental unit of code change (diff) contains only changes that do not modify system behavior. It is built on top of a novel multi-language static analysis framework that unifies common features of multiple languages while keeping unique language constructs separate. This makes it easy to extend to other languages such as TypeScript, Kotlin, Swift, and others. Besides skipping unnecessary code reviews, Last Diff Analyzer could be further applied to skip certain resource-intensive end-to-end (E2E) tests for auto-approved diffs for significant reduction of resource usage. We have deployed the analyzer at scale within Uber, and data collected in production shows that approximately 15% of analyzed diffs are auto-approved weekly for code reviews. Furthermore, 13.5% reduction in server node usage dedicated to E2E tests (measured by number of executed E2E tests) is observed as a result of skipping E2E tests, compared to the node usage if Last Diff Analyzer were not enabled.

DOI

https://doi.org/10.1145/3611643.3613870

Yuxin Wang

Uber Technologies

United States

Adam Welc

Mysten Labs

United States

Lazaro Clapp

Uber Technologies Inc

Lingchao Chen

Uber Technologies

United States

Media

Time Zone

The program is currently displayed in (GMT-08:00) Pacific Time (US & Canada).

Use conference time zone: (GMT-08:00) Pacific Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Tue 5 Dec
Displayed time zone: Pacific Time (US & Canada) change

14:00 - 15:30	Software Evolution IIndustry Papers / Research Papers / Demonstrations at Golden Gate C2 Chair(s): Rangeet Pan IBM Research

14:00 15m Talk		Understanding Solidity Event Logging Practices in the Wild Research Papers Lantian Li Shandong University, Yejian Liang Shandong University, Zhihao Liu Shandong University, Zhongxing Yu Shandong University Media Attached
14:15 15m Talk		Last Diff Analyzer: Multi-language Automated Approver for Behavior-Preserving Code Revisions Industry Papers Yuxin Wang Uber Technologies, Adam Welc Mysten Labs, Lazaro Clapp Uber Technologies Inc, Lingchao Chen Uber Technologies DOI Media Attached
14:30 15m Talk		EvaCRC: Evaluating Code Review Comments Research Papers Lanxin Yang Nanjing University, Jinwei Xu Nanjing University, YiFan Zhang Nanjing University, He Zhang Nanjing University, Alberto Bacchelli University of Zurich Media Attached
14:45 15m Talk		HyperDiff: Computing Source Code Diffs at Scale Research Papers Quentin Le-dilavrec Univ. Rennes, IRISA, INRIA, Djamel Eddine Khelladi CNRS, IRISA, University of Rennes, Arnaud Blouin Univ Rennes, INSA Rennes, Inria, CNRS, IRISA, Jean-Marc Jézéquel Univ Rennes - IRISA Media Attached
15:00 7m Talk		npm-follower: A Complete Dataset Tracking the NPM Ecosystem Demonstrations Donald Pinckney Northeastern University, Federico Cassano Northeastern University, Arjun Guha Northeastern University and Roblox, Jonathan Bell Northeastern University Media Attached
15:08 7m Talk		Issue Report Validation in an Industrial Context Industry Papers Ethem Utku Aktas Softtech Inc., Ebru Cakmak Microsoft EMEA, Mete Cihad Inan Softtech Research and Development, Cemal Yilmaz Sabancı University DOI Media Attached
15:15 15m Talk		Dead Code Removal at Meta: Automatically Deleting Millions of Lines of Code and Petabytes of Deprecated Data Industry Papers Will Shackleton Meta, Katriel Cohn-Gordon Meta, Peter C Rigby Meta; Concordia University, Rui Abreu Meta, James Gill Meta, Nachiappan Nagappan Meta, Karim Nakad Meta, Ioannis Papagiannis Meta, Luke Petre Meta, Giorgi Megreli Meta, Patrick Riggs Meta, James Saindon Meta DOI