Tue 5 Dec 2023 15:15 - 15:30 at Golden Gate C2 - Software Evolution I Chair(s): Rangeet Pan

Software constantly evolves in response to user needs: new features are built, deployed, mature and grow old, and eventually their usage drops enough to merit switching them off. In any large codebase, this feature lifecycle can naturally lead to retaining unnecessary code and data. Removing these respects users’ privacy expectations and helps engineers work efficiently. In prior software engineering research, we have found little evidence of code deprecation or dead-code removal at industrial scale. We describe the Systematic Code and Asset Removal Framework (SCARF), a product deprecation system to assist engineers working in large codebases. SCARF identifies unused code and data assets and safely removes them. It operates fully automatically, including committing code and dropping database tables. It also gathers developer input where it cannot take automated actions, leading to further removals. Dead code removal increases the quality and consistency of large codebases, aids with knowledge management and improves reliability. SCARF has had an important impact at Meta. In the last year alone, it has removed petabytes of data across 12.8 million distinct assets, and deleted over 104 million lines of code.
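
The abstract does not say how SCARF decides that an asset is unused, so the following is only a generic illustration of the idea, not SCARF's method. It is a minimal sketch that assumes dependencies are available as a graph and treats anything unreachable from a set of live entry points as a removal candidate. The function name `find_unreachable` and the asset names in the example graph are hypothetical.

```python
from collections import deque


def find_unreachable(deps, roots):
    """Return the assets that no root can reach through dependency edges.

    deps maps each asset to the assets it depends on.
    roots are assets assumed to be live (for example, entry points).
    """
    seen = set(roots)
    queue = deque(roots)
    while queue:
        node = queue.popleft()
        for dep in deps.get(node, ()):
            if dep not in seen:
                seen.add(dep)
                queue.append(dep)
    # Collect every asset mentioned anywhere, as a key or as a dependency.
    all_assets = set(deps) | {d for targets in deps.values() for d in targets}
    return all_assets - seen


# Hypothetical dependency graph mixing code and data assets.
deps = {
    "main": ["checkout", "search"],
    "checkout": ["payments_table"],
    "search": [],
    "old_feature": ["old_feature_table", "old_helper"],
    "old_helper": ["old_feature"],  # cycle: the two keep referencing each other
}

dead = find_unreachable(deps, roots=["main"])
print(sorted(dead))  # ['old_feature', 'old_feature_table', 'old_helper']
```

A reachability sweep is used here rather than a simple count of incoming references because the two old assets reference each other: each has a nonzero reference count, yet neither is reachable from a live entry point.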

Tue 5 Dec

Displayed time zone: Pacific Time (US & Canada)

14:00 - 15:30
Software Evolution I
Industry Papers / Research Papers / Demonstrations at Golden Gate C2
Chair(s): Rangeet Pan IBM Research
14:00
15m
Talk
Understanding Solidity Event Logging Practices in the Wild
Research Papers
Lantian Li Shandong University, Yejian Liang Shandong University, Zhihao Liu Shandong University, Zhongxing Yu Shandong University
14:15
15m
Talk
Last Diff Analyzer: Multi-language Automated Approver for Behavior-Preserving Code Revisions
Industry Papers
Yuxin Wang Uber Technologies, Adam Welc Mysten Labs, Lazaro Clapp Uber Technologies Inc, Lingchao Chen Uber Technologies
14:30
15m
Talk
EvaCRC: Evaluating Code Review Comments
Research Papers
Lanxin Yang Nanjing University, Jinwei Xu Nanjing University, YiFan Zhang Nanjing University, He Zhang Nanjing University, Alberto Bacchelli University of Zurich
14:45
15m
Talk
HyperDiff: Computing Source Code Diffs at Scale
Research Papers
Quentin Le-dilavrec Univ. Rennes, IRISA, INRIA, Djamel Eddine Khelladi CNRS, IRISA, University of Rennes, Arnaud Blouin Univ Rennes, INSA Rennes, Inria, CNRS, IRISA, Jean-Marc Jézéquel Univ Rennes - IRISA
15:00
7m
Talk
npm-follower: A Complete Dataset Tracking the NPM Ecosystem
Demonstrations
Donald Pinckney Northeastern University, Federico Cassano Northeastern University, Arjun Guha Northeastern University and Roblox, Jonathan Bell Northeastern University
15:08
7m
Talk
Issue Report Validation in an Industrial Context
Industry Papers
Ethem Utku Aktas Softtech Inc., Ebru Cakmak Microsoft EMEA, Mete Cihad Inan Softtech Research and Development, Cemal Yilmaz Sabancı University
15:15
15m
Talk
Dead Code Removal at Meta: Automatically Deleting Millions of Lines of Code and Petabytes of Deprecated Data
Industry Papers