Tue 5 Dec 2023 16:15 - 16:30 at Golden Gate C3 - Fault Diagnosis and Root Cause Analysis I Chair(s): Akond Rahman

Performance degradation due to misconfiguration in software systems that violates SLOs (service-level objectives) is commonplace. Diagnosing and explaining the root causes of such performance violations in configurable software systems is often challenging due to their increasing complexity. Although there are many tools and techniques for diagnosing performance violations, they provide limited evidence to attribute causes of observed performance violations to specific configurations. This is because the configuration is not originally considered in those tools. This paper proposes DiagConfig, specifically designed to conduct configuration diagnosis of performance violations. It leverages static code analysis to track configuration option propagation, identifies performance-sensitive options, detects performance violations, and constructs cause-effect chains that help stakeholders better understand the relationship between configuration and performance violations. Experimental evaluations with eight real-world software demonstrate that DiagConfig produces fewer false positives than a state-of-the-art documentation analysis-based tool (i.e., 5 vs 41) in the identification of performance-sensitive options, and outperforms a statistics-based debugging tool in the diagnosis of performance violations caused by configuration changes, offering more comprehensive results (recall: 0.892 vs 0.289). Moreover, we also show that DiagConfig can accelerate auto-tuning by compressing configuration space.

Tue 5 Dec

Displayed time zone: Pacific Time (US & Canada) change

16:00 - 18:00
Fault Diagnosis and Root Cause Analysis IResearch Papers / Journal First / Industry Papers at Golden Gate C3
Chair(s): Akond Rahman Auburn University
16:00
15m
Talk
[Remote] Nezha: Interpretable Fine-Grained Root Causes Analysis for Microservices on Multi-Modal Observability Data
Research Papers
Guangba  Yu Sun Yat-Sen University, Pengfei Chen Sun Yat-Sen University, Yufeng Li Sun Yat-sen University, Hongyang Chen School of Computer Science and Engineering, Sun Yat-sen University, Xiaoyun Li Sun Yat-sen University, Zibin Zheng Sun Yat-sen University
Pre-print
16:15
15m
Full-paper
[Remote] DiagConfig: Configuration Diagnosis of Performance Violations in Configurable Software Systems
Research Papers
Zhiming Chen Sun Yat-sen University, Pengfei Chen Sun Yat-Sen University, Guangba  Yu Sun Yat-Sen University, Zilong He Sun Yat-Sen University, Genting Mai Sun Yat-sen University, Peipei Wang ByteDance Infrastructure System Lab
Pre-print Media Attached
16:30
15m
Talk
[Remote] Pre-training Code Representation with Semantic Flow Graph for Effective Bug Localization
Research Papers
Yali Du Shandong University, Zhongxing Yu Shandong University
Media Attached
16:45
15m
Talk
[Remote] A Practical Human Labeling Method for Online Just-in-Time Software Defect Prediction
Research Papers
Liyan Song Southern University of Science and Technology, China, Leandro Minku University of Birmingham, Cong Teng Southern University of Science and Technology, Xin Yao Southern University of Science and Technology
Pre-print Media Attached
17:00
15m
Talk
Trace Diagnostics for Signal-Based Temporal Properties
Journal First
Chaima Boufaied University of Ottawa, Claudio Menghi University of Bergamo; McMaster University, Domenico Bianculli University of Luxembourg, Lionel Briand University of Ottawa, Canada / University of Luxembourg, Luxembourg
Media Attached
17:15
15m
Talk
TraceDiag: Adaptive, Interpretable, and Efficient Root Cause Analysis on Large-Scale Microservice Systems
Industry Papers
Ruomeng Ding Microsoft, Chaoyun Zhang Microsoft, Lu Wang Microsoft Research, Yong Xu Microsoft Research, Minghua Ma Microsoft Research, Xiaomin Wu Microsoft, Meng Zhang , Qingjun Chen Microsoft 365, Xin Gao Microsoft 365, Xuedong Gao Microsoft 365, Hao Fan , Saravan Rajmohan Microsoft 365, Qingwei Lin Microsoft, Dongmei Zhang Microsoft Research
DOI Media Attached
17:30
15m
Talk
Triggering Modes in Spectrum-Based Multi-location Fault Localization
Industry Papers
Tung Dao Cvent, Na Meng Virginia Tech, ThanhVu Nguyen George Mason University
DOI Media Attached
17:45
15m
Talk
Automata-based Trace Analysis for Aiding Diagnosing GUI Testing Tools for Android
Research Papers
Enze Ma East China Normal University, Shan Huang East China Normal University, weigang he East China Normal University, Ting Su East China Normal University, Jue Wang Nanjing University, Huiyu Liu East China Normal University, Geguang Pu East China Normal University, Zhendong Su ETH Zurich
Media Attached