Paper published in a book (Scientific congresses, symposiums and conference proceedings)
BugDoc: Algorithms to Debug Computational Processes
DE PAULA LOURENCO, Raoni; Freire, Juliana; Shasha, Dennis
2020In SIGMOD 2020 - Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data
Peer reviewed
 

Files


Full Text
3318464.3389763.pdf
Author postprint (1.48 MB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
provenance; workflow debugging; Computational process; Experimental evaluation; Large scale simulations; New approaches; Processing software; Reproducibilities; Scientific experiments; State of the art; Software; Information Systems; Computer Science - Databases
Abstract :
[en] Data analysis for scientific experiments and enterprises, large-scale simulations, and machine learning tasks all entail the use of complex computational pipelines to reach quantitative and qualitative conclusions. If some of the activities in a pipeline produce erroneous outputs, the pipeline may fail to execute or produce incorrect results. Inferring the root cause(s) of such failures is challenging, usually requiring time and much human thought, while still being error-prone. We propose a new approach that makes use of iteration and provenance to automatically infer the root causes and derive succinct explanations of failures. Through a detailed experimental evaluation, we assess the cost, precision, and recall of our approach compared to the state of the art. Our experimental data and processing software is available for use, reproducibility, and enhancement.
Disciplines :
Computer science
Author, co-author :
DE PAULA LOURENCO, Raoni  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SerVal ; NYU - New York University [US-NY]
Freire, Juliana;  New York University, New York, United States
Shasha, Dennis;  New York University, New York, United States
External co-authors :
yes
Language :
English
Title :
BugDoc: Algorithms to Debug Computational Processes
Publication date :
14 June 2020
Event name :
Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data
Event place :
Portland, Usa
Event date :
14-06-2020 => 19-06-2020
Audience :
International
Main work title :
SIGMOD 2020 - Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data
Publisher :
Association for Computing Machinery
ISBN/EAN :
978-1-4503-6735-6
Peer reviewed :
Peer reviewed
Funders :
ACM SIGMOD
Funding text :
Acknowledgments. We thank Data X-Ray and Explanation Tables authors for sharing their code with us. We are also grateful to Fernando Chirigati, Neel Dey, and Peter Bailis for providing the real-world pipelines. This work has been supported in part by NSF grants MCB-1158273, IOS-1339362, and MCB-1412232, CNPq (Brazil) grant 209623/2014-4, the DARPA D3M program, and NYU WIRELESS.
Commentary :
To appear in SIGMOD 2020. arXiv admin note: text overlap with arXiv:2002.04640
Available on ORBilu :
since 22 November 2023

Statistics


Number of views
76 (0 by Unilu)
Number of downloads
21 (0 by Unilu)

Scopus citations®
 
14
Scopus citations®
without self-citations
11
OpenCitations
 
4
OpenAlex citations
 
9
WoS citations
 
7

Bibliography


Similar publications



Contact ORBilu