Paper published in a book (Scientific congresses, symposiums and conference proceedings)
Debugging machine learning pipelines
DE PAULA LOURENCO, Raoni; Freire, Juliana; Shasha, Dennis
2019In Proceedings of the 3rd Workshop on Data Management for End-To-End Machine Learning, DEEM 2019 - In conjunction with the 2019 ACM SIGMOD/PODS Conference
Peer reviewed
 

Files


Full Text
3329486.3329489.pdf
Author postprint (1.04 MB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
Error prones; Experimental evaluation; New approaches; Reproducibilities; Root cause; Root cause of failures; Source codes; State of the art; Software; Information Systems; Computer Science - Learning; Computer Science - Databases; Statistics - Machine Learning
Abstract :
[en] Machine learning tasks entail the use of complex computational pipelines to reach quantitative and qualitative conclusions. If some of the activities in a pipeline produce erroneous or uninformative outputs, the pipeline may fail or produce incorrect results. Inferring the root cause of failures and unexpected behavior is challenging, usually requiring much human thought, and is both time consuming and error prone. We propose a new approach that makes use of iteration and provenance to automatically infer the root causes and derive succinct explanations of failures. Through a detailed experimental evaluation, we assess the cost, precision, and recall of our approach compared to the state of the art. Our source code and experimental data will be available for reproducibility and enhancement.
Disciplines :
Computer science
Author, co-author :
DE PAULA LOURENCO, Raoni  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SerVal ; NYU - New York University [US-NY]
Freire, Juliana;  New York University, United States
Shasha, Dennis;  New York University, United States
External co-authors :
yes
Language :
English
Title :
Debugging machine learning pipelines
Publication date :
30 June 2019
Event name :
Proceedings of the 3rd International Workshop on Data Management for End-to-End Machine Learning
Event place :
Amsterdam, Nld
Event date :
30-06-2019
Main work title :
Proceedings of the 3rd Workshop on Data Management for End-To-End Machine Learning, DEEM 2019 - In conjunction with the 2019 ACM SIGMOD/PODS Conference
Publisher :
Association for Computing Machinery
ISBN/EAN :
978-1-4503-6797-4
Peer reviewed :
Peer reviewed
Funders :
ACM Special Interest Group on Management of Data (SIGMOD)
Commentary :
10 pages
Available on ORBilu :
since 22 November 2023

Statistics


Number of views
35 (1 by Unilu)
Number of downloads
19 (0 by Unilu)

Scopus citations®
 
6
Scopus citations®
without self-citations
5
OpenCitations
 
10
OpenAlex citations
 
24

Bibliography


Similar publications



Contact ORBilu