Paper published in a journal (Scientific congresses, symposiums and conference proceedings)
Transforming IoT Data Preprocessing: A Holistic, Normalized and Distributed Approach
Tawakuli, Amal; Kaiser, Daniel; Engel, Thomas
2022, in The Fifth International Workshop on Data: Acquisition To Analysis
Peer reviewed
 

Files


Full Text
deltawing.pdf
Author preprint (1.18 MB)

Details



Keywords :
Data Preprocessing; Data Quality; Edge-Cloud Collaborative Systems; Internet of Things; Data Cleaning; Normalization
Abstract :
[en] Data preprocessing is an integral part of Artificial Intelligence (AI) pipelines. It transforms raw data into input data that fulfill algorithmic criteria and improve prediction accuracy. As the adoption of the Internet of Things (IoT) gains momentum, the volume of data generated at the edge is increasing exponentially and far exceeds any expansion of infrastructure. Social responsibilities and regulations (e.g., GDPR) must also be adhered to when handling IoT data. In addition, we are currently witnessing a shift towards distributing AI to the edge. These factors make the distribution of data preprocessing to the edge an urgent requirement. In this paper, we introduce a modern data preprocessing framework that consists of two main parts. The first part is a design tool that reduces the complexity and cost of the data preprocessing phase for AI via generalization and normalization. The design tool is a standard template that maps specific techniques into abstract categories and highlights dependencies between them. In addition, it presents a holistic notion of data preprocessing that is not limited to data cleaning. The second part is an IoT tool that adopts the edge-cloud collaboration model to progressively improve the quality of the data. It includes a synchronization mechanism that ensures adaptation to changes in data characteristics and a coordination mechanism that ensures correct and complete execution of preprocessing plans between the cloud and the edge. The paper includes an empirical analysis of the framework using a developed prototype and an automotive use case. Our results demonstrate reductions in resource consumption (e.g., energy, bandwidth) while maintaining the value and integrity of the data.
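Illustrative sketch (not taken from the paper): the abstract describes a template that maps techniques into abstract categories with dependencies between them, and a coordination mechanism that checks that a preprocessing plan is executed correctly and completely across edge and cloud. One way this idea might look in Python is shown below. The category names, the edge/cloud split, and the functions `build_plan` and `is_complete` are all hypothetical and chosen only for illustration; the actual framework may work quite differently.

```python
from graphlib import TopologicalSorter

# Hypothetical abstract categories (not the paper's). Each key maps to the
# set of categories that must run before it.
DEPENDS_ON = {
    "cleaning": set(),
    "integration": {"cleaning"},
    "transformation": {"integration"},
    "reduction": {"transformation"},
}

# Hypothetical placement: categories assumed to run on the edge device.
EDGE_CATEGORIES = {"cleaning", "integration"}


def build_plan(depends_on, edge_categories):
    """Order categories so dependencies run first, then tag each step
    with the tier (edge or cloud) assumed to execute it."""
    order = list(TopologicalSorter(depends_on).static_order())
    return [(step, "edge" if step in edge_categories else "cloud") for step in order]


def is_complete(plan, executed):
    """Coordination check: the reported steps match the plan exactly, in order."""
    return [step for step, _ in plan] == list(executed)


plan = build_plan(DEPENDS_ON, EDGE_CATEGORIES)
for step, tier in plan:
    print(f"{step}: {tier}")
```

Representing the dependencies as a graph means the execution order is derived rather than hand-written, so adding a category only requires declaring what it depends on.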
Disciplines :
Computer science
Author, co-author :
Tawakuli, Amal ;  University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS)
Kaiser, Daniel ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > PI Engel
Engel, Thomas ;  University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS)
External co-authors :
no
Language :
English
Title :
Transforming IoT Data Preprocessing: A Holistic, Normalized and Distributed Approach
Publication date :
2022
Event name :
The Fifth International Workshop on Data: Acquisition To Analysis
Event place :
Boston, United States
Event date :
6 November 2022 to 9 November 2022
Audience :
International
Journal title :
The Fifth International Workshop on Data: Acquisition To Analysis
Peer reviewed :
Peer reviewed
Available on ORBilu :
since 14 October 2022
