Reference : Transforming IoT Data Preprocessing: A Holistic, Normalized and Distributed Approach
Scientific congresses, symposiums and conference proceedings : Paper published in a journal
Engineering, computing & technology : Computer science
http://hdl.handle.net/10993/52454
Transforming IoT Data Preprocessing: A Holistic, Normalized and Distributed Approach
English
Tawakuli, Amal mailto [University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS) >]
Kaiser, Daniel mailto [University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > PI Engel >]
Engel, Thomas mailto [University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS) >]
2022
The Fifth International Workshop on Data: Acquisition To Analysis
Yes
International
The Fifth International Workshop on Data: Acquisition To Analysis
06-11-2022 to 09-11-2022
Boston
USA
[en] Data Preprocessing ; Data Quality ; Edge-Cloud Collaborative Systems ; Internet of Things ; Data Cleaning ; Normalization
[en] Data preprocessing is an integral part of Artificial Intelligence (AI) pipelines. It transforms raw data into input data that fulfill algorithmic criteria and improve prediction accuracy. As the adoption of Internet of Things (IoT) gains more momentum, the data volume generated from the edge is exponentially increasing that far exceeds any expansion of infrastructure. Social responsibilities and regulations (e.g., GDPR) must also be adhered when handling IoT data. In addition, we are currently witnessing a shift towards distributing AI to the edge. The aforementioned reasons render the distribution of data preprocessing to the edge an urgent requirement. In this paper, we introduce a modern data preprocessing framework that consists of two main parts. Part1 is a design tool that reduces the complexity and costs of the data preprocessing phase for AI via generalization and normalization. The design tool is a standard template that maps specific techniques into abstract categories and highlights dependencies between them. In addition, it presents a holistic notion of data preprocessing that is not limited to data cleaning. The second part is an IoT tool that adopts the edge-cloud collaboration model to progressively improve the quality of the data. It includes a synchronization mechanism that ensures adaptation to changes in data characteristics and a coordination mechanism that ensures correct and complete execution of preprocessing plans between the cloud and the edge. The paper includes an empirical analysis of the framework using a developed prototype and an automotive use-case. Our results demonstrate reductions in resource consumption (e.g., energy, bandwidth) while maintaining the value and integrity of the data.
Researchers ; Professionals ; Students ; General public
http://hdl.handle.net/10993/52454
10.1145/3560905.3567762

File(s) associated to this reference

Fulltext file(s):

FileCommentaryVersionSizeAccess
Limited access
deltawing.pdfAuthor preprint1.15 MBRequest a copy

Bookmark and Share SFX Query

All documents in ORBilu are protected by a user license.