Article (Scientific journals)
Preserving data privacy in machine learning systems
EL MESTARI, Soumia Zohra; LENZINI, Gabriele; DEMIRCI, Huseyin
2024, in Computers and Security, 137, p. 103605
Peer Reviewed verified by ORBi
 

Files


Full Text
1-s2.0-S0167404823005151-main.pdf
Author postprint (1.49 MB) Creative Commons License - Attribution, Non-Commercial

All documents in ORBilu are protected by a user license.




Details



Keywords :
Trustworthy machine learning; Machine learning; Differential privacy; Homomorphic encryption; Functional encryption; Secure multiparty computation; Privacy threats
Abstract :
The wide adoption of Machine Learning to solve a large set of real-life problems came with the need to collect and process large volumes of data, some of which are considered personal and sensitive, raising serious concerns about data protection. Privacy-enhancing technologies (PETs) are often indicated as a solution to protect personal data and to achieve the general trustworthiness required by current EU regulations on data protection and AI. However, an off-the-shelf application of PETs is insufficient to ensure a high quality of data protection, which one first needs to understand. This work systematically discusses the risks against data protection in modern Machine Learning systems, taking the original perspective of the data owners, that is, those who hold the various data sets, data models, or both, throughout the machine learning life cycle, and considering the different Machine Learning architectures. It argues that the origin of the threats, the risks against the data, and the level of protection offered by PETs depend on the data processing phase, the role of the parties involved, and the architecture where the machine learning systems are deployed. By offering a framework in which to discuss privacy and confidentiality risks for data owners, and by identifying and assessing privacy-preserving countermeasures for machine learning, this work could facilitate the discussion about compliance with EU regulations and directives. We discuss current challenges and research questions that are still unsolved in the field. In this respect, this paper provides researchers and developers working on machine learning with a comprehensive body of knowledge to let them advance the science of data protection in the machine learning field as well as in closely related fields such as Artificial Intelligence.
Research center :
Interdisciplinary Centre for Security, Reliability and Trust (SnT) > IRiSC - Socio-Technical Cybersecurity
Disciplines :
Computer science
Author, co-author :
EL MESTARI, Soumia Zohra; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SnT) > IRiSC
LENZINI, Gabriele; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SnT) > IRiSC
DEMIRCI, Huseyin; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SnT) > IRiSC
External co-authors :
no
Language :
English
Title :
Preserving data privacy in machine learning systems
Publication date :
February 2024
Journal title :
Computers and Security
ISSN :
0167-4048
Publisher :
Elsevier BV
Volume :
137
Pages :
103605
Peer reviewed :
Peer Reviewed verified by ORBi
Focus Area :
Computational Sciences
European Projects :
H2020 - 956562 - LeADS - Legality Attentive Data Scientists
Funders :
European Commission
Union Européenne
Funding number :
956562
Funding text :
This work has been supported by the EU 956562, MSCA-ITN-2020 - Innovative Training Networks, “Legality Attentive Data Scientists” (LeADS) project.
Available on ORBilu :
since 19 December 2023

Statistics


Number of views
188 (14 by Unilu)
Number of downloads
539 (8 by Unilu)

Scopus citations®
35
Scopus citations® without self-citations
35
OpenAlex citations
34

