Paper published on a website (Scientific congresses, symposiums and conference proceedings)
Striking the Balance: Generalization vs. Memorization in Anonymization and De-anonymization through LLMs
KOKMEL, Meliane Angele; ABBAS, Antragama Ewa; TCHAPPI HAMAN, Igor
2025, Proceedings of The 8th International Conference on Emerging Data and Industry (EDI40)
Peer reviewed
 

Files


Full Text
Kokmel_et_al_2025_Striking the Balance_Generalization vs. Memorization in Anonymization and De-anonymization through LLMs.pdf
Publisher postprint (478.13 kB)

Details



Keywords :
Anonymization; Diversity in AI; Generalization; LLMs; De-anonymization
Abstract :
Text anonymization aims to enable the secure sharing of information between parties. One of the main challenges in data anonymization is balancing data privacy against data utility. To address this challenge, recent studies have explored the use of Large Language Models (LLMs), which have shown improved performance on datasets from Europe. Building on these findings, this paper creates a dataset from a less explored part of the world, specifically Africa, to assess the relevance of LLMs on diverse datasets and to discuss how well the results generalize. Additionally, this paper proposes an evaluation framework for assessing various anonymization techniques, including those utilizing LLMs. The performance of these techniques is assessed using several metrics, such as BERTScore for semantic evaluation and Information Loss for utility preservation.
Research center :
Interdisciplinary Centre for Security, Reliability and Trust (SnT) > FINATRAX - Digital Financial Services and Cross-organizational Digital Transformations
Disciplines :
Management information systems
Computer science
Author, co-author :
KOKMEL, Meliane Angele ; University of Luxembourg > Faculty of Science, Technology and Medicine > Department of Computer Science > Team Leon VAN DER TORRE
ABBAS, Antragama Ewa ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SnT) > FINATRAX
TCHAPPI HAMAN, Igor ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SnT) > FINATRAX
External co-authors :
no
Language :
English
Title :
Striking the Balance: Generalization vs. Memorization in Anonymization and De-anonymization through LLMs
Publication date :
2025
Event name :
Proceedings of The 8th International Conference on Emerging Data and Industry (EDI40)
Event place :
Patras, Greece
Event date :
22-04-2025 to 24-04-2025
By request :
Yes
Audience :
International
Peer reviewed :
Peer reviewed
Focus Area :
Security, Reliability and Trust
Development Goals :
9. Industry, innovation and infrastructure
FnR Project :
FNR13342933 - DFS - PayPal-FNR PEARL Chair in Digital Financial Services, 2019 (01/01/2020-31/12/2024) - Gilbert Fridgen
Name of the research project :
R-AGR-3728 - PEARL/IS/13342933/DFS - FRIDGEN Gilbert
Funders :
FNR - Luxembourg National Research Fund
Funding number :
13342933
Funding text :
This research was funded in whole by the Luxembourg National Research Fund (FNR) and PayPal, PEARL grant reference 13342933/Gilbert Fridgen. For the purpose of open access, and in fulfillment of the obligations arising from the grant agreement, the author has applied a Creative Commons Attribution 4.0 International (CC BY 4.0) license to any Author Accepted Manuscript version arising from this submission.
Available on ORBilu :
since 07 August 2025

