Paper published in a book (Scientific congresses, symposiums and conference proceedings)
Augmenting Dark Patterns Text Data by Leveraging Large Language Models: A Multi-agent Framework and Parameter-Efficient Fine-Tuning
KOCYIGIT, Emre; LIGA, Davide; LENZINI, Gabriele
2025 • In Mualla, Yazan (Ed.) Advances in Explainability, Agents, and Large Language Models - 1st International Workshop on Causality, Agents and Large Models, CALM 2024, Proceedings
dark patterns; data augmentation; large language models; llama3; parameter-efficient fine-tuning; Agent parameters; Dark pattern; Data augmentation; Fine tuning; Language model; Large language model; Llama3; Multiagent framework; Parameter-efficient fine-tuning; Text data; Computer Science (all); Mathematics (all)
Abstract :
[en] Dark patterns, i.e., deceptive design patterns that employ manipulative strategies to deceive or force online users to take a decision against their interests, are widely available in online world. However, the variety and quantity of structured and labelled dark pattern datasets, which are critical for automated dark pattern detection, particularly for AI-based detection models, are limited. In this study, we leverage Large Language Models’ (LLM) sophisticated text data generation ability and propose a dark pattern text data augmentation method by utilizing a state of art open source language model and multi-agents framework, which has generator and controller models. Evaluation of the augmentation demonstrates that while increasing the data size, our proposal-based augmented data preserves the same dark pattern characteristics of the source data and maintains its diversity. We set forth that dark pattern text data can be generated even based on a few examples via prompt engineering techniques on the LLMs. We also show that our augmented data can be used to fine-tune pre-trained language models using Low-Rank Adaptation to enhance their robustness in detecting dark patterns.
Disciplines :
Computer science
Author, co-author :
KOCYIGIT, Emre ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > IRiSC
LIGA, Davide ; University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS)
LENZINI, Gabriele ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > IRiSC
External co-authors :
no
Language :
English
Title :
Augmenting Dark Patterns Text Data by Leveraging Large Language Models: A Multi-agent Framework and Parameter-Efficient Fine-Tuning
Publication date :
25 April 2025
Event name :
International Workshop on Causality, Agents and Large Models
Event place :
Kyoto, Japan
Event date :
18-11-2024 => 19-11-2024
Main work title :
Advances in Explainability, Agents, and Large Language Models - 1st International Workshop on Causality, Agents and Large Models, CALM 2024, Proceedings
Editor :
Mualla, Yazan
Publisher :
Springer Science and Business Media Deutschland GmbH
Deceptive Patterns (aka Dark Patterns)-spreading awareness since 2010 — decep-tive.design. https://www.deceptive.design. Accessed 14 June 2024
GitHub-eugeneyan/open-llms: A list of open LLMs available for commercial use. — github.com. https://github.com/eugeneyan/open-llms. Accessed 10 June 2024
Hugging Face-The AI community building the future. — huggingface.co. https://huggingface.co Accessed 24 June 2024
Introducing Meta Llama 3: The most capable openly available LLM to date — ai.meta.com. https://ai.meta.com/blog/meta-llama-3/. Accessed 10 June 2024
Meta Llama 3 — llama.meta.com. https://llama.meta.com/llama3/Accessed 24 June 2024
Dark commercial patterns. https://www.oecd-ilibrary.org/docserver/44f5e846-en.pdf?expires=1707456299&id=id&accname=guest& checksum=063A07EB53611E9EB1941F5DECEF38C3 (2022)
Dark pattern visual detection dataset. https://universe.roboflow.com/syd-help/dark-pattern-_visual-detection (2023). 07 June 2024
Guidelines 03/2022 on deceptive design patterns in social media platform interfaces: how to recognise and avoid them-version 2.0. https://edpb.europa. eu/system/files/2023-02/edpb_03-2022_guidelines_on_deceptive_design_ patterns_in_social_media_platform_interfaces_v2_en_0.pdf (2023). Accessed 10 Nov 2023
Bayer, M., Kaufhold, M.A., Buchhold, B., Keller, M., Dallmeyer, J., Reuter, C.: Data augmentation in natural language processing: a novel text generation approach for long and short text classifiers. Int. J. Mach. Learn. Cybern. 14(1), 135– 150 (2023)
Bayer, M., Kaufhold, M.A., Reuter, C.: A survey on data augmentation for text classification. ACM Comput. Surv. 55(7), 1–39 (2022)
Bösch, C., Erb, B., Kargl, F., Kopp, H., Pfattheicher, S.: Tales from the dark side: Privacy dark strategies and privacy dark patterns. Proce. Priv. Enhancing Technol. (2016)
Cara, C., et al.: Dark patterns in the media: a systematic review. Netw. Intell. Stud. 7(14), 105–113 (2019)
Chalkidis, I., Fergadiotis, M., Malakasiotis, P., Aletras, N., Androutsopou-los, I.: Legal-bert: The muppets straight out of law school. arXiv preprint arXiv:2010.02559 (2020)
Chang, Y., et al.: A survey on evaluation of large language models. ACM Trans. Intell. Syst. Technol. (2023)
Chang, Y., et al.: A survey on evaluation of large language models. ACM Trans. Intell. Syst. Technol. 15(3), 1–45 (2024)
Chen, J., et al.: Unveiling the tricks: automated detection of dark patterns in mobile applications. In: Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology, pp. 1–20 (2023)
Chlap, P., Min, H., Vandenberg, N., Dowling, J., Holloway, L., Haworth, A.: A review of medical image data augmentation techniques for deep learning applications. J. Med. Imaging Radiat. Oncol. 65(5), 545–563 (2021)
Commission, F.T.: Bringing dark patterns to light (2022)
Dai, H., et al.: Auggpt: Leveraging chatgpt for text data augmentation. arXiv preprint arXiv:2302.13007 (2023)
Di Geronimo, L., Braz, L., Fregnan, E., Palomba, F., Bacchelli, A.: Ui dark patterns and where to find them: a study on mobile applications and user perception. In: Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems, pp. 1–14 (2020)
Dubiel, M., Sergeeva, A., Leiva, L.A.: Impact of voice fidelity on decision making: A potential dark pattern? arXiv preprint arXiv:2402.07010 (2024)
Ekin, S.: Prompt engineering for chatgpt: a quick guide to techniques, tips, and best practices. Authorea Preprints (2023)
Goceri, E.: Medical image data augmentation: techniques, comparisons and interpretations. Artif. Intell. Rev. 56(11), 12561–12605 (2023)
Gray, C.M., Kou, Y., Battles, B., Hoggatt, J., Toombs, A.L.: The dark (patterns) side of UX design. In: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, pp. 1–14 (2018)
Gray, C.M., Santos, C., Bielova, N., Mildner, T.: An ontology of dark patterns knowledge: Foundations, definitions, and a pathway for shared knowledge-building. In: 2024 CHI Conference on Human Factors in Computing Systems (in press)
Hu, E.J., et al.: Lora: Low-rank adaptation of large language models. arXiv preprint arXiv:2106.09685 (2021)
for Justice, D.G., Commission), C.E., et al.: Behavioural study on unfair commercial practices in the digital environment: dark patterns and manipulative personalisation: final report. Publications Office of the European Union, LU (2022). https://data.europa.eu/doi/10.2838/859030
Khurana, D., Koli, A., Khatter, K., Singh, S.: Natural language processing: state of the art, current trends and challenges. Multimedia Tools Appl. 82(3), 3713–3744 (2023)
Kocyigit, E., Rossi, A., Lenzini, G.: Towards assessing features of dark patterns in cookie consent processes. In: IFIP International Summer School on Privacy and Identity Management, pp. 165–183. Springer (2022)
Kocyigit, E., Rossi, A., Lenzini, G.: A systematic approach for a reliable detection of deceptive design patterns through measurable HCI features. In: Proceedings of the 2024 European Symposium on Usable Security, pp. 290–308, New York, NY, USA (2024). Association for Computing Machinery. https://doi.org/10.1145/3688459.3688475
Krauss, V., et al.: What makes XR dark? examining emerging dark patterns in augmented and virtual reality through expert co-design. ACM Trans. Comput. Human Interact. (2024)
Li, B., Hou, Y., Che, W.: Data augmentation approaches in natural language processing: a survey. AI Open 3, 71–90 (2022)
Liang, X., et al.: Task oriented in-domain data augmentation. arXiv preprint arXiv:2406.16694 (2024)
Liga, D., Robaldo, L.: Fine-tuning GPT-3 for legal rule classification. Comput. Law Secur. Rev. 51, 105864 (2023)
Lin, X., et al.: Data-efficient fine-tuning for llm-based recommendation. arXiv preprint arXiv:2401.17197 (2024)
Mansur, S.H., Salma, S., Awofisayo, D., Moran, K.: Aidui: toward automated recognition of dark patterns in user interfaces. In: 2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE), pp. 1958–1970. IEEE (2023)
Mathur, A., et al.: Dark patterns at scale: Findings from a crawl of 11k shopping websites. Proce. ACM Human-comput. Interact. 3(CSCW), 1–32 (2019)
Mathur, A., Kshirsagar, M., Mayer, J.: What makes a dark pattern… dark? Design attributes, normative considerations, and measurement methods. In: Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems, pp. 1–18 (2021)
Mumuni, A., Mumuni, F.: Data augmentation: a comprehensive survey of modern approaches. Array 16, 100258 (2022)
Naheyan, T., Oyibo, K.: The effect of dark patterns and user knowledge on user experience and decision-making. In: International Conference on Persuasive Technology, pp. 190–206. Springer (2024)
Narayanan, A., Mathur, A., Chetty, M., Kshirsagar, M.: Dark patterns: past, present, and future: the evolution of tricky user interfaces. Queue 18(2), 67–92 (2020)
Naveed, H., Anwar, S., Hayat, M., Javed, K., Mian, A.: Survey: image mixing and deleting for data augmentation. Eng. Appl. Artif. Intell. 131, 107791 (2024)
Potel-Saville, M., Da Rocha, M.: From dark patterns to fair patterns? Usable taxonomy to contribute solving the issue with countermeasures. In: Annual Privacy Forum, pp. 145–165. Springer (2023)
Saghiri, A.M., Vahidipour, S.M., Jabbarpour, M.R., Sookhak, M., Forestiero, A.: A survey of artificial intelligence challenges: analyzing the definitions, relationships, and evolutions. Appl. Sci. 12(8), 4054 (2022)
Santos, C., Rossi, A.: The emergence of dark patterns as a legal concept in case law. Internet Policy Rev. (2023). https://policyreview.info/articles/news/emergence-of-dark-patterns-as-a-legal-concept
Shorten, C., Khoshgoftaar, T.M., Furht, B.: Text data augmentation for deep learning. J. Big Data 8(1), 101 (2021)
Soe, T.H., Nordberg, O.E., Guribye, F., Slavkovik, M.: Circumvention by design-dark patterns in cookie consent for online news outlets. In: Proceedings of the 11th Nordic Conference On Human-computer Interaction: Shaping Experiences, shaping society, pp. 1–12 (2020)
Touvron, H., et al.: Llama: Open and efficient foundation language models. arXiv preprint arXiv:2302.13971 (2023)
Touvron, H., et al.: Llama 2: Open foundation and fine-tuned chat models. arXiv preprint arXiv:2307.09288 (2023)
Wang, J., Perez, L., et al.: The effectiveness of data augmentation in image classification using deep learning. Convolutional Neural Netw. Vis. Recognit. 11(2017), 1–8 (2017)
Wei, J., et al.: Emergent abilities of large language models. arXiv preprint arXiv:2206.07682 (2022)
Wolf, T., et al.: Huggingface’s transformers: State-of-the-art natural language processing. arXiv preprint arXiv:1910.03771 (2019)
Yada, Y., Feng, J., Matsumoto, T., Fukushima, N., Kido, F., Yamana, H.: Dark patterns in e-commerce: a dataset and its baseline evaluations. In: 2022 IEEE International Conference on Big Data (Big Data), pp. 3015–3022. IEEE (2022)
Zhao, W.X., et al.: A survey of large language models. arXiv preprint arXiv:2303.18223 (2023)