Article (Scientific journals)
Unsupervised Modelling of E-Customers’ Profiles: Multiple Correspondence Analysis with Hierarchical Clustering of Principal Components and Machine Learning Classifiers
Vrhovac, V.; OROSNJAK, Marko; Ristić, K. et al.
2024In Mathematics, 12 (23)
Peer Reviewed verified by ORBi
 

Files


Full Text
mathematics-12-03794.pdf
Author postprint (5.7 MB) Creative Commons License - Attribution
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
customer profiles; demographics; e-commerce; hierarchical clustering; machine learning; multiple correspondence analysis; user preferences
Abstract :
[en] The rapid growth of e-commerce has transformed customer behaviors, demanding deeper insights into how demographic factors shape online user preferences. This study performed a threefold analysis to understand the impact of these changes. Firstly, this study investigated how demographic factors (e.g., age, gender, education) influence e-customer preferences in Serbia. From a sample of n = 906 respondents, conditional dependencies between demographics and user preferences were tested. From a hypothetical framework of 24 tested hypotheses, this study successfully rejected 8/24 (with p < 0.05), suggesting a high association between demographics with purchase frequency and reasons for quitting the purchase. However, although the reported test statistics suggested an association, understanding how interactions between categories shape e-customer profiles was still required. Therefore, the second part of this study considers an MCA-HCPC (Multiple Correspondence Analysis with Hierarchical Clustering on Principal Components) to identify user profiles. The analysis revealed three main clusters: (1) young, female, unemployed e-customers driven mainly by customer reviews; (2) retirees and older adults with infrequent purchases, hesitant to buy without experiencing the product in person; and (3) employed, highly educated, male, middle-aged adults who prioritize fast and accurate delivery over price. In the third stage, the clusters are used as labels for Machine Learning (ML) classification tasks. Particularly, Gradient Boosting Machine (GBM), Decision Tree (DT), k-Nearest Neighbors (kNN), Gaussian Naïve Bayes (GNB), Random Forest (RF), and Support Vector Machine (SVM) were used. The results suggested that GBM, RF, and SVM had high classification performance in identifying user profiles. Lastly, after performing Permutation Feature Importance (PFI), the findings suggested that age, work status, education, and income are the main determinants of shaping e-customer profiles and developing marketing strategies. © 2024 by the authors.
Disciplines :
Marketing
Computer science
Mathematics
Physical, chemical, mathematical & earth Sciences: Multidisciplinary, general & others
Author, co-author :
Vrhovac, V.;  Department of Industrial Engineering and Engineering Management, Faculty of Technical Sciences, University of Novi Sad, Trg Dositeja Obradovića 6, 21000 Novi Sad, Serbia
OROSNJAK, Marko  ;  University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Engineering (DoE)
Ristić, K.
Sremčev, N.
Jocanović, M.
Spajić, J.
Brkljač, N.
External co-authors :
yes
Language :
English
Title :
Unsupervised Modelling of E-Customers’ Profiles: Multiple Correspondence Analysis with Hierarchical Clustering of Principal Components and Machine Learning Classifiers
Publication date :
2024
Journal title :
Mathematics
eISSN :
2227-7390
Publisher :
Multidisciplinary Digital Publishing Institute (MDPI)
Volume :
12
Issue :
23
Peer reviewed :
Peer Reviewed verified by ORBi
Focus Area :
Computational Sciences
Development Goals :
17. Partnerships for the goals
Funders :
Ministry of Science, Technological Development and Innovation
Funding number :
451-03-65/2024-03/200156
Funding text :
This research has been supported by the Ministry of Science, Technological Development and Innovation (Contract No. 451-03-65/2024-03/200156) and the Faculty of Technical Sciences, University of Novi Sad through project \u201CScientific and Artistic Research Work of Researchers in Teaching and Associate Positions at the Faculty of Technical Sciences, University of Novi Sad\u201D (No. 01-3394/1).
Available on ORBilu :
since 20 January 2025

Statistics


Number of views
116 (0 by Unilu)
Number of downloads
35 (0 by Unilu)

Scopus citations®
 
2
Scopus citations®
without self-citations
1
OpenCitations
 
0
OpenAlex citations
 
1

Bibliography


Similar publications



Contact ORBilu