TY - GEN
T1 - Customer Segmentation and Classification Using K-Modes Clustering with Ensemble Learning
AU - Niloy, Shahriar Rahman
AU - Hasan, Toushif Muktashid
AU - Apu, Md Saiduzzaman
AU - Hasan, Rakibul
AU - Shahin, Kamrul Islam
AU - Nguyen, Huu Hoa
AU - Farid, Dewan Md
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
PY - 2025
Y1 - 2025
N2 - In the contemporary landscape of business intelligence and market analysis, customer segmentation serves as a pivotal tool for understanding consumer behavior and preferences. This paper delves into the application of advanced machine learning techniques, specifically K-Modes clustering and ensemble learning with AdaBoost, for the purpose of customer segmentation and classification. The utilization of K-Modes clustering, an extension of the K-Means algorithm tailored for categorical data, facilitates the identification of distinct groups within a heterogeneous customer base. By incorporating categorical variables, K-Modes accommodates the inherent diversity in customer attributes such as demographic information, purchase history, and product preferences. Furthermore, this research integrates ensemble learning techniques, particularly AdaBoost, to enhance the accuracy and robustness of the segmentation process. Through a comprehensive empirical analysis, conducted on a real-world dataset sourced from Kaggle, the proposed methodology demonstrates superior performance compared to traditional clustering approaches. The experimental results showcase the effectiveness of K-Modes clustering combined with AdaBoost ensemble learning in accurately segmenting customers into meaningful groups, thereby enabling businesses to gain deeper insights into consumer behavior and preferences.
AB - In the contemporary landscape of business intelligence and market analysis, customer segmentation serves as a pivotal tool for understanding consumer behavior and preferences. This paper delves into the application of advanced machine learning techniques, specifically K-Modes clustering and ensemble learning with AdaBoost, for the purpose of customer segmentation and classification. The utilization of K-Modes clustering, an extension of the K-Means algorithm tailored for categorical data, facilitates the identification of distinct groups within a heterogeneous customer base. By incorporating categorical variables, K-Modes accommodates the inherent diversity in customer attributes such as demographic information, purchase history, and product preferences. Furthermore, this research integrates ensemble learning techniques, particularly AdaBoost, to enhance the accuracy and robustness of the segmentation process. Through a comprehensive empirical analysis, conducted on a real-world dataset sourced from Kaggle, the proposed methodology demonstrates superior performance compared to traditional clustering approaches. The experimental results showcase the effectiveness of K-Modes clustering combined with AdaBoost ensemble learning in accurately segmenting customers into meaningful groups, thereby enabling businesses to gain deeper insights into consumer behavior and preferences.
KW - Clustering
KW - Customer Segmentation
KW - Ensemble Learning
KW - Machine Learning
U2 - 10.1007/978-981-97-9613-7_1
DO - 10.1007/978-981-97-9613-7_1
M3 - Article in proceedings
AN - SCOPUS:85209579551
SN - 9789819796120
T3 - Communications in Computer and Information Science
SP - 3
EP - 18
BT - Intelligent Systems and Data Science
A2 - Thai-Nghe, Nguyen
A2 - Do, Thanh-Nghi
A2 - Benferhat, Salem
PB - Springer Science+Business Media
T2 - 2nd International Conference on Intelligent Systems and Data Science, ISDS 2024
Y2 - 9 November 2024 through 10 November 2024
ER -