Customer Segmentation Based on RFM Model Using K-Means, K-Medoids, and DBSCAN Methods

A problem that appears in marketing activities is how to identify potential customers. Marketing activities could identify their best customer through customer segmentation by applying the concept of Data Mining and Customer Relationship Management (CRM). This paper presents the Data Mining process...

Full description

Bibliographic Details
Main Authors: Rahma Wati Sembiring Brahmana, Fahd Agodzo Mohammed, Kankamol Chairuang
Format: Article
Language:Indonesian
Published: Universitas Udayana 2020-04-01
Series:Lontar Komputer
Online Access:https://ojs.unud.ac.id/index.php/lontar/article/view/58025
id doaj-5381368098614ad7b07d8d15175bea1e
record_format Article
spelling doaj-5381368098614ad7b07d8d15175bea1e2020-11-25T03:33:49ZindUniversitas UdayanaLontar Komputer2088-15412541-58322020-04-01111324310.24843/LKJITI.2020.v11.i01.p0458025Customer Segmentation Based on RFM Model Using K-Means, K-Medoids, and DBSCAN MethodsRahma Wati Sembiring Brahmana0Fahd Agodzo Mohammed1Kankamol Chairuang2Department of Information Technology, Faculty of Engineering, Udayana UniversityDepartment of Computer Engineering, Chandigarh UniversityDepartment of Business Administration, Chandigarh UniversityA problem that appears in marketing activities is how to identify potential customers. Marketing activities could identify their best customer through customer segmentation by applying the concept of Data Mining and Customer Relationship Management (CRM). This paper presents the Data Mining process by combining the RFM model with K-Means, K-Medoids, and DBSCAN algorithms. This paper analyzes 334,641 transaction data and converts them to 1661 Recency, Frequency, and Monetary (RFM) data lines to identify potential customers. The K-Means, K-Medoids, and DBSCAN algorithms are very sensitive for initializing the cluster center because it is done randomly. Clustering is done by using two to six clusters. The trial process in the K-Means and K-Medoids Method is done using random centroid values ??and at DBSCAN is done using random Epsilon and Min Points, so that a cluster group is obtained that produces potential customers. Cluster validation completes using the Davies-Bouldin Index and Silhouette Index methods. The result showed that K-Means had the best level of validity than K-Medoids and DBSCAN, where the Davies-Bouldin Index yield was 0,33009058, and the Silhouette Index yield was 0,912671056. The best number of clusters produced using the Davies Bouldin Index and Silhouette Index are 2 clusters, where each K-Means, K-Medoids, and DBSCAN algorithms provide the Dormant and Golden customer classes.https://ojs.unud.ac.id/index.php/lontar/article/view/58025
collection DOAJ
language Indonesian
format Article
sources DOAJ
author Rahma Wati Sembiring Brahmana
Fahd Agodzo Mohammed
Kankamol Chairuang
spellingShingle Rahma Wati Sembiring Brahmana
Fahd Agodzo Mohammed
Kankamol Chairuang
Customer Segmentation Based on RFM Model Using K-Means, K-Medoids, and DBSCAN Methods
Lontar Komputer
author_facet Rahma Wati Sembiring Brahmana
Fahd Agodzo Mohammed
Kankamol Chairuang
author_sort Rahma Wati Sembiring Brahmana
title Customer Segmentation Based on RFM Model Using K-Means, K-Medoids, and DBSCAN Methods
title_short Customer Segmentation Based on RFM Model Using K-Means, K-Medoids, and DBSCAN Methods
title_full Customer Segmentation Based on RFM Model Using K-Means, K-Medoids, and DBSCAN Methods
title_fullStr Customer Segmentation Based on RFM Model Using K-Means, K-Medoids, and DBSCAN Methods
title_full_unstemmed Customer Segmentation Based on RFM Model Using K-Means, K-Medoids, and DBSCAN Methods
title_sort customer segmentation based on rfm model using k-means, k-medoids, and dbscan methods
publisher Universitas Udayana
series Lontar Komputer
issn 2088-1541
2541-5832
publishDate 2020-04-01
description A problem that appears in marketing activities is how to identify potential customers. Marketing activities could identify their best customer through customer segmentation by applying the concept of Data Mining and Customer Relationship Management (CRM). This paper presents the Data Mining process by combining the RFM model with K-Means, K-Medoids, and DBSCAN algorithms. This paper analyzes 334,641 transaction data and converts them to 1661 Recency, Frequency, and Monetary (RFM) data lines to identify potential customers. The K-Means, K-Medoids, and DBSCAN algorithms are very sensitive for initializing the cluster center because it is done randomly. Clustering is done by using two to six clusters. The trial process in the K-Means and K-Medoids Method is done using random centroid values ??and at DBSCAN is done using random Epsilon and Min Points, so that a cluster group is obtained that produces potential customers. Cluster validation completes using the Davies-Bouldin Index and Silhouette Index methods. The result showed that K-Means had the best level of validity than K-Medoids and DBSCAN, where the Davies-Bouldin Index yield was 0,33009058, and the Silhouette Index yield was 0,912671056. The best number of clusters produced using the Davies Bouldin Index and Silhouette Index are 2 clusters, where each K-Means, K-Medoids, and DBSCAN algorithms provide the Dormant and Golden customer classes.
url https://ojs.unud.ac.id/index.php/lontar/article/view/58025
work_keys_str_mv AT rahmawatisembiringbrahmana customersegmentationbasedonrfmmodelusingkmeanskmedoidsanddbscanmethods
AT fahdagodzomohammed customersegmentationbasedonrfmmodelusingkmeanskmedoidsanddbscanmethods
AT kankamolchairuang customersegmentationbasedonrfmmodelusingkmeanskmedoidsanddbscanmethods
_version_ 1724561398681305088