Customer Segmentation Based on RFM Model Using K-Means, K-Medoids, and DBSCAN Methods
A problem that appears in marketing activities is how to identify potential customers. Marketing activities could identify their best customer through customer segmentation by applying the concept of Data Mining and Customer Relationship Management (CRM). This paper presents the Data Mining process...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | Indonesian |
Published: |
Universitas Udayana
2020-04-01
|
Series: | Lontar Komputer |
Online Access: | https://ojs.unud.ac.id/index.php/lontar/article/view/58025 |
id |
doaj-5381368098614ad7b07d8d15175bea1e |
---|---|
record_format |
Article |
spelling |
doaj-5381368098614ad7b07d8d15175bea1e2020-11-25T03:33:49ZindUniversitas UdayanaLontar Komputer2088-15412541-58322020-04-01111324310.24843/LKJITI.2020.v11.i01.p0458025Customer Segmentation Based on RFM Model Using K-Means, K-Medoids, and DBSCAN MethodsRahma Wati Sembiring Brahmana0Fahd Agodzo Mohammed1Kankamol Chairuang2Department of Information Technology, Faculty of Engineering, Udayana UniversityDepartment of Computer Engineering, Chandigarh UniversityDepartment of Business Administration, Chandigarh UniversityA problem that appears in marketing activities is how to identify potential customers. Marketing activities could identify their best customer through customer segmentation by applying the concept of Data Mining and Customer Relationship Management (CRM). This paper presents the Data Mining process by combining the RFM model with K-Means, K-Medoids, and DBSCAN algorithms. This paper analyzes 334,641 transaction data and converts them to 1661 Recency, Frequency, and Monetary (RFM) data lines to identify potential customers. The K-Means, K-Medoids, and DBSCAN algorithms are very sensitive for initializing the cluster center because it is done randomly. Clustering is done by using two to six clusters. The trial process in the K-Means and K-Medoids Method is done using random centroid values ??and at DBSCAN is done using random Epsilon and Min Points, so that a cluster group is obtained that produces potential customers. Cluster validation completes using the Davies-Bouldin Index and Silhouette Index methods. The result showed that K-Means had the best level of validity than K-Medoids and DBSCAN, where the Davies-Bouldin Index yield was 0,33009058, and the Silhouette Index yield was 0,912671056. The best number of clusters produced using the Davies Bouldin Index and Silhouette Index are 2 clusters, where each K-Means, K-Medoids, and DBSCAN algorithms provide the Dormant and Golden customer classes.https://ojs.unud.ac.id/index.php/lontar/article/view/58025 |
collection |
DOAJ |
language |
Indonesian |
format |
Article |
sources |
DOAJ |
author |
Rahma Wati Sembiring Brahmana Fahd Agodzo Mohammed Kankamol Chairuang |
spellingShingle |
Rahma Wati Sembiring Brahmana Fahd Agodzo Mohammed Kankamol Chairuang Customer Segmentation Based on RFM Model Using K-Means, K-Medoids, and DBSCAN Methods Lontar Komputer |
author_facet |
Rahma Wati Sembiring Brahmana Fahd Agodzo Mohammed Kankamol Chairuang |
author_sort |
Rahma Wati Sembiring Brahmana |
title |
Customer Segmentation Based on RFM Model Using K-Means, K-Medoids, and DBSCAN Methods |
title_short |
Customer Segmentation Based on RFM Model Using K-Means, K-Medoids, and DBSCAN Methods |
title_full |
Customer Segmentation Based on RFM Model Using K-Means, K-Medoids, and DBSCAN Methods |
title_fullStr |
Customer Segmentation Based on RFM Model Using K-Means, K-Medoids, and DBSCAN Methods |
title_full_unstemmed |
Customer Segmentation Based on RFM Model Using K-Means, K-Medoids, and DBSCAN Methods |
title_sort |
customer segmentation based on rfm model using k-means, k-medoids, and dbscan methods |
publisher |
Universitas Udayana |
series |
Lontar Komputer |
issn |
2088-1541 2541-5832 |
publishDate |
2020-04-01 |
description |
A problem that appears in marketing activities is how to identify potential customers. Marketing activities could identify their best customer through customer segmentation by applying the concept of Data Mining and Customer Relationship Management (CRM). This paper presents the Data Mining process by combining the RFM model with K-Means, K-Medoids, and DBSCAN algorithms. This paper analyzes 334,641 transaction data and converts them to 1661 Recency, Frequency, and Monetary (RFM) data lines to identify potential customers. The K-Means, K-Medoids, and DBSCAN algorithms are very sensitive for initializing the cluster center because it is done randomly. Clustering is done by using two to six clusters. The trial process in the K-Means and K-Medoids Method is done using random centroid values ??and at DBSCAN is done using random Epsilon and Min Points, so that a cluster group is obtained that produces potential customers. Cluster validation completes using the Davies-Bouldin Index and Silhouette Index methods. The result showed that K-Means had the best level of validity than K-Medoids and DBSCAN, where the Davies-Bouldin Index yield was 0,33009058, and the Silhouette Index yield was 0,912671056. The best number of clusters produced using the Davies Bouldin Index and Silhouette Index are 2 clusters, where each K-Means, K-Medoids, and DBSCAN algorithms provide the Dormant and Golden customer classes. |
url |
https://ojs.unud.ac.id/index.php/lontar/article/view/58025 |
work_keys_str_mv |
AT rahmawatisembiringbrahmana customersegmentationbasedonrfmmodelusingkmeanskmedoidsanddbscanmethods AT fahdagodzomohammed customersegmentationbasedonrfmmodelusingkmeanskmedoidsanddbscanmethods AT kankamolchairuang customersegmentationbasedonrfmmodelusingkmeanskmedoidsanddbscanmethods |
_version_ |
1724561398681305088 |