Implementation Of ETL E-Commerce For Customer Clustering Using RFM And K-Means Clustering

  • Farrikh Alzami Universitas Dian Nuswantoro
  • Fikri Diva Sambasri
  • Fikri Diva Sambasri
  • Rifqi Mulya Kiswanto
  • Rama Aria Megantara
  • Ahmad Akrom
  • Ricardus Anggi Pramunendar
  • Dwi Puji Prabowo
  • Puri Sulistiyawati

Abstract

E-commerce is the activity of selling and buying goods through an online system or online. One of the business models in which consumers sell products to other consumers is the Customer to Customer (C2C) business model. One of the things that need to be considered in this business model is knowing the level of customer loyalty. By knowing the level of customer loyalty, the company can provide several different treatments to its customers so that they can maintain good relations with customers and can increase product purchase revenue. In this study, the author wants to segment customers on data in E-commerce companies in Brazil using the K-Means clustering algorithm using the RFM (Recency, Frequency, Monetary) feature. There are also several ETL stages of research that must be carried out, namely taking data from the open public data site (Kaggle), which consist of more than 9 tables (extract), then merging the data to select some data that needs to be used (transform and load), understanding data by displaying it in graphic form, conducting data selection to select features / attributes. which is in accordance with the proposed method, performs data preprocessing, and creates a model to get the cluster. Based on the results of the research that has been done, the number of clusters is 4 clusters with the evaluation value of the model using the silhouette score is 0.470.

Published
2022-12-30
How to Cite
ALZAMI, Farrikh et al. Implementation Of ETL E-Commerce For Customer Clustering Using RFM And K-Means Clustering. Jurnal Ilmiah Merpati (Menara Penelitian Akademika Teknologi Informasi), [S.l.], v. 10, n. 3, p. 167-179, dec. 2022. ISSN 2685-2411. Available at: <https://ojs.unud.ac.id/index.php/merpati/article/view/93895>. Date accessed: 21 nov. 2024. doi: https://doi.org/10.24843/JIM.2022.v10.i03.p05.

Most read articles by the same author(s)

Obs.: This plugin requires at least one statistics/report plugin to be enabled. If your statistics plugins provide more than one metric then please also select a main metric on the admin's site settings page and/or on the journal manager's settings pages.