Metode ROBPCA (Robust Principal Component Analysis) dan Clara (Clustering Large Area) pada Data dengan Outlier

Studi Kasus Data Laporan Indeks Kebahagiaan Dunia Tahun 2018

  • Bekti Endar Susilowati Badan Pusat Statistik Kabupaten Sleman, Yogyakarta
  • Pardomuan Robinson Sihombing Badan Pusat Statistik, Unpad
##plugins.pubIds.doi.readerDisplayName## https://doi.org/10.24843/JIK.2020.v13.i02.p04

Abstrak

Principal Component Analysis (PCA) is one of multivariate analysis used for deputizing variables using less number of Principal Components without losing much information. In other words, it is used for explaining the underlying variance-covariance structure of the large data set of variables through a few linear combinations of these variables. PCA is significantly influenced by the outliers, since the covariant matrix are sensitive to outliers. Thus, the analysis for this study was conducted by using a PCA that is robust to outliers, namely ROBPCA or Hubert PCA. Then, the principal components formed were used as inputs in cluster analysis using the Clara method (Clustering Large Area). Clustering Large Area is one of the k-medoids methods that is robust to outliers and is appropriate for large data analysis. In the case study of the compiling variables of happiness index based on The World Happiness Report (WHR)2018 using the Clara method with Manhattan distance, the best average value of Overall Average Silhouette Width in the 5 clusters were obtained.

##plugins.generic.usageStats.downloads##

##plugins.generic.usageStats.noStats##
Diterbitkan
2020-09-28
##submission.howToCite##
SUSILOWATI, Bekti Endar; SIHOMBING, Pardomuan Robinson. Metode ROBPCA (Robust Principal Component Analysis) dan Clara (Clustering Large Area) pada Data dengan Outlier. Jurnal Ilmu Komputer, [S.l.], v. 13, n. 2, p. 11, sep. 2020. ISSN 2622-321X. Tersedia pada: <https://ojs.unud.ac.id/index.php/jik/article/view/61980>. Tanggal Akses: 26 apr. 2025 doi: https://doi.org/10.24843/JIK.2020.v13.i02.p04.