%A Jamal, Ade %A Handayani, Annisa %A Septiandri, Ali Akbar %A Ripmiatin, Endang %A Effendi, Yunus %D 2018 %T Dimensionality Reduction using PCA and K-Means Clustering for Breast Cancer Prediction %K %X Breast cancer is the most important cause of death among women. A prediction of breast cancer in early stage provides a greater possibility of its cure. It needs a breast cancer prediction tool that can classify a breast tumor whether it was a harmful malignant tumor or un-harmful benign tumor. In this paper, two algorithms of machine learning, namely Support Vector Machine and Extreme Gradient Boosting technique will be compared for classification purpose. Prior to the classification, the number of data attribute will be reduced from the raw data by extracting features using Principal Component Analysis. A clustering method, namely K-Means is also used for dimensionality reduction besides the Principal Component Analysis. This paper will present a comparison among four models based on two dimensionality reduction methods combined with two classifiers which applied on Wisconsin Breast Cancer Dataset. The comparison will be measured by using accuracy, sensitivity and specificity metrics evaluated from the confusion matrices. The experimental results have indicated that the K-Means method, which is not usually used for dimensionality reduction can perform well compared to the popular Principal Component Analysis . %U https://ojs.unud.ac.id/index.php/lontar/article/view/42796 %J Lontar Komputer : Jurnal Ilmiah Teknologi Informasi %0 Journal Article %R 10.24843/LKJITI.2018.v09.i03.p08 %P 192-201%@ 2541-5832 %8 2018-12-22