Comparison of K-Nearest Neighbor and Modified K-Nearest Neighbor with Mutual Information and Gini Index Feature Selection in Informatics Journal Classification
Abstract
With the rapid development of informatics, thousands of informatics journals have been published, creating a new problem: grouping these journals manually has become too difficult and expensive. The author proposes text classification for grouping these informatics journals. This research examines the combinations of two machine learning methods, K-Nearest Neighbors (KNN) and Modified K-Nearest Neighbors (MKNN), with two feature selection methods, Gini Index (GI) and Mutual Information (MI), to determine which model produces the highest evaluation score. The data are informatics journals stored in PDF files, each given one of three designated labels: Information Retrieval, Database, or Others. A total of 252 documents were collected from the websites neliti.com and garuda.ristekbrin.go.id. This research compares how well KNN and MKNN classify informatics journals and determines which combination of parameters and feature selection produces the best result. The combination of method and feature selection that produces the best evaluation score is MKNN with GI as feature selection, yielding precision, recall, and F1-scores of 97.7%.
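As an illustration of the kind of pipeline the abstract describes, the following is a minimal sketch of text classification with Mutual Information feature selection and standard KNN using scikit-learn. It is not the paper's implementation: the MKNN variant and Gini Index selection are not provided by scikit-learn and are not shown here, and the documents and labels below are hypothetical placeholders rather than the collected dataset.

```python
# Illustrative baseline only: TF-IDF features, Mutual Information feature
# selection, and a standard KNN classifier evaluated with precision,
# recall, and F1. MKNN and Gini Index selection are not implemented here.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.feature_selection import SelectKBest, mutual_info_classif
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import classification_report

# Hypothetical abstracts, standing in for text extracted from the PDF files.
docs = [
    "query expansion improves document ranking in retrieval systems",
    "inverted index structures speed up full text search",
    "relational database normalization reduces redundancy",
    "sql query optimization in distributed database engines",
    "mobile application usability study for e-government services",
    "survey of agile software development practices",
]
labels = ["Information Retrieval", "Information Retrieval",
          "Database", "Database", "Others", "Others"]

# 1) Represent each document as a TF-IDF weighted term vector.
vectorizer = TfidfVectorizer()
X = vectorizer.fit_transform(docs)

# 2) Keep the terms with the highest mutual information with the labels.
k_features = min(20, X.shape[1])
selector = SelectKBest(mutual_info_classif, k=k_features)
X_selected = selector.fit_transform(X, labels)

# 3) Fit a KNN classifier and report precision, recall, and F1
#    (a real experiment would evaluate on a held-out test split).
knn = KNeighborsClassifier(n_neighbors=3)
knn.fit(X_selected, labels)
pred = knn.predict(X_selected)
print(classification_report(labels, pred, zero_division=0))
```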