Implementation Equal-Width Interval Discretization in Naive Bayes Method for Increasing Accuracy of Students' Majors Prediction

  • Alfa Saleh Universitas Potensi Utama
  • Fina Nasari Faculty of Computer Science and Engineering, Potensi Utama University

Abstract

The Selection of majors for students is a positive step that is done to focus students in accordance with their potential, it is considered important because with the majors, students are expected to develop academic ability according to the field of interest. In previous research, Naive Bayes method has been tested to classify the students department based on the criteria that support the case study on Private Madrasah Aliyah PAB 6 Helvetia students and the accuracy of the test from 100 student data is 90%. in this study, the researcher developed a previously used method by applying an equal-width interval discretization that would transform numerical or continuous criteria into a categorical criteria with a predetermined k value, different k values ??would be tested to find the best accuracy value. from the 120-student data that have been tested, it is proved that the result of the classification of the application of equal-width interval discretization on the Naive Bayes method with the value of k = 8 is better and increased the accuracy value 91.7% to 93.3%.

Downloads

Download data is not yet available.

References

[1] A. Saleh, “KLASIFIKASI METODE NAIVE BAYES DALAM DATA MINING UNTUK MENENTUKAN KONSENTRASI SISWA ( STUDI KASUS DI MAS PAB 2 MEDAN),” in Konferensi Nasional Pengembangan Teknologi Informasi dan Komunikasi (KeTIK) 2014, 2014, pp. 200–207.
[2] X. Zhou, S. Wang, W. Xu, G. Ji, P. Phillips, P. Sun, and Y. Zhang, “Detection of Pathological Brain in MRI Scanning Based on Wavelet-Entropy and Naive Bayes Classifier,” Springer, Cham, 2015, pp. 201–209.
[3] L. Jiang, C. Li, S. Wang, and L. Zhang, “Deep feature weighting for naive Bayes and its application to text classification,” Engineering Application of Artificial Inteligence, vol. 52, pp. 26–39, Jun. 2016.
[4] B. Tang, S. Kay, and H. He, “Toward Optimal Feature Selection in Naive Bayes for Text Categorization,” Feb. 2016.
[5] A. M. P. and D. S. R., “A sequential naïve Bayes classifier for DNA barcodes,” Stat. Appl. Genet. Mol. Biol., vol. 13, no. 4, pp. 1–12, 2014.
[6] J. Wu, S. Pan, X. Zhu, Z. Cai, P. Zhang, and C. Zhang, “Self-adaptive attribute weighting for Naive Bayes classification,” Expert Systems With Application, vol. 42, no. 3, pp. 1487–1502, Feb. 2015.
[7] N. Mohamad, N. Jusoh, Z. Htike, and S. Win, “Bacteria identification from microscopic morphology using naive bayes,” International Journal of Computer Science, Engineering and Information Technology (IJCSEIT ), vol. 4, no. 2, pp. 1–9, 2014.
[8] Y. Zhang, S. Wang, P. Phillips, and G. Ji, “Binary PSO with mutation operator for feature selection using decision tree applied to spam detection,” Knowledge-Based Systems, vol. 64, pp. 22–31, Jul. 2014.
[9] S. Kotsiantis, “Integrating Global and Local Application of Naive Bayes Classifier.,” Inter-national Arab Journal of Information Technology, vol. 11, no. 3, pp. 300–307, 2014.
[10] S. Palaniappan and T. Kim Hong, “Discretization of continuous valued dimensions in OLAP data cubes,” International Journal of Computer Science and Network Security, vol. 8, no. 11, pp. 116–126, 2008.
[11] I. Kareem and M. Duaimi, “Improved accuracy for decision tree algorithm based on unsupervised discretization,” International Journal of Computer Science and Mobile Computing, vol. 3, no. 6, pp. 176–183, 2014.
[12] G. Forman, “An Extensive Empirical Study of Feature Selection Metrics for Text Classification,” The Journal of Machine Learning Research, vol. 3, no. Mar, pp. 1289–1305, 2003.
[13] Y. Yang and J. Pedersen, “A comparative study on feature selection in text categorization,” in 14th International Conference on Machine Learning, 1997, pp. 412–420.
[14] A. Genkin, D. D. Lewis, and D. Madigan, “Large-Scale Bayesian Logistic Regression for Text Categorization,” Technometrics, vol. 49, no. 3, pp. 291–304, Aug. 2007.
[15] B. Tang and H. He, “ENN: Extended Nearest Neighbor Method for Pattern Recognition [Research Frontier],” IEEE Computational Intelligence Magazine, vol. 10, no. 3, pp. 52–60, Aug. 2015.
[16] A. Saleh, “Implementasi Metode Klasifikasi Naïve Bayes Dalam Memprediksi Besarnya Penggunaan Listrik Rumah Tangga,” Creat. Inf. Technol. J., vol. 2, no. 3, pp. 207–217, 2015.
[17] A. Al-Ibrahim, “Discretization of Continuous Attributes in Supervised Learning algorithms,” Res. Bull. Jordan ACM, vol. 2, no. 4, pp. 158–166, 2011.
[18] D. Joiţa, “Unsupervised static discretization methods in data mining,” Titu Maiorescu University, 2010.
Published
2018-09-15
How to Cite
SALEH, Alfa; NASARI, Fina. Implementation Equal-Width Interval Discretization in Naive Bayes Method for Increasing Accuracy of Students' Majors Prediction. Lontar Komputer : Jurnal Ilmiah Teknologi Informasi, [S.l.], p. 104-113, sep. 2018. ISSN 2541-5832. Available at: <https://ojs.unud.ac.id/index.php/lontar/article/view/38405>. Date accessed: 23 apr. 2024. doi: https://doi.org/10.24843/LKJITI.2018.v09.i02.p05.
Section
Articles