Deteksi Hate Speech pada Unggahan Media Sosial dengan Naive Bayes Menggunakan Seleksi Fitur Chi-Square

  • Putu Steven Belva Chan Udayana University
  • Ida Ayu Gde Suwiprabayanti Putra Universitas Udayana

Abstract

In the digital age, social media's pervasive use has revolutionized global communication but also introduced challenges like hate speech. This study proposes a Multinomial Naive Bayes model optimized with Chi-square feature selection to detect hate speech efficiently from large-scale social media data. Leveraging machine learning, this approach aims to combat harmful content by identifying relevant text features crucial for distinguishing hate speech from non-hate speech. The study utilizes TF-IDF for feature extraction and Chi-square for feature selection, showing significant performance improvements in hate speech detection. The Chi-square feature selection model yielded average precision, recall, F1-score, and accuracy values of 92%, 92%, 91%, and 92% respectively. In contrast, the model without feature selection achieved values of 89%, 89%, 88%, and 89% for the same metrics. Results demonstrate enhanced accuracy, precision, recall, and F1-score across various hate speech categories.


Keywords: Hate Speech, Naive Bayes, TF-IDF, Chi-square


 
Published
2024-11-01
How to Cite
CHAN, Putu Steven Belva; PUTRA, Ida Ayu Gde Suwiprabayanti. Deteksi Hate Speech pada Unggahan Media Sosial dengan Naive Bayes Menggunakan Seleksi Fitur Chi-Square. Jurnal Nasional Teknologi Informasi dan Aplikasnya, [S.l.], v. 3, n. 1, p. 1055-1062, nov. 2024. ISSN 3032-1948. Available at: <https://ojs.unud.ac.id/index.php/jnatia/article/view/115888>. Date accessed: 09 jan. 2025. doi: https://doi.org/10.24843/JNATIA.2024.v03.i01.p20.