Implementation of Document Clustering in Online News Using K-Means Clustering

  • Yogiswara Dharma Putra, Department of Electrical and Computer Engineering, Post Graduate Program, Udayana University
  • Ni Wayan Sri Ariyani, Department of Electrical and Computer Engineering, Post Graduate Program, Udayana University
  • Ida Bagus Alit Swamardika, Department of Electrical and Computer Engineering, Post Graduate Program, Udayana University


The development of technology has caused an explosion in the number of news documents available on the internet, so clustering is needed to divide these documents into groups that match the categories of online news. Document clustering can increase the effectiveness of information retrieval, following the hypothesis that relevant documents tend to fall in the same cluster once a collection of documents has been clustered. This research applies clustering to online news about Covid-19 taken from three online news websites and stored in XML files. K-Means clustering is used to group the online news with an open-source application, Carrot2 Workbench, which generated nine clusters for the query "Covid-19" entered in the Carrot2 application.
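To make the grouping step concrete, the following is a minimal Python sketch of K-Means applied to short news texts. It is an illustration only, not the Carrot2 Workbench implementation used in this research. The function names (`tfidf_vectors`, `sq_dist`, `kmeans`), the whitespace tokenizer, the smoothed TF-IDF weighting, the farthest-first choice of initial centroids, and the four sample headlines are all assumptions made for the example.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Convert raw strings into L2-normalised TF-IDF vectors (hypothetical helper)."""
    tokenised = [doc.lower().split() for doc in docs]
    vocab = sorted({t for tokens in tokenised for t in tokens})
    # Document frequency: number of documents that contain each term.
    df = Counter(t for tokens in tokenised for t in set(tokens))
    n = len(docs)
    vectors = []
    for tokens in tokenised:
        tf = Counter(tokens)
        # Smoothed IDF keeps every weight positive (an assumed weighting scheme).
        vec = [tf[t] * (math.log((1 + n) / (1 + df[t])) + 1) for t in vocab]
        norm = math.sqrt(sum(w * w for w in vec)) or 1.0
        vectors.append([w / norm for w in vec])
    return vectors


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(vectors, k, max_iter=100):
    """Plain K-Means; returns one cluster label per input vector."""
    # Assumption: deterministic farthest-first initialisation instead of random seeds.
    centroids = [vectors[0][:]]
    while len(centroids) < k:
        farthest = max(vectors, key=lambda v: min(sq_dist(v, c) for c in centroids))
        centroids.append(farthest[:])

    labels = None
    for _ in range(max_iter):
        # Assignment step: each document joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda j: sq_dist(v, centroids[j])) for v in vectors
        ]
        if new_labels == labels:
            break  # No assignment changed, so the algorithm has converged.
        labels = new_labels
        # Update step: each centroid becomes the mean of its members.
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels


# Invented sample headlines, used only to exercise the sketch.
docs = [
    "covid vaccine rollout begins in hospitals",
    "hospitals report covid vaccine shortage",
    "football club wins league title",
    "league title race football final",
]
labels = kmeans(tfidf_vectors(docs), k=2)
print(labels)
```

In this sketch the two Covid-19 headlines end up in one cluster and the two football headlines in the other, because each pair shares terms that the other pair lacks. A real run over news stored in XML files would first need parsing and preprocessing steps that are not shown here.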


