Ekstrak Hirarki Data Dari Situs Web A-Z Animals Menggunakan Web Scraping
Abstract
A-Z Animals is a website that presents data about Kingdom Animalia. The Kingdom Animalia data has a hierarchy or level called the taxon level, which starts from kingdom to species. The problems encountered are the data contained on the website can be reuse for other purposes, such as creating dictionaries, learning media and others, but it takes a long time to enter data into the database due to the many and the complexity of the data. The solution of the problem is to create an application that can automatically retrieve data from the website to speed up data collection.Web Scraping is a method to retrieve documents from a website from the internet, in the form of HTML, next analyzed to retrieve certain data from the document. The results of tests sowed applications can retrieve content or data required from the website a-z-animal.com. The application takes an average time to process one page of a-z-animal.com is about 16.13 seconds.
Downloads
References
[2] A. Josi, L. A. Abdillah, and Suryayusra, "Penerapan Teknik Web Scrapping pada Mesin Pencari Artikel Ilmiah," Jurnal Sistem Informasi, vol. 5, no. 2, pp. 159-164, 2014.
[3] I. D. G. W. Dhiyatmika, I. K. D. Putra, and N. M. I. M. Mandenni, "Aplikasi Augmented Reality Magic Book Pengenalan Binatang untuk Siswa TK," Lontar Komputer : Jurnal Ilmiah Teknologi Informasi, vol. 6, no. 2, pp. 120-127, 2015.
[4] Wamiliana, D. Kurniasari, and J. S. Nugraha, "Pembuatan Media Pembelajaran Pengenalan Tata Surya dan Exoplanet Dengan Menggunakan Unity untuk Sekolah Menengah Pertama," Jurnal Komputasi, vol. 1, no. 1, pp. 47-57, 2013.
[5] F. Polidoro, R. Giannini, R. L. Conte, S. Mosca, and F. Rossetti, "Web scraping techniques to collect data on consumer electronics and airfares for Italian HICP compilation," Statistical Journal of the IAOS, pp. 165–176, 2015.
[6] M. A. Pise and P. J. Adhikari, "A Review: Data Extraction from multiple web databases," IJRITCC, vol. 3, no. 10, pp. 5930-5932, 2015.
[7] M. S. Utomo, "Web Scraping pada Situs Wikipedia menggunakan Metode Ekspresi Regular," Jurnal Teknologi Informasi DINAMIK vol. 18, no. 2, pp. 153-160, 2013.
[8] M. A. Ruggiero, D. P. Gordon, T. M. Orrell, N. Bailly, T. Bourgoin, R. C. Brusca, et al., "A Higher Level Classification of All Living Organisms," PLOS ONE, pp. 1-54, 2015.
[9] I. G. B. A. Pinatih, A. A. K. Oka Sudana, and I. K. Adi Purnawan, "E-Banjar Bali, Population Census Management Information System of Banjar in Bali by Using Family Tree Method and Balinese Culture Law," Journal of Theoretical and Applied Information Technology, vol. 59, no. 2, pp. 411-420, 2014.
[10] A. A. K. Oka Sudana, I. W. G. M. Kepakisan, and N. K. D. Rusjayanthi, "Implementation of Tree Structure and Recursive Algorithm for Balinese Traditional Snack Recipe on Android Based Application " International Journal of Interactive Mobile Technologies, vol. 10, no. 4, pp. 43-47, 2016.
[11] I. M. W. Saputra, A. A. K. Oka Sudana, and I. M. Sukarsa, "Implementasi Struktur Data tree pada Sistem Informasi Upacara yadnya Berbasis Android," Lontar Komputer : Jurnal Ilmiah Teknologi Informasi, vol. 2, no. 1, pp. 326-334, 2014.
The Authors submitting a manuscript do so on the understanding that if accepted for publication, the copyright of the article shall be assigned to Jurnal Lontar Komputer as the publisher of the journal. Copyright encompasses exclusive rights to reproduce and deliver the article in all forms and media, as well as translations. The reproduction of any part of this journal (printed or online) will be allowed only with written permission from Jurnal Lontar Komputer. The Editorial Board of Jurnal Lontar Komputer makes every effort to ensure that no wrong or misleading data, opinions, or statements be published in the journal.
This work is licensed under a Creative Commons Attribution 4.0 International License.