Sentiment Analysis on Indonesia Twitter Data Using Naïve Bayes and K-Means Method

Authors

  • Ajib Susanto Universitas Dian Nuswantoro Semarang http://orcid.org/0000-0001-5880-6072
  • Muhammad Atho’il Maula Universitas Dian Nuswantoro Semarang
  • Ibnu Utomo Wahyu Mulyono Universitas Dian Nuswantoro Semarang
  • Md Kamruzzaman Sarker Kansas State University

DOI:

https://doi.org/10.33633/jais.v6i1.4465

Abstract

This study focuses on the analysis of sentiments on Indonesian twitter data. Twitter data on Indonesian simultaneous pilkada used to get its sentiments using Naïve Bayes Classifier method as a method of classification and K-means method to get Label on the data train process. Combining the two methods is expected to get high accuracy results. The results obtained from the research shows a pretty good accuracy of 74.5%.

Author Biographies

Ajib Susanto, Universitas Dian Nuswantoro Semarang

Dosen Teknik Informatika S1Fakultas Ilmu KomputerUniversitas Dian Nuswantoro Semarang

Muhammad Atho’il Maula, Universitas Dian Nuswantoro Semarang

Teknik Informatika S1Fakultas Ilmu KomputerUniversitas Dian Nuswantoro Semarang

Ibnu Utomo Wahyu Mulyono, Universitas Dian Nuswantoro Semarang

Dosen Teknik Informatika S1Fakultas Ilmu KomputerUniversitas Dian Nuswantoro Semarang

References

C. Fiarni, H. Maharani, and R. Pratama, “Sentiment analysis system for Indonesia online retail shop review using hierarchy Naive Bayes technique,” 2016 4th Int. Conf. Inf. Commun. Technol. ICoICT 2016, no. May 2016, 2016.

B. A. Sevsa and M. D. R Wahyudi, “Analisis Sentimen pada Indeks Kinerja Dosen Fakultas SAINTEK UIN Sunan Kalijaga Menggunakan Naive Bayes Classifier,” J. Buana Inform., vol. 10, no. 2, p. 112, 2019.

L. Wikarsa and S. N. Thahir, “A text mining application of emotion classifications of Twitter’s users using Naïve Bayes method,” Proceeding 2015 1st Int. Conf. Wirel. Telemat. ICWT 2015, no. November 2015, 2016.

G. Septian, A. Susanto, and G. F. Shidik, “Indonesian news classification based on NaBaNA,” in Proceedings - 2017 International Seminar on Application for Technology of Information and Communication: Empowering Technology for a Better Human Life, iSemantic 2017, 2018, vol. 2018-Janua.

A. N. Ulfah, “Analisis Kinerja Algoritma Fuzzy C-Means Dak-Mean Pada Data Kemiskinann,” Skripsi, vol. 1, no. 2, 2014.

D. H. Wahid and A. SN, “Peringkasan Sentimen Esktraktif di Twitter Menggunakan Hybrid TF-IDF dan Cosine Similarity,” IJCCS (Indonesian J. Comput. Cybern. Syst., vol. 10, no. 2, p. 207, 2016.

R. Maskat and N. Abdul Rahman, “Categorization of malay social media text and normalization of spelling variations and vowel-less words,” Int. J. Adv. Sci. Eng. Inf. Technol., vol. 10, no. 4, pp. 1380–1386, 2020.

A. F. Hidayatullah, “The influence of stemming on Indonesian tweet sentiment analysis,” in International Conference on Electrical Engineering, Computer Science and Informatics (EECSI), 2015.

M. Mardiani, “Perbandingan Algoritma K-Means dan EM untuk Clusterisasi Nilai Mahasiswa Berdasarkan Asal Sekolah,” Creat. Inf. Technol. J., vol. 1, no. 4, p. 316, 2015.

S. Fajar Rodiyansyah and E. Winarko, “Klasifikasi Posting Twitter Kemacetan Lalu Lintas Kota Bandung Menggunakan Naive Bayesian Classification,” IJCCS (Indonesian J. Comput. Cybern. Syst., 2013.

Downloads

Published

2021-05-10