Comparation of Dice Similarity and Jaccard Coefficience Against Winnowing Algorithm For Similarity Detection of Indonesian Text Documents
DOI: https://doi.org/10.33633/jais.v6i1.4453
Abstract
Plagiarism is the act of imitating, quoting, or copying another person's work and claiming it as one's own. It is growing rapidly, especially in education, so plagiarism detection is needed to curb it. This paper compares Dice similarity and the Jaccard coefficient to determine which yields the better document similarity score when applied to the fingerprints that the Winnowing algorithm extracts from each document. The test results show that the Winnowing algorithm performs better with Dice similarity, which gives an average similarity value of 71.17615%, than with the Jaccard coefficient, which gives 35.58837%.
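The comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the k-gram length `k`, the window size `window`, the MD5-based hash, and the text normalisation are all assumptions chosen for illustration, and the function names are hypothetical. Fingerprints are kept as a set of hash values, so the position bookkeeping of the full Winnowing algorithm is omitted.

```python
import hashlib


def kgram_hashes(text: str, k: int) -> list[int]:
    """Hash every k-gram of the normalised text (lowercase, alphanumerics only)."""
    cleaned = "".join(ch for ch in text.lower() if ch.isalnum())
    return [
        # Assumption: a 32-bit slice of MD5 stands in for the rolling hash.
        int.from_bytes(hashlib.md5(cleaned[i:i + k].encode("utf-8")).digest()[:4], "big")
        for i in range(len(cleaned) - k + 1)
    ]


def winnow(text: str, k: int = 5, window: int = 4) -> set[int]:
    """Return the fingerprint set: the minimum hash of each sliding window."""
    hashes = kgram_hashes(text, k)
    if not hashes:
        return set()
    if len(hashes) < window:
        return {min(hashes)}
    return {min(hashes[i:i + window]) for i in range(len(hashes) - window + 1)}


def jaccard(a: set[int], b: set[int]) -> float:
    """Jaccard coefficient: |A ∩ B| / |A ∪ B|."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def dice(a: set[int], b: set[int]) -> float:
    """Dice similarity: 2|A ∩ B| / (|A| + |B|)."""
    total = len(a) + len(b)
    return 2 * len(a & b) / total if total else 0.0


if __name__ == "__main__":
    doc_a = "Plagiarisme adalah tindakan meniru karya orang lain"
    doc_b = "Plagiarisme merupakan tindakan menyalin karya orang lain"
    fp_a, fp_b = winnow(doc_a), winnow(doc_b)
    print(f"Jaccard: {jaccard(fp_a, fp_b):.4f}")
    print(f"Dice:    {dice(fp_a, fp_b):.4f}")
```

With these standard set-based definitions, Dice equals 2J / (1 + J) for a Jaccard value J, so the Dice score is never lower than the Jaccard score on the same pair of fingerprint sets. The two measures therefore rank document pairs identically and differ only in scale.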
License
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).