Comparation of Dice Similarity and Jaccard Coefficience Against Winnowing Algorithm For Similarity Detection of Indonesian Text Documents

Santi Purwaningrum, Agus Susanto, Nur Wachid Adi Prasetya

Abstract


Plagiarism is the act of imitating and quoting and even copying or acknowledging other people's work as one's own work. Plagiarism is currently growing rapidly, especially in the world of education. So that plagiarism detection is needed to prevent plagiarism from growing rapidly. In response to this, this paper intends to conduct research that compares the dice similarity and the jaccard coefficient to find the best document similarity value level against the Winnowing algorithm which functions to find the fingerprint value of each document. The test results show that the winnowing algorithm is quite good at using the dice similarity level with the results of an average similarity value of 71.17615%  than testing using jaccard coefficient with the resulting value 35,58837%.


Full Text:

PDF

References


Stephen Fishman, Public Domain: How To Fid & Use Copyright-Free Writings, Music, Art & More, 4th ed. Berkeley: nolo, 2008.

B. Sari and Y. Sibaroni, “Deteksi Kemiripan Dokumen Bahasa,” vol. 4, pp. 87–98, 2019, doi: 10.21108/indojc.2019.4.3.365.

D. K. Bhattacharyya, “Plagiarism : Taxonomy , Tools and Detection Techniques Plagiarism and Its Types,” 2016.

N. C. Haryanto, L. D. Krisnawati, and A. R. Chrismanto, “Temu Kembali Dokumen Sumber Rujukan dalam Sistem Daur Ulang Teks,” Jurnal Teknologi dan Sistem Komputer, vol. 8, no. 2. pp. 140–149, 2020.

R. K. Wibowo and K. Hastuti, “PENERAPAN ALGORITMA WINNOWING UNTUK MENDETEKSI KEMIRIPAN TEKS PADA TUGAS AKHIR MAHASISWA,” Techno.COM, vol. 15, no. 4, pp. 303–311, 2016.

L. Sibarani, M. Magdalena, and A. Dharma, “Analisa Perbandingan Sistem Pendeteksian Kemiripan Judul Skripsi Menggunakan Algoritma Winnowing Dan Algoritma Rabin Karp,” REMIK (Riset dan E-Jurnal Manaj. Inform. Komputer), vol. 4, no. 1, p. 69, 2019, doi: 10.33395/remik.v4i1.10174.

N. Alamsyah et al., “Deteksi plagiarisme tingkat kemiripan judul skripsi pada fakultas teknologi informasi menggunakan algoritma winnowing,” vol. 10, no. 4, pp. 197–201, 2019.

N. Alamsyah, “Perbandingan Algoritma Winnowing Dengan Algoritma Rabin Karp Untuk Mendeteksi Plagiarisme Pada Kemiripan Teks Judul Skripsi,” Technol. J. Ilm., vol. 8, no. 3, p. 124, 2017, doi: 10.31602/tji.v8i3.1116.

N. P. Putra and Sularno, “Penerapan Algoritma Rabin-Karp Dengan Pendekatan Synonym Recognition Sebagai Antisipasi Plagiarisme Pada Penulisan Skripsi,” J. Teknol. Dan Sist. Inf. Bisnis, vol. 1, no. 2, pp. 49–58, 2019.

TP Vartanian, Secondary data analysis. Oxford University Press: Oxford University Press, 2010.

F. S. Martins, J. A. C. da Cunha, and F. A. R. Serra, “Secondary Data in Research – Uses and Opportunities,” Pod. Sport. Leis. Tour. Rev., vol. 7, no. 3, pp. I–IV, 2018, doi: 10.5585/podium.v7i3.316.

S. Sunardi, A. Yudhana, and I. A. Mukaromah, “Indonesia Words Detection Using Fingerprint Winnowing Algorithm,” J. Inform., vol. 13, no. 1, p. 7, 2019, doi: 10.26555/jifo.v13i1.a8452.

S. Sunardi, A. Yudhana, and I. A. Mukaromah, “Implementasi Deteksi Plagiarisme Menggunakan Metode N-Gram Dan Jaccard Similarity Terhadap Algoritma Winnowing,” Transmisi, vol. 20, no. 3, p. 105, 2018, doi: 10.14710/transmisi.20.3.105-110.

M. Y. Soleh and A. Purwarianti, “A Non Word Error Spell Checker for Indonesian using Morphologically Analyzer and HMM,” no. July, 2011.

F. Amin and E. Winarno, “Rancang Bangun Sistem Temu Kembali Informasi ( Information Retrieval System ) Dokumen Berbahasa Jawa menggunakan Metode DICE Similarity,” vol. 21, no. 2, pp. 99–106, 2016.

D. Gupta and V. K, “Investigating the Impact of Combined Similarity Metrics and POS tagging in Extrinsic Text Plagiarism Detection System Vani,” pp. 1578–1584, 2015.

J. Evan Harya Chandra, V. Christiani M, and D. S.Naga, “Plagiarisme Abstrak Menggunakan Algoritma Winnowing dan Synsets,” J. Ilmu Komput. dan Sist. Inf., pp. 121–129, 2016.

L. J. Purba and L. Sitorus, “Perancangan Aplikasi Untuk Menghitung Persentase Kemiripan Proposal Dan Isi Skripsi Dengan Algoritma Rabin-Karp,” J. Tek. Inform. Unika St. Thomas, vol. 3, no. 1, pp. 17–25, 2018.




DOI: https://doi.org/10.33633/jais.v6i1.4453

Article Metrics

Abstract view : 348 times
PDF - 342 times

Refbacks

  • There are currently no refbacks.


Flag Counter

 

 

 

 

Journal of Applied Intelligent System (e-ISSN : 2502-9401p-ISSN : 2503-0493) is published by Department of Informatics Universitas Dian Nuswantoro Semarang and IndoCEISS.

  

 

Journal of Applied Intelligent System indexed by :


This journal is under licensed of Creative Commons Attribution 4.0 International License.

Visitor Stats