Cross-Language Text Document Plagiarism Detection System Using Winnowing Method
DOI:
https://doi.org/10.33633/jais.v7i1.5950Abstract
Currently, there are many text documents such as journals scattered on the internet, both Indonesian and English-language journals. With this, it is possible to act plagiarism by copying from foreign journals that are translated into other languages or copying directly without being changed from the original language. One way that can suppress these actions is to build a plagiarism detection system for cross-language text documents. The method that can be used to detect document plagiarism is the Winnowing method. Winnowing method is a method where text input will be processed to produce a hash value called a fingerprint. This study aims to build a system that can detect plagiarism of text documents in different languages using the Winnowing method. Text documents that can be tested are input text and PDF files. Documents used in system testing are journals that have the same topic. The results of the highest level of accuracy produced between the calculation of the Jaccard Coefficient with the Plagiarism Checker X application are in the fourth scenario with an average percentage value of 84.7%.References
B. Agarwal, “Cross-lingual plagiarism detection techniques for English-Hindi language pairs,” Journal of Discrete Mathematical Sciences and Cryptography, vol. 22, no. 4, pp. 679–686, May 2019, doi: 10.1080/09720529.2019.1642626.
A. A. Putri Ratna, F. Astha Ekadiyanto, I. Ibrahim, D. Husna, and F. Rahimullah, “Investigating Parallelization of Cross-language Plagiarism Detection System Using The Winnowing Algorithm in Cloud Based Implementation,” in 2019 IEEE 10th International Conference on Awareness Science and Technology (iCAST), Oct. 2019, pp. 1–7. doi: 10.1109/ICAwST.2019.8923539.
I. Ilham and P. Pasnur, “Penerapan Algoritma Winnowing Untuk Mendeteksi Kemiripan Pada Karya Tulis Mahasiswa,” Inspiration : Jurnal Teknologi Informasi dan Komunikasi, vol. 7, no. 2, Dec. 2017, doi: 10.35585/inspir.v7i2.2447.
I. W. S. Priantara, D. Purwitasari, and U. L. Yuhana, “Implementasi deteksi penjiplakan dengan algoritma winnowing pada dokumen terkelompok,” Surabaya, 2011.
S. Sugiono, H. Herwin, H. Hamdani, and E. Erlin, “Aplikasi Pendeteksi Tingkat Kesamaan Dokumen Teks: Algoritma Rabin Karp Vs. Winnowing,” Digital Zone: Jurnal Teknologi Informasi dan Komunikasi, vol. 9, no. 1, pp. 82–93, May 2018, doi: 10.31849/digitalzone.v9i1.1242.
Sunardi, A. Yudhana, and I. A. Mukaromah, “PERANCANGAN APLIKASI DETEKSI PLAGIARISME KARYA ILMIAH MENGGUNAKAN ALGORITMA WINNOWING,” 2017.
S. N. Lolyta, R. Y. Dillak, and F. E. Laumal, “Sistem deteksi plagiarisme lintas bahasa menggunakan algoritma tf-idf,” Jurnal Ilmiah Flash, vol. 5, no. 1, Jun. 2019.
D. Leman, M. Rahman, F. Ikorasaki, B. S. Riza, and M. B. Akbbar, “Rabin Karp And Winnowing Algorithm For Statistics Of Text Document Plagiarism Detection,” in 2019 7th International Conference on Cyber and IT Service Management (CITSM), Nov. 2019, pp. 1–5. doi: 10.1109/CITSM47753.2019.8965422.
Pardede Jasman and Alvian Leo, “RANCANG BANGUN APLIKASI PENDETEKSI PLAGIARISME MENGGUNAKAI ALGORITMA SHERLOCK,” Jurnl Informatika, vol. 6, no. 1, pp. 39–49, Jan. 2015.
D. Susanto, A. Basuki, and P. Duanda, “Deteksi Plagiat Dokumen Tugas Daring Laporan Praktikum Mata Kuliah Desain Web Menggunakan Metode Naive Bayes,” Nusantara Journal of Computersand its Applications, vol. 2, no. 1, Dec. 2016.
A. Indriani, A. Dahlan Jl Soepomo, and S. Janturan, ANALISA KOREKSI KATA SOAL UJIAN SEMESTER DENGAN ALGORITMA LEVENSHTEIN DISTANCE. 2018.
T. Aprilianto and A. Badawi, “SISTEM KOREKSI KATA DAN PENGENALAN STRUKTUR KALIMAT BERBAHASA INDONESIA DENGAN PENDEKATAN KAMUS BERBASIS LEVENSHTEIN DISTANCE,” 2017.
A. Filcha and M. Hayaty, “Implementasi Algoritma Rabin-Karp untuk Pendeteksi Plagiarisme pada Dokumen Tugas Mahasiswa,” JUITA : Jurnal Informatika, vol. 7, no. 1, p. 25, May 2019, doi: 10.30595/juita.v7i1.4063.
R. Karisma Wibowo and K. Hastuti, “PENERAPAN ALGORITMA WINNOWING UNTUK MENDETEKSI KEMIRIPAN TEKS PADA TUGAS AKHIR MAHASISWA,” 2016.
ACM Digital Library., Association for Computing Machinery. Special Interest Group on Management of Data., and Association for Computing Machinery., Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data : 2003, San Diego, California, June 09-12, 2003. Association for Computing Machinery, 2004.
N. Nurdin and A. Munthoha, “SISTEM PENDETEKSIAN KEMIRIPAN JUDUL SKRIPSI MENGGUNAKAN ALGORITMA WINNOWING,” InfoTekJar (Jurnal Nasional Informatika dan Teknologi Jaringan), vol. 2, no. 1, pp. 90–97, Sep. 2017, doi: 10.30743/infotekjar.v2i1.165.
H. Najjichah, A. Syukur, and H. Subagyo, “PENGARUH TEXT PREPROCESSING DAN KOMBINASINYA PADA PERINGKAS DOKUMEN OTOMATIS TEKS BERBAHASA INDONESIA,” 2019. [Online]. Available: http://research.
N. Alamsyah and M. Rasyidan, “DETEKSI PLAGIARISME TINGKAT KEMIRIPAN JUDUL SKRIPSI PADA FAKULTAS TEKNOLOGI INFORMASI MENGGUNAKAN ALGORITMA WINNOWING,” Technologia: Jurnal Ilmiah, vol. 10, no. 4, p. 197, Oct. 2019, doi: 10.31602/tji.v10i4.2361.
Downloads
Published
Issue
Section
License
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).