ANALYSIS OF THE PLAGIARISM LEVEL OF THESIS DOCUMENTS USING THE COSINE SIMILARITY METHOD AND TF-IDF WEIGHTING
Abstract
Plagiarism is the act of duplicating or imitating another person's work and presenting it as one's own, without the author's permission or without citing the source. Plagiarism is not difficult to commit: applying a copy-paste-modify technique to part or all of a document is enough to produce a document that is the result of plagiarism or duplication.
The practice of plagiarism occurs because students are accustomed to taking other people's writing without citing its source, sometimes copying it in full and verbatim. Plagiarism is mostly committed by students, especially when completing a final project or thesis.
One way to curb the practice of plagiarism is through prevention and detection. Plagiarism detection based on the concept of document similarity is one way to detect both copy-and-paste plagiarism and disguised plagiarism. One suitable method is to analyze the level of document plagiarism using the Cosine Similarity method with TF-IDF weighting.
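The combination of TF-IDF weighting and Cosine Similarity can be illustrated with a minimal sketch. It is not the application described in this abstract: the tokenizer, the sample sentences, and the smoothed weighting `tf * (1 + ln(N / df))` are assumptions chosen for illustration, and the paper may use a different TF-IDF variant.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately simple tokenizer)."""
    return text.lower().split()


def tf_idf_vectors(documents):
    """Return one {term: weight} dict per document.

    Assumed weighting: tf * (1 + ln(N / df)). The "+ 1" keeps terms that
    occur in every document from being weighted to zero.
    """
    tokenized = [tokenize(doc) for doc in documents]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        vectors.append(
            {term: tf[term] * (1.0 + math.log(n_docs / df[term])) for term in tf}
        )
    return vectors


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse {term: weight} vectors."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical sample documents, not taken from the study.
docs = [
    "plagiarism detection uses document similarity",
    "document similarity detects plagiarism",
    "weather report for tomorrow",
]
vectors = tf_idf_vectors(docs)
sim_related = cosine_similarity(vectors[0], vectors[1])
sim_unrelated = cosine_similarity(vectors[0], vectors[2])
print(f"related: {sim_related:.3f}, unrelated: {sim_unrelated:.3f}")
```

The two documents that share vocabulary score higher than the pair with no terms in common, whose similarity is zero because the dot product has no overlapping terms.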
This research produces an application that computes the similarity value of the documents under test. Test results show that the manual calculations agree with the algorithm as implemented in the application. The Literature Library is quite effective in the stemming process. Calculations that use stemming yield a higher similarity value than calculations without stemming.
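Why stemming tends to raise the similarity value can be seen in a small sketch. The suffix-stripping `toy_stem` function below is hypothetical and stands in for a real stemmer; it is not the library used in the study, and the sample sentences and raw term counts are assumptions made to keep the example short.

```python
import math
from collections import Counter

# Hypothetical suffix list for a toy English stemmer (illustration only).
SUFFIXES = ("ing", "ed", "s")


def toy_stem(word):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def cosine_similarity(tokens_a, tokens_b):
    """Cosine similarity over raw term-count vectors."""
    a, b = Counter(tokens_a), Counter(tokens_b)
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical sample texts that differ only in word endings.
tokens_a = "detecting copied documents".split()
tokens_b = "detected copied document".split()

similarity_raw = cosine_similarity(tokens_a, tokens_b)
similarity_stemmed = cosine_similarity(
    [toy_stem(t) for t in tokens_a],
    [toy_stem(t) for t in tokens_b],
)
print(f"without stemming: {similarity_raw:.3f}")
print(f"with stemming:    {similarity_stemmed:.3f}")
```

Without stemming only one term matches exactly. After stemming, the inflected variants collapse to the same stems, so more terms overlap and the cosine value rises.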
Copyright (c) 2022 TEKNIMEDIA: Teknologi Informasi dan Multimedia
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All articles in this journal are the full responsibility of their authors. Jurnal Teknimedia provides open access to everyone so that the information and findings in its articles benefit all. Jurnal Teknimedia can be accessed and downloaded free of charge, in accordance with the Creative Commons license used.