ANALISIS TINGKAT PLAGIASI DOKUMEN SKRIPSI DENGAN METODE COSINE SIMILARITY DAN PEMBOBOTAN TF-IDF

  • Muhammad Azmi STMIK Syaikh Zainuddin NW Anjani
Keywords: Plagiarism, Cosine Similarity, Weighting TF-IDF

Abstract

Plagiarism is the activity of duplicating or imitating the work of others then recognized as his own work without the author's permission or listing the source. Plagiarism or plagiarism is not something that is difficult to do because by using a copy-paste-modify technique in part or all of the document, the document can be said to be the result of plagiarism or duplication.

The practice of plagiarism occurs because students are accustomed to taking the writings of others without including the source of origin, even copying in its entirety and exactly the same. Plagiarism practices are mostly carried out by students, especially when completing the final project or thesis

One way that can be used to prevent the practice of plagiarism is by doing prevention and detecting. Plagiarism detection uses the concept of similarity or document similarity is one way to detect copy & paste plagiarism and disguised plagiarism. one of the right methods that can be done to detect plagiarism by analyzing the level of document plagiarism using the Cosine Similarity method and the TF-IDF weighting.

This research produces an application that is able to process the similarity value of the document to be tested. Hasik testing shows that it is appropriate between manual calculations and implementation of algorithms in the application made. Use of the Literature Library is quite effective in the Stemming process. Calculations that use stemming will have a higher similarity value compared to calculations without stemming methods.

References

[1] Irianto, WA., 2014, Penentuan Tingkat Plagiarisme Dokumen Penelitian Menggunakan Cenroid Lingkage Hierarchical Method (Clhm), Jurnal Program Teknologi Informasi Dan Ilmu Komputer. Universitas Brawijaya Malang.
[2] S. Sastroasmoro., 2006, Beberapa catatan tentang, Majalah Kedokteran Indonesia, Vol. 55, Hal. 1.
[3] Salmuasih., Sunyoto Andi., 2013, Implementasi Algoritma Rabin Karp untuk pendeteksian Plagiat Dokumen Teks Menggunakan Konsep Similarity, Seminar Nasional Aplikasi Teknologi Informasi (SNATI), Yogyakarta.
[4] Pemerintah Indonesia. 2010. Peraturan Mendiknas Republik Indonesia No. 17 Tahun 2010 Tentang Pencegahan dan Penanggulangan Plagiat di Perguruan Tinggi. Lembaran Negara RI Tahun 2010. Kemendikbud. Jakarta.
[5] Qaiser, Shahzad., 2018, Text Mining: Use of TF-IDF to Examine the Relevance of
Words to Documents, International Journal of Computer Applications (0975 – 8887)
Volume 181 – No.1, July 2018.
[6] H. Wu and R. Luk and K. Wong and K. Kwok., 2008, Interpreting TF-IDF term weights as making relevance decisions, ACM Transactions on Information Systems, 26 (3).
Published
2022-01-15
How to Cite
Azmi, M. (2022). ANALISIS TINGKAT PLAGIASI DOKUMEN SKRIPSI DENGAN METODE COSINE SIMILARITY DAN PEMBOBOTAN TF-IDF. TEKNIMEDIA: Teknologi Informasi Dan Multimedia, 2(2), 90-95. https://doi.org/10.46764/teknimedia.v2i2.51
Abstract viewed = 244 times
PDF downloaded = 340 times