Plagiarism Detection Using Knowledge-based Techniques

Master's === Shu-Te University (樹德科技大學) === Graduate Institute of Information Management (資訊管理研究所) === 91 === Plagiarism is an important issue in copyright and authorization protection. With the growth of the Internet, it is much easier for plagiarists to copy various materials from the Internet and put them in their own documents without the permission of...


Bibliographic Details
Main Authors: Jung-Sheng Yang, 楊榮生
Other Authors: Chih-Hung Wu
Format: Others
Language: en_US
Published: 2003
Online Access: http://ndltd.ncl.edu.tw/handle/18899988613701247679
id ndltd-TW-091STU00396001
record_format oai_dc
spelling ndltd-TW-091STU00396001 2015-10-13T13:35:30Z http://ndltd.ncl.edu.tw/handle/18899988613701247679 Plagiarism Detection Using Knowledge-based Techniques 使用知識庫的技術進行文件抄襲偵測 Jung-Sheng Yang 楊榮生 Master's Shu-Te University (樹德科技大學) Graduate Institute of Information Management (資訊管理研究所) 91 Chih-Hung Wu 吳志宏 2003 thesis (學位論文) 56 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description Master's === Shu-Te University (樹德科技大學) === Graduate Institute of Information Management (資訊管理研究所) === 91 === Plagiarism is an important issue in copyright and authorization protection. With the growth of the Internet, it is much easier for plagiarists to copy various materials from the Internet and put them in their own documents without the permission of the original authors. Manual detection may be the most accurate approach to this issue. Intuitively, to detect plagiarism of documents is to detect how similar the documents are. Only a few automated systems exist for plagiarism detection. The complexity and performance of these methods depend on how documents are copied. Most of the methods employ pairwise comparison of the input strings. For documents that are copied and pasted without any modifications, detecting the existence of a copied area is simple. Nevertheless, plagiarists usually change the order of sentences, add or delete a couple of words, modify the writing style, etc., to avoid detection. These comparison-based approaches perform well on documents that are completely plagiarized from the sources. It is observed that there are a lot of materials available on the Internet and a variety of plagiarizing behaviors. Simple comparison is not sufficient for detecting documents that are tailored by sophisticated plagiarists. In this paper, we present how the knowledge-based approach can serve as a flexible solution to the detection of plagiarized text. We analyze the types and behaviors of plagiarism and describe the documents to be tested in the form of graph structures. The problem of detecting plagiarism is then rephrased as the comparison of such graph structures. We consulted several senior faculty members in our department to collect the knowledge for selecting, identifying, and comparing the structures and contents of documents. Based on this idea, a knowledge-based system is implemented in CLIPS. Experimental results show that our approach is workable and effective.
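The abstract notes that pairwise string comparison breaks down once a plagiarist reorders sentences. The Python sketch below illustrates only that observation; it is not the thesis's CLIPS system or its graph representation. The function names (`sentences`, `string_similarity`, `sentence_overlap`) and the sample texts are hypothetical, and the Jaccard overlap of sentence sets is a simple stand-in for an order-insensitive structural comparison.

```python
import re
from difflib import SequenceMatcher


def sentences(text):
    """Split text into normalized sentences (lowercase, punctuation removed)."""
    parts = re.split(r"[.!?]+", text.lower())
    normalized = [" ".join(re.findall(r"[a-z0-9]+", part)) for part in parts]
    return [s for s in normalized if s]


def string_similarity(a, b):
    """Order-sensitive similarity of the raw strings, between 0.0 and 1.0."""
    return SequenceMatcher(None, a, b).ratio()


def sentence_overlap(a, b):
    """Jaccard overlap of the two sentence sets; sentence order is ignored."""
    set_a, set_b = set(sentences(a)), set(sentences(b))
    if not set_a and not set_b:
        return 1.0  # two empty documents are treated as identical
    return len(set_a & set_b) / len(set_a | set_b)


# Hypothetical sample: the second text is the first with its sentences reordered.
source = "Graphs model documents. Rules compare the graphs. Experts supply the rules."
reordered = "Experts supply the rules. Graphs model documents. Rules compare the graphs."

print("raw string similarity:", string_similarity(source, reordered))
print("sentence-set overlap: ", sentence_overlap(source, reordered))
```

The raw string score drops below 1.0 for the reordered copy, while the sentence-set overlap stays at 1.0. Word-level edits would defeat this sketch too, which is the kind of gap the abstract's knowledge-based comparison is meant to address.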
author2 Chih-Hung Wu
author_facet Chih-Hung Wu
Jung-Sheng Yang
楊榮生
author Jung-Sheng Yang
楊榮生
spellingShingle Jung-Sheng Yang
楊榮生
Plagiarism Detection Using Knowledge-based Techniques
author_sort Jung-Sheng Yang
title Plagiarism Detection Using Knowledge-based Techniques
title_short Plagiarism Detection Using Knowledge-based Techniques
title_full Plagiarism Detection Using Knowledge-based Techniques
title_fullStr Plagiarism Detection Using Knowledge-based Techniques
title_full_unstemmed Plagiarism Detection Using Knowledge-based Techniques
title_sort plagiarism detection using knowledge-based techniques
publishDate 2003
url http://ndltd.ncl.edu.tw/handle/18899988613701247679
work_keys_str_mv AT jungshengyang plagiarismdetectionusingknowledgebasedtechniques
AT yángróngshēng plagiarismdetectionusingknowledgebasedtechniques
AT jungshengyang shǐyòngzhīshíkùdejìshùjìnxíngwénjiànchāoxízhēncè
AT yángróngshēng shǐyòngzhīshíkùdejìshùjìnxíngwénjiànchāoxízhēncè
_version_ 1717737806232551424