Exploiting Document Similarities for Plagiarism Detection

Master's === National Cheng Kung University === Department of Engineering Science (Master's and Doctoral Program) === 95 === As information and networking technologies advance, people can easily get what they need on the web. This facilitates the learning and sharing processes among people. However, the plagiarism problem is also becoming more and more serious if people depreciate th...


Bibliographic Details
Main Authors: Heng-rui Zhang, 張恒瑞
Other Authors: Wei-Guang Teng
Format: Others
Language: en_US
Published: 2007
Online Access: http://ndltd.ncl.edu.tw/handle/19494832415803845710
id ndltd-TW-095NCKU5028066
record_format oai_dc
spelling ndltd-TW-095NCKU5028066 2015-10-13T14:16:31Z http://ndltd.ncl.edu.tw/handle/19494832415803845710 Exploiting Document Similarities for Plagiarism Detection 有效利用文件相似度之剽竊偵測方法 Heng-rui Zhang 張恒瑞 Master's National Cheng Kung University Department of Engineering Science (Master's and Doctoral Program) 95 Wei-Guang Teng 鄧維光 2007 thesis 45 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description Master's === National Cheng Kung University === Department of Engineering Science (Master's and Doctoral Program) === 95 === As information and networking technologies advance, people can easily find what they need on the web, which facilitates learning and sharing. However, plagiarism also becomes more serious when people disregard the creativity and intellectual property of others. An effective way to reduce the impact of plagiarism lies in detection techniques. In this work, we focus on extending the capability of identifying document similarities for plagiarism detection. Specifically, two crucial issues are addressed in this thesis. The first is devising a proper technique to segment a suspicious document into smaller pieces, so that subsequent steps can identify possibly multiple sources. The second is that, since a plagiarist may slightly revise the copied content when compiling it into the plagiarized document, a technique to identify partial changes in a text segment is needed. Moreover, our approach is carefully designed to reduce redundant computation when comparing document similarities. Empirical studies verify the feasibility of our approach, showing that plagiarized documents, and thus the malicious users, can be identified precisely and efficiently.
author2 Wei-Guang Teng
author_facet Wei-Guang Teng
Heng-rui Zhang
張恒瑞
author Heng-rui Zhang
張恒瑞
spellingShingle Heng-rui Zhang
張恒瑞
Exploiting Document Similarities for Plagiarism Detection
author_sort Heng-rui Zhang
title Exploiting Document Similarities for Plagiarism Detection
publishDate 2007
url http://ndltd.ncl.edu.tw/handle/19494832415803845710
work_keys_str_mv AT hengruizhang exploitingdocumentsimilaritiesforplagiarismdetection
AT zhānghéngruì exploitingdocumentsimilaritiesforplagiarismdetection
AT hengruizhang yǒuxiàolìyòngwénjiànxiāngshìdùzhīpiāoqièzhēncèfāngfǎ
AT zhānghéngruì yǒuxiàolìyòngwénjiànxiāngshìdùzhīpiāoqièzhēncèfāngfǎ
_version_ 1717750889513484288