Exploiting Document Similarities for Plagiarism Detection
碩士 === 國立成功大學 === 工程科學系碩博士班 === 95 === As information and networking technologies advance, people can easily get what they need on the web. This facilitates the learning and sharing processes among people. However, the plagiarism problem is also becoming more and more serious if people depreciate th...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2007
|
Online Access: | http://ndltd.ncl.edu.tw/handle/19494832415803845710 |
id |
ndltd-TW-095NCKU5028066 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-095NCKU50280662015-10-13T14:16:31Z http://ndltd.ncl.edu.tw/handle/19494832415803845710 Exploiting Document Similarities for Plagiarism Detection 有效利用文件相似度之剽竊偵測方法 Heng-rui Zhang 張恒瑞 碩士 國立成功大學 工程科學系碩博士班 95 As information and networking technologies advance, people can easily get what they need on the web. This facilitates the learning and sharing processes among people. However, the plagiarism problem is also becoming more and more serious if people depreciate the creativity and intellectual property of others. An effective way to reduce the impacts of plagiarism lies on the detection techniques. In this work, we focus on extending the capabilities of identifying document similarities for plagiarism detection. Specifically, two crucial issues are addressed in this thesis. The first issue is on devising a proper technique to segment a suspicious document into smaller pieces for following steps to identify possibly multiple sources. On the other hand, since a plagiarist may slightly revise the grabbed contents when compiling into the plagiarized document, a technique to identify partial changes in a text segment should be developed. Moreover, our approach is carefully designed to reduce redundant computation cost when conducting comparison of document similarities. To verify the feasibility of our approach, empirical studies show that plagiarized documents and thus the malicious users can be precisely identified in a very efficient way. Wei-Guang Teng 鄧維光 2007 學位論文 ; thesis 45 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立成功大學 === 工程科學系碩博士班 === 95 === As information and networking technologies advance, people can easily get what they need on the web. This facilitates the learning and sharing processes among people. However, the plagiarism problem is also becoming more and more serious if people depreciate the creativity and intellectual property of others. An effective way to reduce the impacts of plagiarism lies on the detection techniques. In this work, we focus on extending the capabilities of identifying document similarities for plagiarism detection. Specifically, two crucial issues are addressed in this thesis. The first issue is on devising a proper technique to segment a suspicious document into smaller pieces for following steps to identify possibly multiple sources. On the other hand, since a plagiarist may slightly revise the grabbed contents when compiling into the plagiarized document, a technique to identify partial changes in a text segment should be developed. Moreover, our approach is carefully designed to reduce redundant computation cost when conducting comparison of document similarities. To verify the feasibility of our approach, empirical studies show that plagiarized documents and thus the malicious users can be precisely identified in a very efficient way.
|
author2 |
Wei-Guang Teng |
author_facet |
Wei-Guang Teng Heng-rui Zhang 張恒瑞 |
author |
Heng-rui Zhang 張恒瑞 |
spellingShingle |
Heng-rui Zhang 張恒瑞 Exploiting Document Similarities for Plagiarism Detection |
author_sort |
Heng-rui Zhang |
title |
Exploiting Document Similarities for Plagiarism Detection |
title_short |
Exploiting Document Similarities for Plagiarism Detection |
title_full |
Exploiting Document Similarities for Plagiarism Detection |
title_fullStr |
Exploiting Document Similarities for Plagiarism Detection |
title_full_unstemmed |
Exploiting Document Similarities for Plagiarism Detection |
title_sort |
exploiting document similarities for plagiarism detection |
publishDate |
2007 |
url |
http://ndltd.ncl.edu.tw/handle/19494832415803845710 |
work_keys_str_mv |
AT hengruizhang exploitingdocumentsimilaritiesforplagiarismdetection AT zhānghéngruì exploitingdocumentsimilaritiesforplagiarismdetection AT hengruizhang yǒuxiàolìyòngwénjiànxiāngshìdùzhīpiāoqièzhēncèfāngfǎ AT zhānghéngruì yǒuxiàolìyòngwénjiànxiāngshìdùzhīpiāoqièzhēncèfāngfǎ |
_version_ |
1717750889513484288 |