Exploiting Document Similarities for Plagiarism Detection

碩士 === 國立成功大學 === 工程科學系碩博士班 === 95 === As information and networking technologies advance, people can easily get what they need on the web. This facilitates the learning and sharing processes among people. However, the plagiarism problem is also becoming more and more serious if people depreciate th...

Full description

Bibliographic Details
Main Authors:	Heng-rui Zhang, 張恒瑞
Other Authors:	Wei-Guang Teng
Format:	Others
Language:	en_US
Published:	2007
Online Access:	http://ndltd.ncl.edu.tw/handle/19494832415803845710

id	ndltd-TW-095NCKU5028066
record_format	oai_dc
spelling	ndltd-TW-095NCKU50280662015-10-13T14:16:31Z http://ndltd.ncl.edu.tw/handle/19494832415803845710 Exploiting Document Similarities for Plagiarism Detection 有效利用文件相似度之剽竊偵測方法 Heng-rui Zhang 張恒瑞碩士國立成功大學工程科學系碩博士班 95 As information and networking technologies advance, people can easily get what they need on the web. This facilitates the learning and sharing processes among people. However, the plagiarism problem is also becoming more and more serious if people depreciate the creativity and intellectual property of others. An effective way to reduce the impacts of plagiarism lies on the detection techniques. In this work, we focus on extending the capabilities of identifying document similarities for plagiarism detection. Specifically, two crucial issues are addressed in this thesis. The first issue is on devising a proper technique to segment a suspicious document into smaller pieces for following steps to identify possibly multiple sources. On the other hand, since a plagiarist may slightly revise the grabbed contents when compiling into the plagiarized document, a technique to identify partial changes in a text segment should be developed. Moreover, our approach is carefully designed to reduce redundant computation cost when conducting comparison of document similarities. To verify the feasibility of our approach, empirical studies show that plagiarized documents and thus the malicious users can be precisely identified in a very efficient way. Wei-Guang Teng 鄧維光 2007 學位論文 ; thesis 45 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	碩士 === 國立成功大學 === 工程科學系碩博士班 === 95 === As information and networking technologies advance, people can easily get what they need on the web. This facilitates the learning and sharing processes among people. However, the plagiarism problem is also becoming more and more serious if people depreciate the creativity and intellectual property of others. An effective way to reduce the impacts of plagiarism lies on the detection techniques. In this work, we focus on extending the capabilities of identifying document similarities for plagiarism detection. Specifically, two crucial issues are addressed in this thesis. The first issue is on devising a proper technique to segment a suspicious document into smaller pieces for following steps to identify possibly multiple sources. On the other hand, since a plagiarist may slightly revise the grabbed contents when compiling into the plagiarized document, a technique to identify partial changes in a text segment should be developed. Moreover, our approach is carefully designed to reduce redundant computation cost when conducting comparison of document similarities. To verify the feasibility of our approach, empirical studies show that plagiarized documents and thus the malicious users can be precisely identified in a very efficient way.
author2	Wei-Guang Teng
author_facet	Wei-Guang Teng Heng-rui Zhang 張恒瑞
author	Heng-rui Zhang 張恒瑞
spellingShingle	Heng-rui Zhang 張恒瑞 Exploiting Document Similarities for Plagiarism Detection
author_sort	Heng-rui Zhang
title	Exploiting Document Similarities for Plagiarism Detection
title_short	Exploiting Document Similarities for Plagiarism Detection
title_full	Exploiting Document Similarities for Plagiarism Detection
title_fullStr	Exploiting Document Similarities for Plagiarism Detection
title_full_unstemmed	Exploiting Document Similarities for Plagiarism Detection
title_sort	exploiting document similarities for plagiarism detection
publishDate	2007
url	http://ndltd.ncl.edu.tw/handle/19494832415803845710
work_keys_str_mv	AT hengruizhang exploitingdocumentsimilaritiesforplagiarismdetection AT zhānghéngruì exploitingdocumentsimilaritiesforplagiarismdetection AT hengruizhang yǒuxiàolìyòngwénjiànxiāngshìdùzhīpiāoqièzhēncèfāngfǎ AT zhānghéngruì yǒuxiàolìyòngwénjiànxiāngshìdùzhīpiāoqièzhēncèfāngfǎ
_version_	1717750889513484288

Exploiting Document Similarities for Plagiarism Detection

Similar Items