Evaluation of Fingerprint Selection Algorithms for Local Text Reuse Detection

Detection of local text reuse is central to a variety of applications, including plagiarism detection, origin detection, and information flow analysis. This paper evaluates and compares effectiveness of fingerprint selection algorithms for the source retrieval stage of local text reuse detection. In...

Full description

Bibliographic Details
Main Author: Jēkabsons Gints
Format: Article
Language:English
Published: Sciendo 2020-05-01
Series:Applied Computer Systems
Subjects:
Online Access:https://doi.org/10.2478/acss-2020-0002
id doaj-1e32584c7aab49fc8a97a673a4ab48ed
record_format Article
spelling doaj-1e32584c7aab49fc8a97a673a4ab48ed2021-09-06T19:41:00ZengSciendoApplied Computer Systems2255-86912020-05-01251111810.2478/acss-2020-0002acss-2020-0002Evaluation of Fingerprint Selection Algorithms for Local Text Reuse DetectionJēkabsons Gints0Riga Technical University, Riga, LatviaDetection of local text reuse is central to a variety of applications, including plagiarism detection, origin detection, and information flow analysis. This paper evaluates and compares effectiveness of fingerprint selection algorithms for the source retrieval stage of local text reuse detection. In total, six algorithms are compared – Every p-th, 0 mod p, Winnowing, Hailstorm, Frequency-biased Winnowing (FBW), as well as the proposed modified version of FBW (MFBW).https://doi.org/10.2478/acss-2020-0002document fingerprintingfingerprint selectionlocal text reuse detectionplagiarism detection
collection DOAJ
language English
format Article
sources DOAJ
author Jēkabsons Gints
spellingShingle Jēkabsons Gints
Evaluation of Fingerprint Selection Algorithms for Local Text Reuse Detection
Applied Computer Systems
document fingerprinting
fingerprint selection
local text reuse detection
plagiarism detection
author_facet Jēkabsons Gints
author_sort Jēkabsons Gints
title Evaluation of Fingerprint Selection Algorithms for Local Text Reuse Detection
title_short Evaluation of Fingerprint Selection Algorithms for Local Text Reuse Detection
title_full Evaluation of Fingerprint Selection Algorithms for Local Text Reuse Detection
title_fullStr Evaluation of Fingerprint Selection Algorithms for Local Text Reuse Detection
title_full_unstemmed Evaluation of Fingerprint Selection Algorithms for Local Text Reuse Detection
title_sort evaluation of fingerprint selection algorithms for local text reuse detection
publisher Sciendo
series Applied Computer Systems
issn 2255-8691
publishDate 2020-05-01
description Detection of local text reuse is central to a variety of applications, including plagiarism detection, origin detection, and information flow analysis. This paper evaluates and compares effectiveness of fingerprint selection algorithms for the source retrieval stage of local text reuse detection. In total, six algorithms are compared – Every p-th, 0 mod p, Winnowing, Hailstorm, Frequency-biased Winnowing (FBW), as well as the proposed modified version of FBW (MFBW).
topic document fingerprinting
fingerprint selection
local text reuse detection
plagiarism detection
url https://doi.org/10.2478/acss-2020-0002
work_keys_str_mv AT jekabsonsgints evaluationoffingerprintselectionalgorithmsforlocaltextreusedetection
_version_ 1717767181367771136