Using Entropy in Web Usage Data Preprocessing

The paper is focused on an examination of the use of entropy in the field of web usage mining. Entropy creates an alternative possibility of determining the ratio of auxiliary pages in the session identification using the Reference Length method. The experiment was conducted on two different web por...

Full description

Bibliographic Details
Main Authors: Michal Munk, Lubomir Benko
Format: Article
Language:English
Published: MDPI AG 2018-01-01
Series:Entropy
Subjects:
Online Access:http://www.mdpi.com/1099-4300/20/1/67
id doaj-f1e54774079d4819a5302c339fceb2ca
record_format Article
spelling doaj-f1e54774079d4819a5302c339fceb2ca2020-11-25T00:15:30ZengMDPI AGEntropy1099-43002018-01-012016710.3390/e20010067e20010067Using Entropy in Web Usage Data PreprocessingMichal Munk0Lubomir Benko1Department of Informatics, Constantine the Philosopher University in Nitra, Tr. A. Hlinku 1, 949 74 Nitra, SlovakiaInstitute of System Engineering and Informatics, University of Pardubice, Studentska 95, 532 10 Pardubice, Czech RepublicThe paper is focused on an examination of the use of entropy in the field of web usage mining. Entropy creates an alternative possibility of determining the ratio of auxiliary pages in the session identification using the Reference Length method. The experiment was conducted on two different web portals. The first log file was obtained from a course of virtual learning environment web portal. The second log file was received from the web portal with anonymous access. A comparison of the results of entropy estimation of the ratio of auxiliary pages and a sitemap estimation of the ratio of auxiliary pages showed that in the case of sitemap abundance, entropy could be a full-valued substitution for the estimate of the ratio of auxiliary pages.http://www.mdpi.com/1099-4300/20/1/67data preprocessinginformation entropyweb usage miningsession identificationReference Length
collection DOAJ
language English
format Article
sources DOAJ
author Michal Munk
Lubomir Benko
spellingShingle Michal Munk
Lubomir Benko
Using Entropy in Web Usage Data Preprocessing
Entropy
data preprocessing
information entropy
web usage mining
session identification
Reference Length
author_facet Michal Munk
Lubomir Benko
author_sort Michal Munk
title Using Entropy in Web Usage Data Preprocessing
title_short Using Entropy in Web Usage Data Preprocessing
title_full Using Entropy in Web Usage Data Preprocessing
title_fullStr Using Entropy in Web Usage Data Preprocessing
title_full_unstemmed Using Entropy in Web Usage Data Preprocessing
title_sort using entropy in web usage data preprocessing
publisher MDPI AG
series Entropy
issn 1099-4300
publishDate 2018-01-01
description The paper is focused on an examination of the use of entropy in the field of web usage mining. Entropy creates an alternative possibility of determining the ratio of auxiliary pages in the session identification using the Reference Length method. The experiment was conducted on two different web portals. The first log file was obtained from a course of virtual learning environment web portal. The second log file was received from the web portal with anonymous access. A comparison of the results of entropy estimation of the ratio of auxiliary pages and a sitemap estimation of the ratio of auxiliary pages showed that in the case of sitemap abundance, entropy could be a full-valued substitution for the estimate of the ratio of auxiliary pages.
topic data preprocessing
information entropy
web usage mining
session identification
Reference Length
url http://www.mdpi.com/1099-4300/20/1/67
work_keys_str_mv AT michalmunk usingentropyinwebusagedatapreprocessing
AT lubomirbenko usingentropyinwebusagedatapreprocessing
_version_ 1725386601263005696