Using Entropy in Web Usage Data Preprocessing
The paper is focused on an examination of the use of entropy in the field of web usage mining. Entropy creates an alternative possibility of determining the ratio of auxiliary pages in the session identification using the Reference Length method. The experiment was conducted on two different web por...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2018-01-01
|
Series: | Entropy |
Subjects: | |
Online Access: | http://www.mdpi.com/1099-4300/20/1/67 |
id |
doaj-f1e54774079d4819a5302c339fceb2ca |
---|---|
record_format |
Article |
spelling |
doaj-f1e54774079d4819a5302c339fceb2ca2020-11-25T00:15:30ZengMDPI AGEntropy1099-43002018-01-012016710.3390/e20010067e20010067Using Entropy in Web Usage Data PreprocessingMichal Munk0Lubomir Benko1Department of Informatics, Constantine the Philosopher University in Nitra, Tr. A. Hlinku 1, 949 74 Nitra, SlovakiaInstitute of System Engineering and Informatics, University of Pardubice, Studentska 95, 532 10 Pardubice, Czech RepublicThe paper is focused on an examination of the use of entropy in the field of web usage mining. Entropy creates an alternative possibility of determining the ratio of auxiliary pages in the session identification using the Reference Length method. The experiment was conducted on two different web portals. The first log file was obtained from a course of virtual learning environment web portal. The second log file was received from the web portal with anonymous access. A comparison of the results of entropy estimation of the ratio of auxiliary pages and a sitemap estimation of the ratio of auxiliary pages showed that in the case of sitemap abundance, entropy could be a full-valued substitution for the estimate of the ratio of auxiliary pages.http://www.mdpi.com/1099-4300/20/1/67data preprocessinginformation entropyweb usage miningsession identificationReference Length |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Michal Munk Lubomir Benko |
spellingShingle |
Michal Munk Lubomir Benko Using Entropy in Web Usage Data Preprocessing Entropy data preprocessing information entropy web usage mining session identification Reference Length |
author_facet |
Michal Munk Lubomir Benko |
author_sort |
Michal Munk |
title |
Using Entropy in Web Usage Data Preprocessing |
title_short |
Using Entropy in Web Usage Data Preprocessing |
title_full |
Using Entropy in Web Usage Data Preprocessing |
title_fullStr |
Using Entropy in Web Usage Data Preprocessing |
title_full_unstemmed |
Using Entropy in Web Usage Data Preprocessing |
title_sort |
using entropy in web usage data preprocessing |
publisher |
MDPI AG |
series |
Entropy |
issn |
1099-4300 |
publishDate |
2018-01-01 |
description |
The paper is focused on an examination of the use of entropy in the field of web usage mining. Entropy creates an alternative possibility of determining the ratio of auxiliary pages in the session identification using the Reference Length method. The experiment was conducted on two different web portals. The first log file was obtained from a course of virtual learning environment web portal. The second log file was received from the web portal with anonymous access. A comparison of the results of entropy estimation of the ratio of auxiliary pages and a sitemap estimation of the ratio of auxiliary pages showed that in the case of sitemap abundance, entropy could be a full-valued substitution for the estimate of the ratio of auxiliary pages. |
topic |
data preprocessing information entropy web usage mining session identification Reference Length |
url |
http://www.mdpi.com/1099-4300/20/1/67 |
work_keys_str_mv |
AT michalmunk usingentropyinwebusagedatapreprocessing AT lubomirbenko usingentropyinwebusagedatapreprocessing |
_version_ |
1725386601263005696 |