Web News Mining Using New Features: A Comparative Study

Web-based applications are a well-known platform to exchange information between Internet-users. However, in this modern world, the processing of huge information or Big-Data such as web news or web advertisement of product information through users is the main challenge. In another side, such web a...

Full description

Bibliographic Details
Main Author: Halgurd S. Maghdid
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8594593/
id doaj-9da4bb7e550343218b9297d6a5afdcb7
record_format Article
spelling doaj-9da4bb7e550343218b9297d6a5afdcb72021-03-29T22:07:38ZengIEEEIEEE Access2169-35362019-01-0175626564110.1109/ACCESS.2018.28900888594593Web News Mining Using New Features: A Comparative StudyHalgurd S. Maghdid0https://orcid.org/0000-0003-1109-4009Department of Software Engineering, FENG, Koya University, University Park, Koy Sanjaq, IraqWeb-based applications are a well-known platform to exchange information between Internet-users. However, in this modern world, the processing of huge information or Big-Data such as web news or web advertisement of product information through users is the main challenge. In another side, such web applications are the most accessible media for users to get up-to-date information. Equally, these applications need huge computation in terms of spaces and times as well as they drain the battery power of the users’ mobile devices. Therefore, one of the solutions to mitigate these challenges is to mine or extract specific information based on specific features. Furthermore, the features will be the users’ behavior or retrieved information from different sources. This article aims at designing and carrying out a web application to extract news information using new features such as geolocation and time information as well as showing a comparative study on three different mining techniques. The application can run on different devices including Laptops, Smartphones, and Tablets. Moreover, the application can retrieve information features accordingly. Then, the obtained information could be used as a basis for starting or as input for the data-mining techniques, including K-Nearest-Neighbor (k-NN), decision tree and deep-learning recurrent neural network (such as Long Short-Term Memory ‘LSTM’). These techniques are separately implemented and they are compared in terms of time/space complexity and classification accuracy. The obtained results show that the mining accuracy via k-NN is the worst one (~85%) and takes much more time, while the mining accuracy through using LSTM is the best one and its accuracy is around (~94%), when location information is used.https://ieeexplore.ieee.org/document/8594593/Web news applicationdata miningclassificationk-NNdecision treeLSTM
collection DOAJ
language English
format Article
sources DOAJ
author Halgurd S. Maghdid
spellingShingle Halgurd S. Maghdid
Web News Mining Using New Features: A Comparative Study
IEEE Access
Web news application
data mining
classification
k-NN
decision tree
LSTM
author_facet Halgurd S. Maghdid
author_sort Halgurd S. Maghdid
title Web News Mining Using New Features: A Comparative Study
title_short Web News Mining Using New Features: A Comparative Study
title_full Web News Mining Using New Features: A Comparative Study
title_fullStr Web News Mining Using New Features: A Comparative Study
title_full_unstemmed Web News Mining Using New Features: A Comparative Study
title_sort web news mining using new features: a comparative study
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description Web-based applications are a well-known platform to exchange information between Internet-users. However, in this modern world, the processing of huge information or Big-Data such as web news or web advertisement of product information through users is the main challenge. In another side, such web applications are the most accessible media for users to get up-to-date information. Equally, these applications need huge computation in terms of spaces and times as well as they drain the battery power of the users’ mobile devices. Therefore, one of the solutions to mitigate these challenges is to mine or extract specific information based on specific features. Furthermore, the features will be the users’ behavior or retrieved information from different sources. This article aims at designing and carrying out a web application to extract news information using new features such as geolocation and time information as well as showing a comparative study on three different mining techniques. The application can run on different devices including Laptops, Smartphones, and Tablets. Moreover, the application can retrieve information features accordingly. Then, the obtained information could be used as a basis for starting or as input for the data-mining techniques, including K-Nearest-Neighbor (k-NN), decision tree and deep-learning recurrent neural network (such as Long Short-Term Memory ‘LSTM’). These techniques are separately implemented and they are compared in terms of time/space complexity and classification accuracy. The obtained results show that the mining accuracy via k-NN is the worst one (~85%) and takes much more time, while the mining accuracy through using LSTM is the best one and its accuracy is around (~94%), when location information is used.
topic Web news application
data mining
classification
k-NN
decision tree
LSTM
url https://ieeexplore.ieee.org/document/8594593/
work_keys_str_mv AT halgurdsmaghdid webnewsminingusingnewfeaturesacomparativestudy
_version_ 1724192194014740480