A Review of Text-Based Recommendation Systems

Many websites over the Internet are producing a variety of textual data; such as news, research articles, ebooks, personal blogs, and user reviews. In these websites, the textual data is so large that the process of finding pertinent information by a user often becomes cumbersome. To overcome this i...

Full description

Bibliographic Details
Main Authors: Safia Kanwal, Sidra Nawaz, Muhammad Kamran Malik, Zubair Nawaz
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9354169/
id doaj-dfd3268215cd48509d7428512db2fb77
record_format Article
spelling doaj-dfd3268215cd48509d7428512db2fb772021-03-30T15:08:37ZengIEEEIEEE Access2169-35362021-01-019316383166110.1109/ACCESS.2021.30593129354169A Review of Text-Based Recommendation SystemsSafia Kanwal0https://orcid.org/0000-0003-2041-9696Sidra Nawaz1https://orcid.org/0000-0003-1868-2129Muhammad Kamran Malik2https://orcid.org/0000-0002-1392-8866Zubair Nawaz3https://orcid.org/0000-0002-7989-8401Punjab University College of Information Technology (PUCIT), Lahore, PakistanPunjab University College of Information Technology (PUCIT), Lahore, PakistanPunjab University College of Information Technology (PUCIT), Lahore, PakistanPunjab University College of Information Technology (PUCIT), Lahore, PakistanMany websites over the Internet are producing a variety of textual data; such as news, research articles, ebooks, personal blogs, and user reviews. In these websites, the textual data is so large that the process of finding pertinent information by a user often becomes cumbersome. To overcome this issue, “Text-based Recommendation Systems (RS)” are being developed. They are the systems with the capability to find the relevant information in a minimal time using text as the primary feature. There exist several techniques to build and evaluate such systems. And though a good number of surveys compile the general attributes of recommendation systems, there is still a lack of comprehensive literature review about the text-based recommendation systems. In this paper, we present a review of the latest studies on text-based RS. We have conducted this survey by collecting literature from preeminent digital repositories, that was published during the period 2010-2020. This survey mainly covers the four major aspects of the textual based recommendation systems used in the reviewed literature. The aspects are datasets, feature extraction techniques, computational approaches, and evaluation metrics. As benchmark datasets carry a vital role in any research, publicly available datasets are extensively reviewed in this paper. Moreover, for text-based RS many proprietary datasets are also used, which are not available in the public. But we have consolidated all the attributes of these publically available and proprietary datasets to familiarize these attributes to new researchers. Furthermore, the feature extraction methods from the text are briefed and their usage in the construction of text-based RS are discussed. Later, various computational approaches that use these features are also discussed. To evaluate these systems, some evaluation metrics are adopted. We have presented an overview of these evaluation metrics and diagramed them according to their popularity. The survey concludes that Word Embedding is the widely used feature selection technique in the latest research. The survey also deduces that hybridization of text features with other features enhance the recommendation accuracy. The study highlights the fact that most of the work is on English textual data, and News recommendation is the most popular domain.https://ieeexplore.ieee.org/document/9354169/Recommendation systemsreview of recommendation systemtext-based recommendation system
collection DOAJ
language English
format Article
sources DOAJ
author Safia Kanwal
Sidra Nawaz
Muhammad Kamran Malik
Zubair Nawaz
spellingShingle Safia Kanwal
Sidra Nawaz
Muhammad Kamran Malik
Zubair Nawaz
A Review of Text-Based Recommendation Systems
IEEE Access
Recommendation systems
review of recommendation system
text-based recommendation system
author_facet Safia Kanwal
Sidra Nawaz
Muhammad Kamran Malik
Zubair Nawaz
author_sort Safia Kanwal
title A Review of Text-Based Recommendation Systems
title_short A Review of Text-Based Recommendation Systems
title_full A Review of Text-Based Recommendation Systems
title_fullStr A Review of Text-Based Recommendation Systems
title_full_unstemmed A Review of Text-Based Recommendation Systems
title_sort review of text-based recommendation systems
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description Many websites over the Internet are producing a variety of textual data; such as news, research articles, ebooks, personal blogs, and user reviews. In these websites, the textual data is so large that the process of finding pertinent information by a user often becomes cumbersome. To overcome this issue, “Text-based Recommendation Systems (RS)” are being developed. They are the systems with the capability to find the relevant information in a minimal time using text as the primary feature. There exist several techniques to build and evaluate such systems. And though a good number of surveys compile the general attributes of recommendation systems, there is still a lack of comprehensive literature review about the text-based recommendation systems. In this paper, we present a review of the latest studies on text-based RS. We have conducted this survey by collecting literature from preeminent digital repositories, that was published during the period 2010-2020. This survey mainly covers the four major aspects of the textual based recommendation systems used in the reviewed literature. The aspects are datasets, feature extraction techniques, computational approaches, and evaluation metrics. As benchmark datasets carry a vital role in any research, publicly available datasets are extensively reviewed in this paper. Moreover, for text-based RS many proprietary datasets are also used, which are not available in the public. But we have consolidated all the attributes of these publically available and proprietary datasets to familiarize these attributes to new researchers. Furthermore, the feature extraction methods from the text are briefed and their usage in the construction of text-based RS are discussed. Later, various computational approaches that use these features are also discussed. To evaluate these systems, some evaluation metrics are adopted. We have presented an overview of these evaluation metrics and diagramed them according to their popularity. The survey concludes that Word Embedding is the widely used feature selection technique in the latest research. The survey also deduces that hybridization of text features with other features enhance the recommendation accuracy. The study highlights the fact that most of the work is on English textual data, and News recommendation is the most popular domain.
topic Recommendation systems
review of recommendation system
text-based recommendation system
url https://ieeexplore.ieee.org/document/9354169/
work_keys_str_mv AT safiakanwal areviewoftextbasedrecommendationsystems
AT sidranawaz areviewoftextbasedrecommendationsystems
AT muhammadkamranmalik areviewoftextbasedrecommendationsystems
AT zubairnawaz areviewoftextbasedrecommendationsystems
AT safiakanwal reviewoftextbasedrecommendationsystems
AT sidranawaz reviewoftextbasedrecommendationsystems
AT muhammadkamranmalik reviewoftextbasedrecommendationsystems
AT zubairnawaz reviewoftextbasedrecommendationsystems
_version_ 1724180044007342080