Exploring demographic information in online social networks for improving content classification

The daily interaction between users within online social networks (OSNs) is an effective way to analyze and interpret its context in real time in order to capture the interests, preferences, and concerns of the OSNs users. These offer a unique information source for several applications in several f...

Full description

Bibliographic Details
Main Authors: Randa Benkhelifa, Fatima Zohra Laallam
Format: Article
Language:English
Published: Elsevier 2020-11-01
Series:Journal of King Saud University: Computer and Information Sciences
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1319157818302751
id doaj-29d7098478124cf79aebc0382a3a1a92
record_format Article
spelling doaj-29d7098478124cf79aebc0382a3a1a922020-11-25T02:42:08ZengElsevierJournal of King Saud University: Computer and Information Sciences1319-15782020-11-0132910341044Exploring demographic information in online social networks for improving content classificationRanda Benkhelifa0Fatima Zohra Laallam1Corresponding author.; Faculté des nouvelles technologies de l’information et de la communication, Laboratoire de l’intelligence artificielle et des technologies de l'information, Université Kasdi Merbah Ouargla, Ouargla 30 000, AlgeriaFaculté des nouvelles technologies de l’information et de la communication, Laboratoire de l’intelligence artificielle et des technologies de l'information, Université Kasdi Merbah Ouargla, Ouargla 30 000, AlgeriaThe daily interaction between users within online social networks (OSNs) is an effective way to analyze and interpret its context in real time in order to capture the interests, preferences, and concerns of the OSNs users. These offer a unique information source for several applications in several fields such as trendsetting, future prediction, recommendation systems, community detection, and marketing. Most of the existing studies on text classification in OSNs rely on content based approach, in order to capture users interests through exploiting and categorizing the unstructured textual content shared by those users according to their topics. Moreover, users public profiles available on OSNs often reveal their demographic attributes such as age, gender, education, marital status, etc., which can play an essential role in identifying users interests and preferences. User demographic attributes can provide some preferences for some topics of interests. People with different demographic attributes may be interested in different topics, while people with similar demographic attributes may have the same interests. Usually, young people are more interested in technology than old people, who are more interested in the political news than young people. In this paper, we propose a demographic-content-based approach which uses both users demographic attributes and the textual content to classify OSNs posts using six classifiers ANN, k-NN, Naïve Bayes, Decision Tree, Decision rules and SVM. The experiments are done on a large Facebook dataset in order to analyze the effect of these demographic attributes on the performance of the categorization of the shared textual content in OSNs.http://www.sciencedirect.com/science/article/pii/S1319157818302751Demographic attributesOnline social networksMachine learningText categorizationText miningFeature extraction
collection DOAJ
language English
format Article
sources DOAJ
author Randa Benkhelifa
Fatima Zohra Laallam
spellingShingle Randa Benkhelifa
Fatima Zohra Laallam
Exploring demographic information in online social networks for improving content classification
Journal of King Saud University: Computer and Information Sciences
Demographic attributes
Online social networks
Machine learning
Text categorization
Text mining
Feature extraction
author_facet Randa Benkhelifa
Fatima Zohra Laallam
author_sort Randa Benkhelifa
title Exploring demographic information in online social networks for improving content classification
title_short Exploring demographic information in online social networks for improving content classification
title_full Exploring demographic information in online social networks for improving content classification
title_fullStr Exploring demographic information in online social networks for improving content classification
title_full_unstemmed Exploring demographic information in online social networks for improving content classification
title_sort exploring demographic information in online social networks for improving content classification
publisher Elsevier
series Journal of King Saud University: Computer and Information Sciences
issn 1319-1578
publishDate 2020-11-01
description The daily interaction between users within online social networks (OSNs) is an effective way to analyze and interpret its context in real time in order to capture the interests, preferences, and concerns of the OSNs users. These offer a unique information source for several applications in several fields such as trendsetting, future prediction, recommendation systems, community detection, and marketing. Most of the existing studies on text classification in OSNs rely on content based approach, in order to capture users interests through exploiting and categorizing the unstructured textual content shared by those users according to their topics. Moreover, users public profiles available on OSNs often reveal their demographic attributes such as age, gender, education, marital status, etc., which can play an essential role in identifying users interests and preferences. User demographic attributes can provide some preferences for some topics of interests. People with different demographic attributes may be interested in different topics, while people with similar demographic attributes may have the same interests. Usually, young people are more interested in technology than old people, who are more interested in the political news than young people. In this paper, we propose a demographic-content-based approach which uses both users demographic attributes and the textual content to classify OSNs posts using six classifiers ANN, k-NN, Naïve Bayes, Decision Tree, Decision rules and SVM. The experiments are done on a large Facebook dataset in order to analyze the effect of these demographic attributes on the performance of the categorization of the shared textual content in OSNs.
topic Demographic attributes
Online social networks
Machine learning
Text categorization
Text mining
Feature extraction
url http://www.sciencedirect.com/science/article/pii/S1319157818302751
work_keys_str_mv AT randabenkhelifa exploringdemographicinformationinonlinesocialnetworksforimprovingcontentclassification
AT fatimazohralaallam exploringdemographicinformationinonlinesocialnetworksforimprovingcontentclassification
_version_ 1724775097526386688