Exploring demographic information in online social networks for improving content classification
The daily interaction between users within online social networks (OSNs) is an effective way to analyze and interpret its context in real time in order to capture the interests, preferences, and concerns of the OSNs users. These offer a unique information source for several applications in several f...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2020-11-01
|
Series: | Journal of King Saud University: Computer and Information Sciences |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S1319157818302751 |
id |
doaj-29d7098478124cf79aebc0382a3a1a92 |
---|---|
record_format |
Article |
spelling |
doaj-29d7098478124cf79aebc0382a3a1a922020-11-25T02:42:08ZengElsevierJournal of King Saud University: Computer and Information Sciences1319-15782020-11-0132910341044Exploring demographic information in online social networks for improving content classificationRanda Benkhelifa0Fatima Zohra Laallam1Corresponding author.; Faculté des nouvelles technologies de l’information et de la communication, Laboratoire de l’intelligence artificielle et des technologies de l'information, Université Kasdi Merbah Ouargla, Ouargla 30 000, AlgeriaFaculté des nouvelles technologies de l’information et de la communication, Laboratoire de l’intelligence artificielle et des technologies de l'information, Université Kasdi Merbah Ouargla, Ouargla 30 000, AlgeriaThe daily interaction between users within online social networks (OSNs) is an effective way to analyze and interpret its context in real time in order to capture the interests, preferences, and concerns of the OSNs users. These offer a unique information source for several applications in several fields such as trendsetting, future prediction, recommendation systems, community detection, and marketing. Most of the existing studies on text classification in OSNs rely on content based approach, in order to capture users interests through exploiting and categorizing the unstructured textual content shared by those users according to their topics. Moreover, users public profiles available on OSNs often reveal their demographic attributes such as age, gender, education, marital status, etc., which can play an essential role in identifying users interests and preferences. User demographic attributes can provide some preferences for some topics of interests. People with different demographic attributes may be interested in different topics, while people with similar demographic attributes may have the same interests. Usually, young people are more interested in technology than old people, who are more interested in the political news than young people. In this paper, we propose a demographic-content-based approach which uses both users demographic attributes and the textual content to classify OSNs posts using six classifiers ANN, k-NN, Naïve Bayes, Decision Tree, Decision rules and SVM. The experiments are done on a large Facebook dataset in order to analyze the effect of these demographic attributes on the performance of the categorization of the shared textual content in OSNs.http://www.sciencedirect.com/science/article/pii/S1319157818302751Demographic attributesOnline social networksMachine learningText categorizationText miningFeature extraction |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Randa Benkhelifa Fatima Zohra Laallam |
spellingShingle |
Randa Benkhelifa Fatima Zohra Laallam Exploring demographic information in online social networks for improving content classification Journal of King Saud University: Computer and Information Sciences Demographic attributes Online social networks Machine learning Text categorization Text mining Feature extraction |
author_facet |
Randa Benkhelifa Fatima Zohra Laallam |
author_sort |
Randa Benkhelifa |
title |
Exploring demographic information in online social networks for improving content classification |
title_short |
Exploring demographic information in online social networks for improving content classification |
title_full |
Exploring demographic information in online social networks for improving content classification |
title_fullStr |
Exploring demographic information in online social networks for improving content classification |
title_full_unstemmed |
Exploring demographic information in online social networks for improving content classification |
title_sort |
exploring demographic information in online social networks for improving content classification |
publisher |
Elsevier |
series |
Journal of King Saud University: Computer and Information Sciences |
issn |
1319-1578 |
publishDate |
2020-11-01 |
description |
The daily interaction between users within online social networks (OSNs) is an effective way to analyze and interpret its context in real time in order to capture the interests, preferences, and concerns of the OSNs users. These offer a unique information source for several applications in several fields such as trendsetting, future prediction, recommendation systems, community detection, and marketing. Most of the existing studies on text classification in OSNs rely on content based approach, in order to capture users interests through exploiting and categorizing the unstructured textual content shared by those users according to their topics. Moreover, users public profiles available on OSNs often reveal their demographic attributes such as age, gender, education, marital status, etc., which can play an essential role in identifying users interests and preferences. User demographic attributes can provide some preferences for some topics of interests. People with different demographic attributes may be interested in different topics, while people with similar demographic attributes may have the same interests. Usually, young people are more interested in technology than old people, who are more interested in the political news than young people. In this paper, we propose a demographic-content-based approach which uses both users demographic attributes and the textual content to classify OSNs posts using six classifiers ANN, k-NN, Naïve Bayes, Decision Tree, Decision rules and SVM. The experiments are done on a large Facebook dataset in order to analyze the effect of these demographic attributes on the performance of the categorization of the shared textual content in OSNs. |
topic |
Demographic attributes Online social networks Machine learning Text categorization Text mining Feature extraction |
url |
http://www.sciencedirect.com/science/article/pii/S1319157818302751 |
work_keys_str_mv |
AT randabenkhelifa exploringdemographicinformationinonlinesocialnetworksforimprovingcontentclassification AT fatimazohralaallam exploringdemographicinformationinonlinesocialnetworksforimprovingcontentclassification |
_version_ |
1724775097526386688 |