Homophily-Based Link Prediction in The Facebook Online Social Network: A Rough Sets Approach

Online social networks are highly dynamic and sparse. One of the main problems in analyzing these networks is the problem of predicting the existence of links between users on these networks: the link prediction problem. Many studies have been conducted to predict links using a variety of techniques...

Full description

Bibliographic Details
Main Authors: Elkabani Islam, Aboo Khachfeh Roa A.
Format: Article
Language:English
Published: De Gruyter 2015-12-01
Series:Journal of Intelligent Systems
Subjects:
Online Access:https://doi.org/10.1515/jisys-2014-0031
id doaj-61379d380fe84355ba592cde1f8bb42e
record_format Article
spelling doaj-61379d380fe84355ba592cde1f8bb42e2021-09-06T19:40:35ZengDe GruyterJournal of Intelligent Systems0334-18602191-026X2015-12-0124449150310.1515/jisys-2014-0031Homophily-Based Link Prediction in The Facebook Online Social Network: A Rough Sets ApproachElkabani IslamAboo Khachfeh Roa A.0Faculty of Science, Mathematics and Computer Science Department, Beirut Arab University, P.O. Box 11-5020 Riad El Solh 11072809, Beirut, LebanonOnline social networks are highly dynamic and sparse. One of the main problems in analyzing these networks is the problem of predicting the existence of links between users on these networks: the link prediction problem. Many studies have been conducted to predict links using a variety of techniques like the decision tree and the logistic regression approaches. In this work, we will illustrate the use of the rough set theory in predicting links over the Facebook social network based on homophilic features. Other supervised learning algorithms are also employed in our experiments and compared with the rough set classifier, such as naive Bayes, J48 decision tree, support vector machine, logistic regression, and multilayer perceptron neural network. Moreover, we studied the influence of the “common groups” and “common page likes” homophilic features on predicting friendship between users of Facebook, and also studied the effect of using the Jaccard coefficient in measuring the similarity between users’ homophilic attributes compared with using the overlap coefficient. We conducted our experiments on two different datasets obtained from the Facebook online social network, where users in each dataset live within the same geographical region. The results showed that the rough set classifier significantly outperformed the other classifiers in all experiments. The results also demonstrated that the common groups and the common page likes features have a significant influence on predicting the friendship between users of Facebook. Finally, the results revealed that using the overlap coefficient homophilic features provided better results than that of the Jaccard coefficient features.https://doi.org/10.1515/jisys-2014-0031link predictionhomophilyonline social networksrough set theory
collection DOAJ
language English
format Article
sources DOAJ
author Elkabani Islam
Aboo Khachfeh Roa A.
spellingShingle Elkabani Islam
Aboo Khachfeh Roa A.
Homophily-Based Link Prediction in The Facebook Online Social Network: A Rough Sets Approach
Journal of Intelligent Systems
link prediction
homophily
online social networks
rough set theory
author_facet Elkabani Islam
Aboo Khachfeh Roa A.
author_sort Elkabani Islam
title Homophily-Based Link Prediction in The Facebook Online Social Network: A Rough Sets Approach
title_short Homophily-Based Link Prediction in The Facebook Online Social Network: A Rough Sets Approach
title_full Homophily-Based Link Prediction in The Facebook Online Social Network: A Rough Sets Approach
title_fullStr Homophily-Based Link Prediction in The Facebook Online Social Network: A Rough Sets Approach
title_full_unstemmed Homophily-Based Link Prediction in The Facebook Online Social Network: A Rough Sets Approach
title_sort homophily-based link prediction in the facebook online social network: a rough sets approach
publisher De Gruyter
series Journal of Intelligent Systems
issn 0334-1860
2191-026X
publishDate 2015-12-01
description Online social networks are highly dynamic and sparse. One of the main problems in analyzing these networks is the problem of predicting the existence of links between users on these networks: the link prediction problem. Many studies have been conducted to predict links using a variety of techniques like the decision tree and the logistic regression approaches. In this work, we will illustrate the use of the rough set theory in predicting links over the Facebook social network based on homophilic features. Other supervised learning algorithms are also employed in our experiments and compared with the rough set classifier, such as naive Bayes, J48 decision tree, support vector machine, logistic regression, and multilayer perceptron neural network. Moreover, we studied the influence of the “common groups” and “common page likes” homophilic features on predicting friendship between users of Facebook, and also studied the effect of using the Jaccard coefficient in measuring the similarity between users’ homophilic attributes compared with using the overlap coefficient. We conducted our experiments on two different datasets obtained from the Facebook online social network, where users in each dataset live within the same geographical region. The results showed that the rough set classifier significantly outperformed the other classifiers in all experiments. The results also demonstrated that the common groups and the common page likes features have a significant influence on predicting the friendship between users of Facebook. Finally, the results revealed that using the overlap coefficient homophilic features provided better results than that of the Jaccard coefficient features.
topic link prediction
homophily
online social networks
rough set theory
url https://doi.org/10.1515/jisys-2014-0031
work_keys_str_mv AT elkabaniislam homophilybasedlinkpredictioninthefacebookonlinesocialnetworkaroughsetsapproach
AT abookhachfehroaa homophilybasedlinkpredictioninthefacebookonlinesocialnetworkaroughsetsapproach
_version_ 1717768133782011904