Towards a Retrospective One-Class Oriented Approach To Parents Detection In Social Media

Social media is the source of data for different purposes: advertisement, social study, human recruiting. However, usually, we are limited to readily available, structured information: age, gender, education, occupation. We have to work with unstructured data such as texts related to a user if we wa...

Full description

Bibliographic Details
Main Authors: Alexander Egorov, Timur Sokhin, Nikolay Butakov
Format: Article
Language:English
Published: FRUCT 2020-09-01
Series:Proceedings of the XXth Conference of Open Innovations Association FRUCT
Subjects:
Online Access:https://www.fruct.org/publications/fruct27/files/Ego.pdf
id doaj-68a2a33992374de8b95a1803029889c3
record_format Article
spelling doaj-68a2a33992374de8b95a1803029889c32020-11-25T04:01:06ZengFRUCTProceedings of the XXth Conference of Open Innovations Association FRUCT2305-72542343-07372020-09-01271546010.23919/FRUCT49677.2020.9211021Towards a Retrospective One-Class Oriented Approach To Parents Detection In Social MediaAlexander Egorov0Timur Sokhin1Nikolay Butakov2ITMO University, RussiaITMO University, RussiaITMO university, RussiaSocial media is the source of data for different purposes: advertisement, social study, human recruiting. However, usually, we are limited to readily available, structured information: age, gender, education, occupation. We have to work with unstructured data such as texts related to a user if we want to extract more complex, implicit features. We show the case of complex user analysis in social media using textual data. The task we solve is detecting parents on social networks. Our approach works with content that is not generated by a user, but with the content, the user was interested in implicitly - the user liked, or explicitly - the user subscribed to a group, where the content was published. In this paper, we compare classification methods for the task of parents detection on social media. Using mentioned above user's likes and other information it is required to estimate chances if a user has got a child or children already or not. This task is an example of positive-unlabeled learning: data from social networks and media may contain explicit signals about users' parenthood but there is no ground to make a backward conclusion. It can be considered as a case of look-a-like modelling or in other words a one-class classification problem. We propose a retrospective approach that can exploit data from social media to allow building a binary classifier. We compare both these approaches and conclude that the retrospective approach albeit requiring more efforts to be implemented may yield better results. This approach may be useful in similar tasks having look-a-like problem statement.https://www.fruct.org/publications/fruct27/files/Ego.pdfsocial mediaclassificationone-class problemretrospective analysisparents detection
collection DOAJ
language English
format Article
sources DOAJ
author Alexander Egorov
Timur Sokhin
Nikolay Butakov
spellingShingle Alexander Egorov
Timur Sokhin
Nikolay Butakov
Towards a Retrospective One-Class Oriented Approach To Parents Detection In Social Media
Proceedings of the XXth Conference of Open Innovations Association FRUCT
social media
classification
one-class problem
retrospective analysis
parents detection
author_facet Alexander Egorov
Timur Sokhin
Nikolay Butakov
author_sort Alexander Egorov
title Towards a Retrospective One-Class Oriented Approach To Parents Detection In Social Media
title_short Towards a Retrospective One-Class Oriented Approach To Parents Detection In Social Media
title_full Towards a Retrospective One-Class Oriented Approach To Parents Detection In Social Media
title_fullStr Towards a Retrospective One-Class Oriented Approach To Parents Detection In Social Media
title_full_unstemmed Towards a Retrospective One-Class Oriented Approach To Parents Detection In Social Media
title_sort towards a retrospective one-class oriented approach to parents detection in social media
publisher FRUCT
series Proceedings of the XXth Conference of Open Innovations Association FRUCT
issn 2305-7254
2343-0737
publishDate 2020-09-01
description Social media is the source of data for different purposes: advertisement, social study, human recruiting. However, usually, we are limited to readily available, structured information: age, gender, education, occupation. We have to work with unstructured data such as texts related to a user if we want to extract more complex, implicit features. We show the case of complex user analysis in social media using textual data. The task we solve is detecting parents on social networks. Our approach works with content that is not generated by a user, but with the content, the user was interested in implicitly - the user liked, or explicitly - the user subscribed to a group, where the content was published. In this paper, we compare classification methods for the task of parents detection on social media. Using mentioned above user's likes and other information it is required to estimate chances if a user has got a child or children already or not. This task is an example of positive-unlabeled learning: data from social networks and media may contain explicit signals about users' parenthood but there is no ground to make a backward conclusion. It can be considered as a case of look-a-like modelling or in other words a one-class classification problem. We propose a retrospective approach that can exploit data from social media to allow building a binary classifier. We compare both these approaches and conclude that the retrospective approach albeit requiring more efforts to be implemented may yield better results. This approach may be useful in similar tasks having look-a-like problem statement.
topic social media
classification
one-class problem
retrospective analysis
parents detection
url https://www.fruct.org/publications/fruct27/files/Ego.pdf
work_keys_str_mv AT alexanderegorov towardsaretrospectiveoneclassorientedapproachtoparentsdetectioninsocialmedia
AT timursokhin towardsaretrospectiveoneclassorientedapproachtoparentsdetectioninsocialmedia
AT nikolaybutakov towardsaretrospectiveoneclassorientedapproachtoparentsdetectioninsocialmedia
_version_ 1724447713296121856