A data driven methodology for social science research with left-behind children as a case study.

For decades, traditional correlation analysis and regression models have been used in social science research. However, the development of machine learning algorithms makes it possible to apply machine learning techniques for social science research and social issues, which may outperform standard r...

Full description

Bibliographic Details
Main Authors: Chao Wu, Guolong Wang, Simon Hu, Yue Liu, Hong Mi, Ye Zhou, Yi-Ke Guo, Tongtong Song
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2020-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0242483
id doaj-1b4a7320ace54fff8ac424d66ab3774c
record_format Article
spelling doaj-1b4a7320ace54fff8ac424d66ab3774c2021-03-04T12:28:07ZengPublic Library of Science (PLoS)PLoS ONE1932-62032020-01-011511e024248310.1371/journal.pone.0242483A data driven methodology for social science research with left-behind children as a case study.Chao WuGuolong WangSimon HuYue LiuHong MiYe ZhouYi-Ke GuoTongtong SongFor decades, traditional correlation analysis and regression models have been used in social science research. However, the development of machine learning algorithms makes it possible to apply machine learning techniques for social science research and social issues, which may outperform standard regression methods in some cases. Under the circumstances, this article proposes a methodological workflow for data analysis by machine learning techniques that have the possibility to be widely applied in social issues. Specifically, the workflow tries to uncover the natural mechanisms behind the social issues through a data-driven perspective from feature selection to model building. The advantage of data-driven techniques in feature selection is that the workflow can be built without so much restriction of related knowledge and theory in social science. The advantage of using machine learning techniques in modelling is to uncover non-linear and complex relationships behind social issues. The main purpose of our methodological workflow is to find important fields relevant to the target and provide appropriate predictions. However, to explain the result still needs theory and knowledge from social science. In this paper, we trained a methodological workflow with left-behind children as the social issue case, and all steps and full results are included.https://doi.org/10.1371/journal.pone.0242483
collection DOAJ
language English
format Article
sources DOAJ
author Chao Wu
Guolong Wang
Simon Hu
Yue Liu
Hong Mi
Ye Zhou
Yi-Ke Guo
Tongtong Song
spellingShingle Chao Wu
Guolong Wang
Simon Hu
Yue Liu
Hong Mi
Ye Zhou
Yi-Ke Guo
Tongtong Song
A data driven methodology for social science research with left-behind children as a case study.
PLoS ONE
author_facet Chao Wu
Guolong Wang
Simon Hu
Yue Liu
Hong Mi
Ye Zhou
Yi-Ke Guo
Tongtong Song
author_sort Chao Wu
title A data driven methodology for social science research with left-behind children as a case study.
title_short A data driven methodology for social science research with left-behind children as a case study.
title_full A data driven methodology for social science research with left-behind children as a case study.
title_fullStr A data driven methodology for social science research with left-behind children as a case study.
title_full_unstemmed A data driven methodology for social science research with left-behind children as a case study.
title_sort data driven methodology for social science research with left-behind children as a case study.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2020-01-01
description For decades, traditional correlation analysis and regression models have been used in social science research. However, the development of machine learning algorithms makes it possible to apply machine learning techniques for social science research and social issues, which may outperform standard regression methods in some cases. Under the circumstances, this article proposes a methodological workflow for data analysis by machine learning techniques that have the possibility to be widely applied in social issues. Specifically, the workflow tries to uncover the natural mechanisms behind the social issues through a data-driven perspective from feature selection to model building. The advantage of data-driven techniques in feature selection is that the workflow can be built without so much restriction of related knowledge and theory in social science. The advantage of using machine learning techniques in modelling is to uncover non-linear and complex relationships behind social issues. The main purpose of our methodological workflow is to find important fields relevant to the target and provide appropriate predictions. However, to explain the result still needs theory and knowledge from social science. In this paper, we trained a methodological workflow with left-behind children as the social issue case, and all steps and full results are included.
url https://doi.org/10.1371/journal.pone.0242483
work_keys_str_mv AT chaowu adatadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT guolongwang adatadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT simonhu adatadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT yueliu adatadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT hongmi adatadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT yezhou adatadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT yikeguo adatadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT tongtongsong adatadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT chaowu datadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT guolongwang datadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT simonhu datadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT yueliu datadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT hongmi datadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT yezhou datadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT yikeguo datadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT tongtongsong datadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
_version_ 1714802584848433152