Correlates of physical activity behavior in adults: a data mining approach
Abstract Purpose A data mining approach was applied to establish a multilevel hierarchy predicting physical activity (PA) behavior, and to methodologically identify the correlates of PA behavior. Methods Cross-sectional data from the population-based Northern Finland Birth Cohort 1966 study, collect...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2020-07-01
|
Series: | International Journal of Behavioral Nutrition and Physical Activity |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s12966-020-00996-7 |
id |
doaj-67b44416b3bf4011a2017a3b857db421 |
---|---|
record_format |
Article |
spelling |
doaj-67b44416b3bf4011a2017a3b857db4212020-11-25T03:09:18ZengBMCInternational Journal of Behavioral Nutrition and Physical Activity1479-58682020-07-0117111410.1186/s12966-020-00996-7Correlates of physical activity behavior in adults: a data mining approachVahid Farrahi0Maisa Niemelä1Mikko Kärmeniemi2Soile Puhakka3Maarit Kangas4Raija Korpelainen5Timo Jämsä6Research Unit of Medical Imaging, Physics and Technology, University of OuluResearch Unit of Medical Imaging, Physics and Technology, University of OuluMedical Research Center, Oulu University Hospital and University of OuluCenter for Life Course Health Research, University of OuluResearch Unit of Medical Imaging, Physics and Technology, University of OuluMedical Research Center, Oulu University Hospital and University of OuluResearch Unit of Medical Imaging, Physics and Technology, University of OuluAbstract Purpose A data mining approach was applied to establish a multilevel hierarchy predicting physical activity (PA) behavior, and to methodologically identify the correlates of PA behavior. Methods Cross-sectional data from the population-based Northern Finland Birth Cohort 1966 study, collected in the most recent follow-up at age 46, were used to create a hierarchy using the chi-square automatic interaction detection (CHAID) decision tree technique for predicting PA behavior. PA behavior is defined as active or inactive based on machine-learned activity profiles, which were previously created through a multidimensional (clustering) approach on continuous accelerometer-measured activity intensities in one week. The input variables (predictors) used for decision tree fitting consisted of individual, demographical, psychological, behavioral, environmental, and physical factors. Using generalized linear mixed models, we also analyzed how factors emerging from the model were associated with three PA metrics, including daily time (minutes per day) in sedentary (SED), light PA (LPA), and moderate-to-vigorous PA (MVPA), to assure the relative importance of methodologically identified factors. Results Of the 4582 participants with valid accelerometer data at the latest follow-up, 2701 and 1881 had active and inactive profiles, respectively. We used a total of 168 factors as input variables to classify these two PA behaviors. Out of these 168 factors, the decision tree selected 36 factors of different domains from which 54 subgroups of participants were formed. The emerging factors from the model explained minutes per day in SED, LPA, and/or MVPA, including body fat percentage (SED: B = 26.5, LPA: B = − 16.1, and MVPA: B = − 11.7), normalized heart rate recovery 60 s after exercise (SED: B = -16.1, LPA: B = 9.9, and MVPA: B = 9.6), average weekday total sitting time (SED: B = 34.1, LPA: B = -25.3, and MVPA: B = -5.8), and extravagance score (SED: B = 6.3 and LPA: B = − 3.7). Conclusions Using data mining, we established a data-driven model composed of 36 different factors of relative importance from empirical data. This model may be used to identify subgroups for multilevel intervention allocation and design. Additionally, this study methodologically discovered an extensive set of factors that can be a basis for additional hypothesis testing in PA correlates research.http://link.springer.com/article/10.1186/s12966-020-00996-7Decision treeCHAIDMultilevel modelPredictionClassification |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Vahid Farrahi Maisa Niemelä Mikko Kärmeniemi Soile Puhakka Maarit Kangas Raija Korpelainen Timo Jämsä |
spellingShingle |
Vahid Farrahi Maisa Niemelä Mikko Kärmeniemi Soile Puhakka Maarit Kangas Raija Korpelainen Timo Jämsä Correlates of physical activity behavior in adults: a data mining approach International Journal of Behavioral Nutrition and Physical Activity Decision tree CHAID Multilevel model Prediction Classification |
author_facet |
Vahid Farrahi Maisa Niemelä Mikko Kärmeniemi Soile Puhakka Maarit Kangas Raija Korpelainen Timo Jämsä |
author_sort |
Vahid Farrahi |
title |
Correlates of physical activity behavior in adults: a data mining approach |
title_short |
Correlates of physical activity behavior in adults: a data mining approach |
title_full |
Correlates of physical activity behavior in adults: a data mining approach |
title_fullStr |
Correlates of physical activity behavior in adults: a data mining approach |
title_full_unstemmed |
Correlates of physical activity behavior in adults: a data mining approach |
title_sort |
correlates of physical activity behavior in adults: a data mining approach |
publisher |
BMC |
series |
International Journal of Behavioral Nutrition and Physical Activity |
issn |
1479-5868 |
publishDate |
2020-07-01 |
description |
Abstract Purpose A data mining approach was applied to establish a multilevel hierarchy predicting physical activity (PA) behavior, and to methodologically identify the correlates of PA behavior. Methods Cross-sectional data from the population-based Northern Finland Birth Cohort 1966 study, collected in the most recent follow-up at age 46, were used to create a hierarchy using the chi-square automatic interaction detection (CHAID) decision tree technique for predicting PA behavior. PA behavior is defined as active or inactive based on machine-learned activity profiles, which were previously created through a multidimensional (clustering) approach on continuous accelerometer-measured activity intensities in one week. The input variables (predictors) used for decision tree fitting consisted of individual, demographical, psychological, behavioral, environmental, and physical factors. Using generalized linear mixed models, we also analyzed how factors emerging from the model were associated with three PA metrics, including daily time (minutes per day) in sedentary (SED), light PA (LPA), and moderate-to-vigorous PA (MVPA), to assure the relative importance of methodologically identified factors. Results Of the 4582 participants with valid accelerometer data at the latest follow-up, 2701 and 1881 had active and inactive profiles, respectively. We used a total of 168 factors as input variables to classify these two PA behaviors. Out of these 168 factors, the decision tree selected 36 factors of different domains from which 54 subgroups of participants were formed. The emerging factors from the model explained minutes per day in SED, LPA, and/or MVPA, including body fat percentage (SED: B = 26.5, LPA: B = − 16.1, and MVPA: B = − 11.7), normalized heart rate recovery 60 s after exercise (SED: B = -16.1, LPA: B = 9.9, and MVPA: B = 9.6), average weekday total sitting time (SED: B = 34.1, LPA: B = -25.3, and MVPA: B = -5.8), and extravagance score (SED: B = 6.3 and LPA: B = − 3.7). Conclusions Using data mining, we established a data-driven model composed of 36 different factors of relative importance from empirical data. This model may be used to identify subgroups for multilevel intervention allocation and design. Additionally, this study methodologically discovered an extensive set of factors that can be a basis for additional hypothesis testing in PA correlates research. |
topic |
Decision tree CHAID Multilevel model Prediction Classification |
url |
http://link.springer.com/article/10.1186/s12966-020-00996-7 |
work_keys_str_mv |
AT vahidfarrahi correlatesofphysicalactivitybehaviorinadultsadataminingapproach AT maisaniemela correlatesofphysicalactivitybehaviorinadultsadataminingapproach AT mikkokarmeniemi correlatesofphysicalactivitybehaviorinadultsadataminingapproach AT soilepuhakka correlatesofphysicalactivitybehaviorinadultsadataminingapproach AT maaritkangas correlatesofphysicalactivitybehaviorinadultsadataminingapproach AT raijakorpelainen correlatesofphysicalactivitybehaviorinadultsadataminingapproach AT timojamsa correlatesofphysicalactivitybehaviorinadultsadataminingapproach |
_version_ |
1724663475385401344 |