Applying Multivariate Segmentation Methods to Human Activity Recognition From Wearable Sensors’ Data

BackgroundTime-resolved quantification of physical activity can contribute to both personalized medicine and epidemiological research studies, for example, managing and identifying triggers of asthma exacerbations. A growing number of reportedly accurate machine learning algo...

Full description

Bibliographic Details
Main Authors: Li, Kenan, Habre, Rima, Deng, Huiyu, Urman, Robert, Morrison, John, Gilliland, Frank D, Ambite, José Luis, Stripelis, Dimitris, Chiang, Yao-Yi, Lin, Yijun, Bui, Alex AT, King, Christine, Hosseini, Anahita, Vliet, Eleanne Van, Sarrafzadeh, Majid, Eckel, Sandrah P
Format: Article
Language:English
Published: JMIR Publications 2019-02-01
Series:JMIR mHealth and uHealth
Online Access:http://mhealth.jmir.org/2019/2/e11201/
id doaj-a47035c922a94672a72ed6632c7de107
record_format Article
spelling doaj-a47035c922a94672a72ed6632c7de1072021-05-03T04:33:24ZengJMIR PublicationsJMIR mHealth and uHealth2291-52222019-02-0172e1120110.2196/11201Applying Multivariate Segmentation Methods to Human Activity Recognition From Wearable Sensors’ DataLi, KenanHabre, RimaDeng, HuiyuUrman, RobertMorrison, JohnGilliland, Frank DAmbite, José LuisStripelis, DimitrisChiang, Yao-YiLin, YijunBui, Alex ATKing, ChristineHosseini, AnahitaVliet, Eleanne VanSarrafzadeh, MajidEckel, Sandrah P BackgroundTime-resolved quantification of physical activity can contribute to both personalized medicine and epidemiological research studies, for example, managing and identifying triggers of asthma exacerbations. A growing number of reportedly accurate machine learning algorithms for human activity recognition (HAR) have been developed using data from wearable devices (eg, smartwatch and smartphone). However, many HAR algorithms depend on fixed-size sampling windows that may poorly adapt to real-world conditions in which activity bouts are of unequal duration. A small sliding window can produce noisy predictions under stable conditions, whereas a large sliding window may miss brief bursts of intense activity. ObjectiveWe aimed to create an HAR framework adapted to variable duration activity bouts by (1) detecting the change points of activity bouts in a multivariate time series and (2) predicting activity for each homogeneous window defined by these change points. MethodsWe applied standard fixed-width sliding windows (4-6 different sizes) or greedy Gaussian segmentation (GGS) to identify break points in filtered triaxial accelerometer and gyroscope data. After standard feature engineering, we applied an Xgboost model to predict physical activity within each window and then converted windowed predictions to instantaneous predictions to facilitate comparison across segmentation methods. We applied these methods in 2 datasets: the human activity recognition using smartphones (HARuS) dataset where a total of 30 adults performed activities of approximately equal duration (approximately 20 seconds each) while wearing a waist-worn smartphone, and the Biomedical REAl-Time Health Evaluation for Pediatric Asthma (BREATHE) dataset where a total of 14 children performed 6 activities for approximately 10 min each while wearing a smartwatch. To mimic a real-world scenario, we generated artificial unequal activity bout durations in the BREATHE data by randomly subdividing each activity bout into 10 segments and randomly concatenating the 60 activity bouts. Each dataset was divided into ~90% training and ~10% holdout testing. ResultsIn the HARuS data, GGS produced the least noisy predictions of 6 physical activities and had the second highest accuracy rate of 91.06% (the highest accuracy rate was 91.79% for the sliding window of size 0.8 second). In the BREATHE data, GGS again produced the least noisy predictions and had the highest accuracy rate of 79.4% of predictions for 6 physical activities. ConclusionsIn a scenario with variable duration activity bouts, GGS multivariate segmentation produced smart-sized windows with more stable predictions and a higher accuracy rate than traditional fixed-size sliding window approaches. Overall, accuracy was good in both datasets but, as expected, it was slightly lower in the more real-world study using wrist-worn smartwatches in children (BREATHE) than in the more tightly controlled study using waist-worn smartphones in adults (HARuS). We implemented GGS in an offline setting, but it could be adapted for real-time prediction with streaming data.http://mhealth.jmir.org/2019/2/e11201/
collection DOAJ
language English
format Article
sources DOAJ
author Li, Kenan
Habre, Rima
Deng, Huiyu
Urman, Robert
Morrison, John
Gilliland, Frank D
Ambite, José Luis
Stripelis, Dimitris
Chiang, Yao-Yi
Lin, Yijun
Bui, Alex AT
King, Christine
Hosseini, Anahita
Vliet, Eleanne Van
Sarrafzadeh, Majid
Eckel, Sandrah P
spellingShingle Li, Kenan
Habre, Rima
Deng, Huiyu
Urman, Robert
Morrison, John
Gilliland, Frank D
Ambite, José Luis
Stripelis, Dimitris
Chiang, Yao-Yi
Lin, Yijun
Bui, Alex AT
King, Christine
Hosseini, Anahita
Vliet, Eleanne Van
Sarrafzadeh, Majid
Eckel, Sandrah P
Applying Multivariate Segmentation Methods to Human Activity Recognition From Wearable Sensors’ Data
JMIR mHealth and uHealth
author_facet Li, Kenan
Habre, Rima
Deng, Huiyu
Urman, Robert
Morrison, John
Gilliland, Frank D
Ambite, José Luis
Stripelis, Dimitris
Chiang, Yao-Yi
Lin, Yijun
Bui, Alex AT
King, Christine
Hosseini, Anahita
Vliet, Eleanne Van
Sarrafzadeh, Majid
Eckel, Sandrah P
author_sort Li, Kenan
title Applying Multivariate Segmentation Methods to Human Activity Recognition From Wearable Sensors’ Data
title_short Applying Multivariate Segmentation Methods to Human Activity Recognition From Wearable Sensors’ Data
title_full Applying Multivariate Segmentation Methods to Human Activity Recognition From Wearable Sensors’ Data
title_fullStr Applying Multivariate Segmentation Methods to Human Activity Recognition From Wearable Sensors’ Data
title_full_unstemmed Applying Multivariate Segmentation Methods to Human Activity Recognition From Wearable Sensors’ Data
title_sort applying multivariate segmentation methods to human activity recognition from wearable sensors’ data
publisher JMIR Publications
series JMIR mHealth and uHealth
issn 2291-5222
publishDate 2019-02-01
description BackgroundTime-resolved quantification of physical activity can contribute to both personalized medicine and epidemiological research studies, for example, managing and identifying triggers of asthma exacerbations. A growing number of reportedly accurate machine learning algorithms for human activity recognition (HAR) have been developed using data from wearable devices (eg, smartwatch and smartphone). However, many HAR algorithms depend on fixed-size sampling windows that may poorly adapt to real-world conditions in which activity bouts are of unequal duration. A small sliding window can produce noisy predictions under stable conditions, whereas a large sliding window may miss brief bursts of intense activity. ObjectiveWe aimed to create an HAR framework adapted to variable duration activity bouts by (1) detecting the change points of activity bouts in a multivariate time series and (2) predicting activity for each homogeneous window defined by these change points. MethodsWe applied standard fixed-width sliding windows (4-6 different sizes) or greedy Gaussian segmentation (GGS) to identify break points in filtered triaxial accelerometer and gyroscope data. After standard feature engineering, we applied an Xgboost model to predict physical activity within each window and then converted windowed predictions to instantaneous predictions to facilitate comparison across segmentation methods. We applied these methods in 2 datasets: the human activity recognition using smartphones (HARuS) dataset where a total of 30 adults performed activities of approximately equal duration (approximately 20 seconds each) while wearing a waist-worn smartphone, and the Biomedical REAl-Time Health Evaluation for Pediatric Asthma (BREATHE) dataset where a total of 14 children performed 6 activities for approximately 10 min each while wearing a smartwatch. To mimic a real-world scenario, we generated artificial unequal activity bout durations in the BREATHE data by randomly subdividing each activity bout into 10 segments and randomly concatenating the 60 activity bouts. Each dataset was divided into ~90% training and ~10% holdout testing. ResultsIn the HARuS data, GGS produced the least noisy predictions of 6 physical activities and had the second highest accuracy rate of 91.06% (the highest accuracy rate was 91.79% for the sliding window of size 0.8 second). In the BREATHE data, GGS again produced the least noisy predictions and had the highest accuracy rate of 79.4% of predictions for 6 physical activities. ConclusionsIn a scenario with variable duration activity bouts, GGS multivariate segmentation produced smart-sized windows with more stable predictions and a higher accuracy rate than traditional fixed-size sliding window approaches. Overall, accuracy was good in both datasets but, as expected, it was slightly lower in the more real-world study using wrist-worn smartwatches in children (BREATHE) than in the more tightly controlled study using waist-worn smartphones in adults (HARuS). We implemented GGS in an offline setting, but it could be adapted for real-time prediction with streaming data.
url http://mhealth.jmir.org/2019/2/e11201/
work_keys_str_mv AT likenan applyingmultivariatesegmentationmethodstohumanactivityrecognitionfromwearablesensorsdata
AT habrerima applyingmultivariatesegmentationmethodstohumanactivityrecognitionfromwearablesensorsdata
AT denghuiyu applyingmultivariatesegmentationmethodstohumanactivityrecognitionfromwearablesensorsdata
AT urmanrobert applyingmultivariatesegmentationmethodstohumanactivityrecognitionfromwearablesensorsdata
AT morrisonjohn applyingmultivariatesegmentationmethodstohumanactivityrecognitionfromwearablesensorsdata
AT gillilandfrankd applyingmultivariatesegmentationmethodstohumanactivityrecognitionfromwearablesensorsdata
AT ambitejoseluis applyingmultivariatesegmentationmethodstohumanactivityrecognitionfromwearablesensorsdata
AT stripelisdimitris applyingmultivariatesegmentationmethodstohumanactivityrecognitionfromwearablesensorsdata
AT chiangyaoyi applyingmultivariatesegmentationmethodstohumanactivityrecognitionfromwearablesensorsdata
AT linyijun applyingmultivariatesegmentationmethodstohumanactivityrecognitionfromwearablesensorsdata
AT buialexat applyingmultivariatesegmentationmethodstohumanactivityrecognitionfromwearablesensorsdata
AT kingchristine applyingmultivariatesegmentationmethodstohumanactivityrecognitionfromwearablesensorsdata
AT hosseinianahita applyingmultivariatesegmentationmethodstohumanactivityrecognitionfromwearablesensorsdata
AT vlieteleannevan applyingmultivariatesegmentationmethodstohumanactivityrecognitionfromwearablesensorsdata
AT sarrafzadehmajid applyingmultivariatesegmentationmethodstohumanactivityrecognitionfromwearablesensorsdata
AT eckelsandrahp applyingmultivariatesegmentationmethodstohumanactivityrecognitionfromwearablesensorsdata
_version_ 1721484135232438272