Understanding Barriers to Novel Data Linkages: Topic Modeling of the Results of the LifeInfo Survey

BackgroundNovel consumer and lifestyle data, such as those collected by supermarket loyalty cards or mobile phone exercise tracking apps, offer numerous benefits for researchers seeking to understand diet- and exercise-related risk factors for diseases. However, limited resea...

Full description

Bibliographic Details
Main Authors: Clarke, Holly, Clark, Stephen, Birkin, Mark, Iles-Smith, Heather, Glaser, Adam, Morris, Michelle A
Format: Article
Language:English
Published: JMIR Publications 2021-05-01
Series:Journal of Medical Internet Research
Online Access:https://www.jmir.org/2021/5/e24236
id doaj-bba6b3addf9c42d09983ff8d273b6199
record_format Article
spelling doaj-bba6b3addf9c42d09983ff8d273b61992021-05-17T14:32:36ZengJMIR PublicationsJournal of Medical Internet Research1438-88712021-05-01235e2423610.2196/24236Understanding Barriers to Novel Data Linkages: Topic Modeling of the Results of the LifeInfo SurveyClarke, HollyClark, StephenBirkin, MarkIles-Smith, HeatherGlaser, AdamMorris, Michelle A BackgroundNovel consumer and lifestyle data, such as those collected by supermarket loyalty cards or mobile phone exercise tracking apps, offer numerous benefits for researchers seeking to understand diet- and exercise-related risk factors for diseases. However, limited research has addressed public attitudes toward linking these data with individual health records for research purposes. Data linkage, combining data from multiple sources, provides the opportunity to enhance preexisting data sets to gain new insights. ObjectiveThe aim of this study is to identify key barriers to data linkage and recommend safeguards and procedures that would encourage individuals to share such data for potential future research. MethodsThe LifeInfo Survey consulted the public on their attitudes toward sharing consumer and lifestyle data for research purposes. Where barriers to data sharing existed, participants provided unstructured survey responses detailing what would make them more likely to share data for linkage with their health records in the future. The topic modeling technique latent Dirichlet allocation was used to analyze these textual responses to uncover common thematic topics within the texts. ResultsParticipants provided responses related to sharing their store loyalty card data (n=2338) and health and fitness app data (n=1531). Key barriers to data sharing identified through topic modeling included data safety and security, personal privacy, requirements of further information, fear of data being accessed by others, problems with data accuracy, not understanding the reason for data linkage, and not using services that produce these data. We provide recommendations for addressing these issues to establish the best practice for future researchers interested in using these data. ConclusionsThis study formulates a large-scale consultation of public attitudes toward this kind of data linkage, which is an important first step in understanding and addressing barriers to participation in research using novel consumer and lifestyle data.https://www.jmir.org/2021/5/e24236
collection DOAJ
language English
format Article
sources DOAJ
author Clarke, Holly
Clark, Stephen
Birkin, Mark
Iles-Smith, Heather
Glaser, Adam
Morris, Michelle A
spellingShingle Clarke, Holly
Clark, Stephen
Birkin, Mark
Iles-Smith, Heather
Glaser, Adam
Morris, Michelle A
Understanding Barriers to Novel Data Linkages: Topic Modeling of the Results of the LifeInfo Survey
Journal of Medical Internet Research
author_facet Clarke, Holly
Clark, Stephen
Birkin, Mark
Iles-Smith, Heather
Glaser, Adam
Morris, Michelle A
author_sort Clarke, Holly
title Understanding Barriers to Novel Data Linkages: Topic Modeling of the Results of the LifeInfo Survey
title_short Understanding Barriers to Novel Data Linkages: Topic Modeling of the Results of the LifeInfo Survey
title_full Understanding Barriers to Novel Data Linkages: Topic Modeling of the Results of the LifeInfo Survey
title_fullStr Understanding Barriers to Novel Data Linkages: Topic Modeling of the Results of the LifeInfo Survey
title_full_unstemmed Understanding Barriers to Novel Data Linkages: Topic Modeling of the Results of the LifeInfo Survey
title_sort understanding barriers to novel data linkages: topic modeling of the results of the lifeinfo survey
publisher JMIR Publications
series Journal of Medical Internet Research
issn 1438-8871
publishDate 2021-05-01
description BackgroundNovel consumer and lifestyle data, such as those collected by supermarket loyalty cards or mobile phone exercise tracking apps, offer numerous benefits for researchers seeking to understand diet- and exercise-related risk factors for diseases. However, limited research has addressed public attitudes toward linking these data with individual health records for research purposes. Data linkage, combining data from multiple sources, provides the opportunity to enhance preexisting data sets to gain new insights. ObjectiveThe aim of this study is to identify key barriers to data linkage and recommend safeguards and procedures that would encourage individuals to share such data for potential future research. MethodsThe LifeInfo Survey consulted the public on their attitudes toward sharing consumer and lifestyle data for research purposes. Where barriers to data sharing existed, participants provided unstructured survey responses detailing what would make them more likely to share data for linkage with their health records in the future. The topic modeling technique latent Dirichlet allocation was used to analyze these textual responses to uncover common thematic topics within the texts. ResultsParticipants provided responses related to sharing their store loyalty card data (n=2338) and health and fitness app data (n=1531). Key barriers to data sharing identified through topic modeling included data safety and security, personal privacy, requirements of further information, fear of data being accessed by others, problems with data accuracy, not understanding the reason for data linkage, and not using services that produce these data. We provide recommendations for addressing these issues to establish the best practice for future researchers interested in using these data. ConclusionsThis study formulates a large-scale consultation of public attitudes toward this kind of data linkage, which is an important first step in understanding and addressing barriers to participation in research using novel consumer and lifestyle data.
url https://www.jmir.org/2021/5/e24236
work_keys_str_mv AT clarkeholly understandingbarrierstonoveldatalinkagestopicmodelingoftheresultsofthelifeinfosurvey
AT clarkstephen understandingbarrierstonoveldatalinkagestopicmodelingoftheresultsofthelifeinfosurvey
AT birkinmark understandingbarrierstonoveldatalinkagestopicmodelingoftheresultsofthelifeinfosurvey
AT ilessmithheather understandingbarrierstonoveldatalinkagestopicmodelingoftheresultsofthelifeinfosurvey
AT glaseradam understandingbarrierstonoveldatalinkagestopicmodelingoftheresultsofthelifeinfosurvey
AT morrismichellea understandingbarrierstonoveldatalinkagestopicmodelingoftheresultsofthelifeinfosurvey
_version_ 1721438327257694208