Data challenges of biomedical researchers in the age of omics

Background High-throughput technologies are rapidly generating large amounts of diverse omics data. Although this offers a great opportunity, it also poses great challenges as data analysis becomes more complex. The purpose of this study was to identify the main challenges researchers face in analyz...

Full description

Bibliographic Details
Main Authors: Rolando Garcia-Milian, Denise Hersey, Milica Vukmirovic, Fanny Duprilot
Format: Article
Language:English
Published: PeerJ Inc. 2018-09-01
Series:PeerJ
Subjects:
Online Access:https://peerj.com/articles/5553.pdf
id doaj-2e8c2977c3964d36ade2b825ff81ca04
record_format Article
spelling doaj-2e8c2977c3964d36ade2b825ff81ca042020-11-25T00:13:15ZengPeerJ Inc.PeerJ2167-83592018-09-016e555310.7717/peerj.5553Data challenges of biomedical researchers in the age of omicsRolando Garcia-Milian0Denise Hersey1Milica Vukmirovic2Fanny Duprilot3Bioinformatics Support Program, Research and Education Services, Cushing/Whitney Medical Library, Yale University, New Haven, CT, United States of AmericaScience Libraries, Lewis Science Library, Princeton University, Princeton, NJ, United States of AmericaPulmonary Critical Care & Sleep Medicine, Yale School of Medicine, Yale University, New Haven, CT, United States of AmericaService commun de la documentation, Université Denis Diderot (Paris VII), Paris, FranceBackground High-throughput technologies are rapidly generating large amounts of diverse omics data. Although this offers a great opportunity, it also poses great challenges as data analysis becomes more complex. The purpose of this study was to identify the main challenges researchers face in analyzing data, and how academic libraries can support them in this endeavor. Methods A multimodal needs assessment analysis combined an online survey sent to 860 Yale-affiliated researchers (176 responded) and 15 in-depth one-on-one semi-structured interviews. Interviews were recorded, transcribed, and analyzed using NVivo 10 software according to the thematic analysis approach. Results The survey response rate was 20%. Most respondents (78%) identified lack of adequate data analysis training (e.g., R, Python) as a main challenge, in addition to not having the proper database or software (54%) to expedite analysis. Two main themes emerged from the interviews: personnel and training needs. Researchers feel they could improve data analyses practices by having better access to the appropriate bioinformatics expertise, and/or training in data analyses tools. They also reported lack of time to acquire expertise in using bioinformatics tools and poor understanding of the resources available to facilitate analysis. Conclusions The main challenges identified by our study are: lack of adequate training for data analysis (including need to learn scripting language), need for more personnel at the University to provide data analysis and training, and inadequate communication between bioinformaticians and researchers. The authors identified the positive impact of medical and/or science libraries by establishing bioinformatics support to researchers.https://peerj.com/articles/5553.pdfInformation seeking behaviorData interpretationStatisticalGenomicsComputational biologySoftware
collection DOAJ
language English
format Article
sources DOAJ
author Rolando Garcia-Milian
Denise Hersey
Milica Vukmirovic
Fanny Duprilot
spellingShingle Rolando Garcia-Milian
Denise Hersey
Milica Vukmirovic
Fanny Duprilot
Data challenges of biomedical researchers in the age of omics
PeerJ
Information seeking behavior
Data interpretation
Statistical
Genomics
Computational biology
Software
author_facet Rolando Garcia-Milian
Denise Hersey
Milica Vukmirovic
Fanny Duprilot
author_sort Rolando Garcia-Milian
title Data challenges of biomedical researchers in the age of omics
title_short Data challenges of biomedical researchers in the age of omics
title_full Data challenges of biomedical researchers in the age of omics
title_fullStr Data challenges of biomedical researchers in the age of omics
title_full_unstemmed Data challenges of biomedical researchers in the age of omics
title_sort data challenges of biomedical researchers in the age of omics
publisher PeerJ Inc.
series PeerJ
issn 2167-8359
publishDate 2018-09-01
description Background High-throughput technologies are rapidly generating large amounts of diverse omics data. Although this offers a great opportunity, it also poses great challenges as data analysis becomes more complex. The purpose of this study was to identify the main challenges researchers face in analyzing data, and how academic libraries can support them in this endeavor. Methods A multimodal needs assessment analysis combined an online survey sent to 860 Yale-affiliated researchers (176 responded) and 15 in-depth one-on-one semi-structured interviews. Interviews were recorded, transcribed, and analyzed using NVivo 10 software according to the thematic analysis approach. Results The survey response rate was 20%. Most respondents (78%) identified lack of adequate data analysis training (e.g., R, Python) as a main challenge, in addition to not having the proper database or software (54%) to expedite analysis. Two main themes emerged from the interviews: personnel and training needs. Researchers feel they could improve data analyses practices by having better access to the appropriate bioinformatics expertise, and/or training in data analyses tools. They also reported lack of time to acquire expertise in using bioinformatics tools and poor understanding of the resources available to facilitate analysis. Conclusions The main challenges identified by our study are: lack of adequate training for data analysis (including need to learn scripting language), need for more personnel at the University to provide data analysis and training, and inadequate communication between bioinformaticians and researchers. The authors identified the positive impact of medical and/or science libraries by establishing bioinformatics support to researchers.
topic Information seeking behavior
Data interpretation
Statistical
Genomics
Computational biology
Software
url https://peerj.com/articles/5553.pdf
work_keys_str_mv AT rolandogarciamilian datachallengesofbiomedicalresearchersintheageofomics
AT denisehersey datachallengesofbiomedicalresearchersintheageofomics
AT milicavukmirovic datachallengesofbiomedicalresearchersintheageofomics
AT fannyduprilot datachallengesofbiomedicalresearchersintheageofomics
_version_ 1725395434347692032