A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets

In this paper we propose a web-based approach for quick visualization of big data from brain magnetic resonance imaging (MRI) scans using a combination of an automated image capture and processing system, nonlinear embedding, and interactive data visualization tools. We draw upon thousands of MRI sc...

Full description

Bibliographic Details
Main Authors: Sandeep R Panta, Runtang eWang, Jill eFries, Ravi eKalyanam, Nicole eSpeer, Margaret eKing, Marie eBanich, Kent eKiehl, Michael eMilham, Tor D Wager, Jessica A Turner, Sergey M Plis, Vince D Calhoun
Format: Article
Language:English
Published: Frontiers Media S.A. 2016-03-01
Series:Frontiers in Neuroinformatics
Subjects:
Online Access:http://journal.frontiersin.org/Journal/10.3389/fninf.2016.00009/full
id doaj-ae18ed043e744b1ebf441454040cce15
record_format Article
spelling doaj-ae18ed043e744b1ebf441454040cce152020-11-24T23:12:24ZengFrontiers Media S.A.Frontiers in Neuroinformatics1662-51962016-03-011010.3389/fninf.2016.00009181674A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data SetsSandeep R Panta0Runtang eWang1Jill eFries2Ravi eKalyanam3Nicole eSpeer4Margaret eKing5Marie eBanich6Kent eKiehl7Kent eKiehl8Michael eMilham9Tor D Wager10Jessica A Turner11Sergey M Plis12Sergey M Plis13Vince D Calhoun14Vince D Calhoun15Mind Research Network & LBERIMind Research Network & LBERIMind Research Network & LBERIMind Research Network & LBERIUniversity of Boulder ColoradoMind Research Network & LBERIUniversity of Boulder ColoradoMind Research Network & LBERIDept. of Psychology, University of New MexicoThe Child Mind Institute & The Nathan Kline InstituteUniversity of Boulder ColoradoDept. of Psychology, Georgia Tech UniversityMind Research Network & LBERIDept. of ECE, University of New MexicoMind Research Network & LBERIDept. of ECE, University of New MexicoIn this paper we propose a web-based approach for quick visualization of big data from brain magnetic resonance imaging (MRI) scans using a combination of an automated image capture and processing system, nonlinear embedding, and interactive data visualization tools. We draw upon thousands of MRI scans captured via the COllaborative Imaging and Neuroinformatics Suite (COINS). We then interface the output of several analysis pipelines based on structural and functional data to a t-distributed stochastic neighbor embedding (t-SNE) algorithm which reduces the number of dimensions for each scan in the input data set to two dimensions while preserving the local structure of data sets. Finally, we interactively display the output of this approach via a web-page, based on data driven documents (D3) JavaScript library. Two distinct approaches were used to visualize the data. In the first approach, we computed multiple quality control (QC) values from pre-processed data, which were used as inputs to the t-SNE algorithm. This approach helps in assessing the quality of each data set relative to others. In the second case, computed variables of interest (e.g. brain volume or voxel values from segmented gray matter images) were used as inputs to the t-SNE algorithm. This approach helps in identifying interesting patterns in the data sets. We demonstrate these approaches using multiple examples including 1) quality control measures calculated from phantom data over time, 2) quality control data from human functional MRI data across various studies, scanners, sites, 3) volumetric and density measures from human structural MRI data across various studies, scanners and sites. Results from (1) and (2) show the potential of our approach to combine t-SNE data reduction with interactive color coding of variables of interest to quickly identify visually unique clusters of data (i.e. data sets with poor QC, clustering of data by site) quickly. Results from (3) demonstrate interesting patterns of gray matter and volume, and evaluate how they map onto variables including scanners, age and gender. In sum, the proposed approach allows researchers to rapidly identify and extract meaningful information from big data sets. Such tools are becoming increasingly important as datasets grow larger.http://journal.frontiersin.org/Journal/10.3389/fninf.2016.00009/fullNeurosciencevisualizationbig datadata sharingmagnetic resonance imaging (MRI)t-SNE (t-Distributed Stochastic Neighbor Embedding)
collection DOAJ
language English
format Article
sources DOAJ
author Sandeep R Panta
Runtang eWang
Jill eFries
Ravi eKalyanam
Nicole eSpeer
Margaret eKing
Marie eBanich
Kent eKiehl
Kent eKiehl
Michael eMilham
Tor D Wager
Jessica A Turner
Sergey M Plis
Sergey M Plis
Vince D Calhoun
Vince D Calhoun
spellingShingle Sandeep R Panta
Runtang eWang
Jill eFries
Ravi eKalyanam
Nicole eSpeer
Margaret eKing
Marie eBanich
Kent eKiehl
Kent eKiehl
Michael eMilham
Tor D Wager
Jessica A Turner
Sergey M Plis
Sergey M Plis
Vince D Calhoun
Vince D Calhoun
A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets
Frontiers in Neuroinformatics
Neuroscience
visualization
big data
data sharing
magnetic resonance imaging (MRI)
t-SNE (t-Distributed Stochastic Neighbor Embedding)
author_facet Sandeep R Panta
Runtang eWang
Jill eFries
Ravi eKalyanam
Nicole eSpeer
Margaret eKing
Marie eBanich
Kent eKiehl
Kent eKiehl
Michael eMilham
Tor D Wager
Jessica A Turner
Sergey M Plis
Sergey M Plis
Vince D Calhoun
Vince D Calhoun
author_sort Sandeep R Panta
title A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets
title_short A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets
title_full A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets
title_fullStr A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets
title_full_unstemmed A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets
title_sort tool for interactive data visualization: application to over 10,000 brain imaging and phantom mri data sets
publisher Frontiers Media S.A.
series Frontiers in Neuroinformatics
issn 1662-5196
publishDate 2016-03-01
description In this paper we propose a web-based approach for quick visualization of big data from brain magnetic resonance imaging (MRI) scans using a combination of an automated image capture and processing system, nonlinear embedding, and interactive data visualization tools. We draw upon thousands of MRI scans captured via the COllaborative Imaging and Neuroinformatics Suite (COINS). We then interface the output of several analysis pipelines based on structural and functional data to a t-distributed stochastic neighbor embedding (t-SNE) algorithm which reduces the number of dimensions for each scan in the input data set to two dimensions while preserving the local structure of data sets. Finally, we interactively display the output of this approach via a web-page, based on data driven documents (D3) JavaScript library. Two distinct approaches were used to visualize the data. In the first approach, we computed multiple quality control (QC) values from pre-processed data, which were used as inputs to the t-SNE algorithm. This approach helps in assessing the quality of each data set relative to others. In the second case, computed variables of interest (e.g. brain volume or voxel values from segmented gray matter images) were used as inputs to the t-SNE algorithm. This approach helps in identifying interesting patterns in the data sets. We demonstrate these approaches using multiple examples including 1) quality control measures calculated from phantom data over time, 2) quality control data from human functional MRI data across various studies, scanners, sites, 3) volumetric and density measures from human structural MRI data across various studies, scanners and sites. Results from (1) and (2) show the potential of our approach to combine t-SNE data reduction with interactive color coding of variables of interest to quickly identify visually unique clusters of data (i.e. data sets with poor QC, clustering of data by site) quickly. Results from (3) demonstrate interesting patterns of gray matter and volume, and evaluate how they map onto variables including scanners, age and gender. In sum, the proposed approach allows researchers to rapidly identify and extract meaningful information from big data sets. Such tools are becoming increasingly important as datasets grow larger.
topic Neuroscience
visualization
big data
data sharing
magnetic resonance imaging (MRI)
t-SNE (t-Distributed Stochastic Neighbor Embedding)
url http://journal.frontiersin.org/Journal/10.3389/fninf.2016.00009/full
work_keys_str_mv AT sandeeprpanta atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT runtangewang atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT jillefries atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT raviekalyanam atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT nicoleespeer atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT margareteking atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT marieebanich atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT kentekiehl atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT kentekiehl atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT michaelemilham atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT tordwager atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT jessicaaturner atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT sergeymplis atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT sergeymplis atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT vincedcalhoun atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT vincedcalhoun atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT sandeeprpanta toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT runtangewang toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT jillefries toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT raviekalyanam toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT nicoleespeer toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT margareteking toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT marieebanich toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT kentekiehl toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT kentekiehl toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT michaelemilham toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT tordwager toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT jessicaaturner toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT sergeymplis toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT sergeymplis toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT vincedcalhoun toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT vincedcalhoun toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
_version_ 1725600812272451584