Flow cytometry bioinformatics.

Flow cytometry bioinformatics is the application of bioinformatics to flow cytometry data, which involves storing, retrieving, organizing, and analyzing flow cytometry data using extensive computational resources and tools. Flow cytometry bioinformatics requires extensive use of and contributes to t...

Full description

Bibliographic Details
Main Authors: Kieran O'Neill, Nima Aghaeepour, Josef Spidlen, Ryan Brinkman
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-01-01
Series:PLoS Computational Biology
Online Access:https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/24363631/pdf/?tool=EBI
id doaj-7c6c0d300a2c497185969712d9e4b989
record_format Article
spelling doaj-7c6c0d300a2c497185969712d9e4b9892021-04-21T15:09:14ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582013-01-01912e100336510.1371/journal.pcbi.1003365Flow cytometry bioinformatics.Kieran O'NeillNima AghaeepourJosef SpidlenRyan BrinkmanFlow cytometry bioinformatics is the application of bioinformatics to flow cytometry data, which involves storing, retrieving, organizing, and analyzing flow cytometry data using extensive computational resources and tools. Flow cytometry bioinformatics requires extensive use of and contributes to the development of techniques from computational statistics and machine learning. Flow cytometry and related methods allow the quantification of multiple independent biomarkers on large numbers of single cells. The rapid growth in the multidimensionality and throughput of flow cytometry data, particularly in the 2000s, has led to the creation of a variety of computational analysis methods, data standards, and public databases for the sharing of results. Computational methods exist to assist in the preprocessing of flow cytometry data, identifying cell populations within it, matching those cell populations across samples, and performing diagnosis and discovery using the results of previous steps. For preprocessing, this includes compensating for spectral overlap, transforming data onto scales conducive to visualization and analysis, assessing data for quality, and normalizing data across samples and experiments. For population identification, tools are available to aid traditional manual identification of populations in two-dimensional scatter plots (gating), to use dimensionality reduction to aid gating, and to find populations automatically in higher dimensional space in a variety of ways. It is also possible to characterize data in more comprehensive ways, such as the density-guided binary space partitioning technique known as probability binning, or by combinatorial gating. Finally, diagnosis using flow cytometry data can be aided by supervised learning techniques, and discovery of new cell types of biological importance by high-throughput statistical methods, as part of pipelines incorporating all of the aforementioned methods. Open standards, data, and software are also key parts of flow cytometry bioinformatics. Data standards include the widely adopted Flow Cytometry Standard (FCS) defining how data from cytometers should be stored, but also several new standards under development by the International Society for Advancement of Cytometry (ISAC) to aid in storing more detailed information about experimental design and analytical steps. Open data is slowly growing with the opening of the CytoBank database in 2010 and FlowRepository in 2012, both of which allow users to freely distribute their data, and the latter of which has been recommended as the preferred repository for MIFlowCyt-compliant data by ISAC. Open software is most widely available in the form of a suite of Bioconductor packages, but is also available for web execution on the GenePattern platform.https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/24363631/pdf/?tool=EBI
collection DOAJ
language English
format Article
sources DOAJ
author Kieran O'Neill
Nima Aghaeepour
Josef Spidlen
Ryan Brinkman
spellingShingle Kieran O'Neill
Nima Aghaeepour
Josef Spidlen
Ryan Brinkman
Flow cytometry bioinformatics.
PLoS Computational Biology
author_facet Kieran O'Neill
Nima Aghaeepour
Josef Spidlen
Ryan Brinkman
author_sort Kieran O'Neill
title Flow cytometry bioinformatics.
title_short Flow cytometry bioinformatics.
title_full Flow cytometry bioinformatics.
title_fullStr Flow cytometry bioinformatics.
title_full_unstemmed Flow cytometry bioinformatics.
title_sort flow cytometry bioinformatics.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2013-01-01
description Flow cytometry bioinformatics is the application of bioinformatics to flow cytometry data, which involves storing, retrieving, organizing, and analyzing flow cytometry data using extensive computational resources and tools. Flow cytometry bioinformatics requires extensive use of and contributes to the development of techniques from computational statistics and machine learning. Flow cytometry and related methods allow the quantification of multiple independent biomarkers on large numbers of single cells. The rapid growth in the multidimensionality and throughput of flow cytometry data, particularly in the 2000s, has led to the creation of a variety of computational analysis methods, data standards, and public databases for the sharing of results. Computational methods exist to assist in the preprocessing of flow cytometry data, identifying cell populations within it, matching those cell populations across samples, and performing diagnosis and discovery using the results of previous steps. For preprocessing, this includes compensating for spectral overlap, transforming data onto scales conducive to visualization and analysis, assessing data for quality, and normalizing data across samples and experiments. For population identification, tools are available to aid traditional manual identification of populations in two-dimensional scatter plots (gating), to use dimensionality reduction to aid gating, and to find populations automatically in higher dimensional space in a variety of ways. It is also possible to characterize data in more comprehensive ways, such as the density-guided binary space partitioning technique known as probability binning, or by combinatorial gating. Finally, diagnosis using flow cytometry data can be aided by supervised learning techniques, and discovery of new cell types of biological importance by high-throughput statistical methods, as part of pipelines incorporating all of the aforementioned methods. Open standards, data, and software are also key parts of flow cytometry bioinformatics. Data standards include the widely adopted Flow Cytometry Standard (FCS) defining how data from cytometers should be stored, but also several new standards under development by the International Society for Advancement of Cytometry (ISAC) to aid in storing more detailed information about experimental design and analytical steps. Open data is slowly growing with the opening of the CytoBank database in 2010 and FlowRepository in 2012, both of which allow users to freely distribute their data, and the latter of which has been recommended as the preferred repository for MIFlowCyt-compliant data by ISAC. Open software is most widely available in the form of a suite of Bioconductor packages, but is also available for web execution on the GenePattern platform.
url https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/24363631/pdf/?tool=EBI
work_keys_str_mv AT kieranoneill flowcytometrybioinformatics
AT nimaaghaeepour flowcytometrybioinformatics
AT josefspidlen flowcytometrybioinformatics
AT ryanbrinkman flowcytometrybioinformatics
_version_ 1714667921259626496