Batch adjustment by reference alignment (BARA): Improved prediction performance in biological test sets with batch effects.

Many biological data acquisition platforms suffer from inadvertent inclusion of biologically irrelevant variance in analyzed data, collectively termed batch effects. Batch effects can lead to difficulties in downstream analysis by lowering the power to detect biologically interesting differences and...

Full description

Bibliographic Details
Main Authors:	Robin Gradin, Malin Lindstedt, Henrik Johansson
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2019-01-01
Series:	PLoS ONE
Online Access:	https://doi.org/10.1371/journal.pone.0212669

id	doaj-1ee771e8db284f35879850e1c58eddf4
record_format	Article
spelling	doaj-1ee771e8db284f35879850e1c58eddf42021-03-03T20:51:59ZengPublic Library of Science (PLoS)PLoS ONE1932-62032019-01-01142e021266910.1371/journal.pone.0212669Batch adjustment by reference alignment (BARA): Improved prediction performance in biological test sets with batch effects.Robin GradinMalin LindstedtHenrik JohanssonMany biological data acquisition platforms suffer from inadvertent inclusion of biologically irrelevant variance in analyzed data, collectively termed batch effects. Batch effects can lead to difficulties in downstream analysis by lowering the power to detect biologically interesting differences and can in certain instances lead to false discoveries. They are especially troublesome in predictive modelling where samples in training sets and test sets are often completely correlated with batches. In this article, we present BARA, a normalization method for adjusting batch effects in predictive modelling. BARA utilizes a few reference samples to adjust for batch effects in a compressed data space spanned by the training set. We evaluate BARA using a collection of publicly available datasets and three different prediction models, and compare its performance to already existing methods developed for similar purposes. The results show that data normalized with BARA generates high and consistent prediction performances. Further, they suggest that BARA produces reliable performances independent of the examined classifiers. We therefore conclude that BARA has great potential to facilitate the development of predictive assays where test sets and training sets are correlated with batch.https://doi.org/10.1371/journal.pone.0212669
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Robin Gradin Malin Lindstedt Henrik Johansson
spellingShingle	Robin Gradin Malin Lindstedt Henrik Johansson Batch adjustment by reference alignment (BARA): Improved prediction performance in biological test sets with batch effects. PLoS ONE
author_facet	Robin Gradin Malin Lindstedt Henrik Johansson
author_sort	Robin Gradin
title	Batch adjustment by reference alignment (BARA): Improved prediction performance in biological test sets with batch effects.
title_short	Batch adjustment by reference alignment (BARA): Improved prediction performance in biological test sets with batch effects.
title_full	Batch adjustment by reference alignment (BARA): Improved prediction performance in biological test sets with batch effects.
title_fullStr	Batch adjustment by reference alignment (BARA): Improved prediction performance in biological test sets with batch effects.
title_full_unstemmed	Batch adjustment by reference alignment (BARA): Improved prediction performance in biological test sets with batch effects.
title_sort	batch adjustment by reference alignment (bara): improved prediction performance in biological test sets with batch effects.
publisher	Public Library of Science (PLoS)
series	PLoS ONE
issn	1932-6203
publishDate	2019-01-01
description	Many biological data acquisition platforms suffer from inadvertent inclusion of biologically irrelevant variance in analyzed data, collectively termed batch effects. Batch effects can lead to difficulties in downstream analysis by lowering the power to detect biologically interesting differences and can in certain instances lead to false discoveries. They are especially troublesome in predictive modelling where samples in training sets and test sets are often completely correlated with batches. In this article, we present BARA, a normalization method for adjusting batch effects in predictive modelling. BARA utilizes a few reference samples to adjust for batch effects in a compressed data space spanned by the training set. We evaluate BARA using a collection of publicly available datasets and three different prediction models, and compare its performance to already existing methods developed for similar purposes. The results show that data normalized with BARA generates high and consistent prediction performances. Further, they suggest that BARA produces reliable performances independent of the examined classifiers. We therefore conclude that BARA has great potential to facilitate the development of predictive assays where test sets and training sets are correlated with batch.
url	https://doi.org/10.1371/journal.pone.0212669
work_keys_str_mv	AT robingradin batchadjustmentbyreferencealignmentbaraimprovedpredictionperformanceinbiologicaltestsetswithbatcheffects AT malinlindstedt batchadjustmentbyreferencealignmentbaraimprovedpredictionperformanceinbiologicaltestsetswithbatcheffects AT henrikjohansson batchadjustmentbyreferencealignmentbaraimprovedpredictionperformanceinbiologicaltestsetswithbatcheffects
_version_	1714820099749183488

Batch adjustment by reference alignment (BARA): Improved prediction performance in biological test sets with batch effects.

Similar Items