Identification of significant features by the Global Mean Rank test.

With the introduction of omics-technologies such as transcriptomics and proteomics, numerous methods for the reliable identification of significantly regulated features (genes, proteins, etc.) have been developed. Experimental practice requires these tests to successfully deal with conditions such a...

Full description

Bibliographic Details
Main Authors: Martin Klammer, J Nikolaj Dybowski, Daniel Hoffmann, Christoph Schaab
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2014-01-01
Series:PLoS ONE
Online Access:https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/25119995/pdf/?tool=EBI
id doaj-68a9b404cbf947448e605a83457dba17
record_format Article
spelling doaj-68a9b404cbf947448e605a83457dba172021-03-04T09:08:23ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-0198e10450410.1371/journal.pone.0104504Identification of significant features by the Global Mean Rank test.Martin KlammerJ Nikolaj DybowskiDaniel HoffmannChristoph SchaabWith the introduction of omics-technologies such as transcriptomics and proteomics, numerous methods for the reliable identification of significantly regulated features (genes, proteins, etc.) have been developed. Experimental practice requires these tests to successfully deal with conditions such as small numbers of replicates, missing values, non-normally distributed expression levels, and non-identical distributions of features. With the MeanRank test we aimed at developing a test that performs robustly under these conditions, while favorably scaling with the number of replicates. The test proposed here is a global one-sample location test, which is based on the mean ranks across replicates, and internally estimates and controls the false discovery rate. Furthermore, missing data is accounted for without the need of imputation. In extensive simulations comparing MeanRank to other frequently used methods, we found that it performs well with small and large numbers of replicates, feature dependent variance between replicates, and variable regulation across features on simulation data and a recent two-color microarray spike-in dataset. The tests were then used to identify significant changes in the phosphoproteomes of cancer cells induced by the kinase inhibitors erlotinib and 3-MB-PP1 in two independently published mass spectrometry-based studies. MeanRank outperformed the other global rank-based methods applied in this study. Compared to the popular Significance Analysis of Microarrays and Linear Models for Microarray methods, MeanRank performed similar or better. Furthermore, MeanRank exhibits more consistent behavior regarding the degree of regulation and is robust against the choice of preprocessing methods. MeanRank does not require any imputation of missing values, is easy to understand, and yields results that are easy to interpret. The software implementing the algorithm is freely available for academic and commercial use.https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/25119995/pdf/?tool=EBI
collection DOAJ
language English
format Article
sources DOAJ
author Martin Klammer
J Nikolaj Dybowski
Daniel Hoffmann
Christoph Schaab
spellingShingle Martin Klammer
J Nikolaj Dybowski
Daniel Hoffmann
Christoph Schaab
Identification of significant features by the Global Mean Rank test.
PLoS ONE
author_facet Martin Klammer
J Nikolaj Dybowski
Daniel Hoffmann
Christoph Schaab
author_sort Martin Klammer
title Identification of significant features by the Global Mean Rank test.
title_short Identification of significant features by the Global Mean Rank test.
title_full Identification of significant features by the Global Mean Rank test.
title_fullStr Identification of significant features by the Global Mean Rank test.
title_full_unstemmed Identification of significant features by the Global Mean Rank test.
title_sort identification of significant features by the global mean rank test.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2014-01-01
description With the introduction of omics-technologies such as transcriptomics and proteomics, numerous methods for the reliable identification of significantly regulated features (genes, proteins, etc.) have been developed. Experimental practice requires these tests to successfully deal with conditions such as small numbers of replicates, missing values, non-normally distributed expression levels, and non-identical distributions of features. With the MeanRank test we aimed at developing a test that performs robustly under these conditions, while favorably scaling with the number of replicates. The test proposed here is a global one-sample location test, which is based on the mean ranks across replicates, and internally estimates and controls the false discovery rate. Furthermore, missing data is accounted for without the need of imputation. In extensive simulations comparing MeanRank to other frequently used methods, we found that it performs well with small and large numbers of replicates, feature dependent variance between replicates, and variable regulation across features on simulation data and a recent two-color microarray spike-in dataset. The tests were then used to identify significant changes in the phosphoproteomes of cancer cells induced by the kinase inhibitors erlotinib and 3-MB-PP1 in two independently published mass spectrometry-based studies. MeanRank outperformed the other global rank-based methods applied in this study. Compared to the popular Significance Analysis of Microarrays and Linear Models for Microarray methods, MeanRank performed similar or better. Furthermore, MeanRank exhibits more consistent behavior regarding the degree of regulation and is robust against the choice of preprocessing methods. MeanRank does not require any imputation of missing values, is easy to understand, and yields results that are easy to interpret. The software implementing the algorithm is freely available for academic and commercial use.
url https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/25119995/pdf/?tool=EBI
work_keys_str_mv AT martinklammer identificationofsignificantfeaturesbytheglobalmeanranktest
AT jnikolajdybowski identificationofsignificantfeaturesbytheglobalmeanranktest
AT danielhoffmann identificationofsignificantfeaturesbytheglobalmeanranktest
AT christophschaab identificationofsignificantfeaturesbytheglobalmeanranktest
_version_ 1714807437299548160