|
|
|
|
LEADER |
02326nam a2200325Ia 4500 |
001 |
10.1111-biom.13384 |
008 |
220427s2021 CNT 000 0 und d |
020 |
|
|
|a 0006341X (ISSN)
|
245 |
1 |
0 |
|a Poisson PCA: Poisson measurement error corrected PCA, with application to microbiome data
|
260 |
|
0 |
|b John Wiley and Sons Inc
|c 2021
|
856 |
|
|
|z View Fulltext in Publisher
|u https://doi.org/10.1111/biom.13384
|
520 |
3 |
|
|a In this paper, we study the problem of computing a principal component analysis of data affected by Poisson noise. We assume samples are drawn from independent Poisson distributions. We want to estimate principal components of a fixed transformation of the latent Poisson means. Our motivating example is microbiome data, though the methods apply to many other situations. We develop a semiparametric approach to correct the bias of variance estimators, both for untransformed and transformed (with particular attention to log-transformation) Poisson means. Furthermore, we incorporate methods for correcting different exposure or sequencing depth in the data. In addition to identifying the principal components, we also address the nontrivial problem of computing the principal scores in this semiparametric framework. Most previous approaches tend to take a more parametric line: for example, fitting a log-normal Poisson (PLN) model. We compare our method with the PLN approach and find that in many cases our method is better at identifying the main principal components of the latent log-transformed Poisson means, and as a further major advantage, takes far less time to compute. Comparing methods on real and simulated data, we see that our method also appears to be more robust to outliers than the parametric method. © 2020 The International Biometric Society
|
650 |
0 |
4 |
|a article
|
650 |
0 |
4 |
|a attention
|
650 |
0 |
4 |
|a error analysis
|
650 |
0 |
4 |
|a measurement error
|
650 |
0 |
4 |
|a measurement method
|
650 |
0 |
4 |
|a microbiome
|
650 |
0 |
4 |
|a microorganism
|
650 |
0 |
4 |
|a noise
|
650 |
0 |
4 |
|a nonhuman
|
650 |
0 |
4 |
|a Poisson distribution
|
650 |
0 |
4 |
|a Poisson ratio
|
650 |
0 |
4 |
|a principal component analysis
|
650 |
0 |
4 |
|a principal component analysis
|
650 |
0 |
4 |
|a simulation
|
700 |
1 |
|
|a Gu, H.
|e author
|
700 |
1 |
|
|a Huang, T.
|e author
|
700 |
1 |
|
|a Kenney, T.
|e author
|
773 |
|
|
|t Biometrics
|