Development of a Supervised Multivariate Statistical Algorithm for Enhanced Interpretability of Multiblock Analysis.

In modern biological research, OMICs techniques, such as genomics, proteomics or metabolomics, are often employed to gain deep insights into metabolic regulations and biochemical perturbations in response to a specific research question. To gain complementary biologically relevant information, multi...

Full description

Bibliographic Details
Main Author: Petters, Patrik
Format: Others
Language:English
Published: Linköpings universitet, Matematiska institutionen 2017
Subjects:
PCA
PLS
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-138112
id ndltd-UPSALLA1-oai-DiVA.org-liu-138112
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-liu-1381122018-01-04T05:24:41ZDevelopment of a Supervised Multivariate Statistical Algorithm for Enhanced Interpretability of Multiblock Analysis.engUtveckling av en algoritm för förbättrad tolkningsbarhet av övervakad multivariat statistisk simultan analys av flera designmatriser.Petters, PatrikLinköpings universitet, Matematiska institutionenLinköpings universitet, Tekniska fakulteten2017PCAPLSsupervised multiblock analysiscommon and distinctive variationNatural SciencesNaturvetenskapIn modern biological research, OMICs techniques, such as genomics, proteomics or metabolomics, are often employed to gain deep insights into metabolic regulations and biochemical perturbations in response to a specific research question. To gain complementary biologically relevant information, multiOMICs, i.e., several different OMICs measurements on the same specimen, is becoming increasingly frequent. To be able to take full advantage of this complementarity, joint analysis of such multiOMICs data is necessary, but this is yet an underdeveloped area. In this thesis, a theoretical background is given on general component-based methods for dimensionality reduction such as PCA, PLS for single block analysis, and multiblock PLS for co-analysis of OMICs data. This is followed by a rotation of an unsupervised analysis method. The aim of this method is to divide dimensionality-reduced data in block-distinct and common variance partitions, using the DISCO-SCA approach. Finally, an algorithm for a similar rotation of a supervised (PLS) solution is presented using data available in the literature. To the best of our knowledge, this is the first time that such an approach for rotation of a supervised analysis in block-distinct and common partitions has been developed and tested.This newly developed DISCO-PLS algorithm clearly showed an increased potential for visualisation and interpretation of data, compared to standard PLS. This is shown bybiplots of observation scores and multiblock variable loadings. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-138112application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic PCA
PLS
supervised multiblock analysis
common and distinctive variation
Natural Sciences
Naturvetenskap
spellingShingle PCA
PLS
supervised multiblock analysis
common and distinctive variation
Natural Sciences
Naturvetenskap
Petters, Patrik
Development of a Supervised Multivariate Statistical Algorithm for Enhanced Interpretability of Multiblock Analysis.
description In modern biological research, OMICs techniques, such as genomics, proteomics or metabolomics, are often employed to gain deep insights into metabolic regulations and biochemical perturbations in response to a specific research question. To gain complementary biologically relevant information, multiOMICs, i.e., several different OMICs measurements on the same specimen, is becoming increasingly frequent. To be able to take full advantage of this complementarity, joint analysis of such multiOMICs data is necessary, but this is yet an underdeveloped area. In this thesis, a theoretical background is given on general component-based methods for dimensionality reduction such as PCA, PLS for single block analysis, and multiblock PLS for co-analysis of OMICs data. This is followed by a rotation of an unsupervised analysis method. The aim of this method is to divide dimensionality-reduced data in block-distinct and common variance partitions, using the DISCO-SCA approach. Finally, an algorithm for a similar rotation of a supervised (PLS) solution is presented using data available in the literature. To the best of our knowledge, this is the first time that such an approach for rotation of a supervised analysis in block-distinct and common partitions has been developed and tested.This newly developed DISCO-PLS algorithm clearly showed an increased potential for visualisation and interpretation of data, compared to standard PLS. This is shown bybiplots of observation scores and multiblock variable loadings.
author Petters, Patrik
author_facet Petters, Patrik
author_sort Petters, Patrik
title Development of a Supervised Multivariate Statistical Algorithm for Enhanced Interpretability of Multiblock Analysis.
title_short Development of a Supervised Multivariate Statistical Algorithm for Enhanced Interpretability of Multiblock Analysis.
title_full Development of a Supervised Multivariate Statistical Algorithm for Enhanced Interpretability of Multiblock Analysis.
title_fullStr Development of a Supervised Multivariate Statistical Algorithm for Enhanced Interpretability of Multiblock Analysis.
title_full_unstemmed Development of a Supervised Multivariate Statistical Algorithm for Enhanced Interpretability of Multiblock Analysis.
title_sort development of a supervised multivariate statistical algorithm for enhanced interpretability of multiblock analysis.
publisher Linköpings universitet, Matematiska institutionen
publishDate 2017
url http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-138112
work_keys_str_mv AT petterspatrik developmentofasupervisedmultivariatestatisticalalgorithmforenhancedinterpretabilityofmultiblockanalysis
AT petterspatrik utvecklingavenalgoritmforforbattradtolkningsbarhetavovervakadmultivariatstatistisksimultananalysavfleradesignmatriser
_version_ 1718580681513107456