Using Markov models and a stochastic Lipschitz condition for genetic analyses

A proper understanding of biological processes requires an understanding of genetics and evolutionary mechanisms. The vast amounts of genetical information that can routinely be extracted with modern technology have so far not been accompanied by an equally extended understanding of the correspondin...

Full description

Bibliographic Details
Main Author: Nettelblad, Carl
Format: Others
Language:English
Published: Uppsala universitet, Avdelningen för teknisk databehandling 2010
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-120295
id ndltd-UPSALLA1-oai-DiVA.org-uu-120295
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-uu-1202952018-01-13T05:11:52ZUsing Markov models and a stochastic Lipschitz condition for genetic analysesengNettelblad, CarlUppsala universitet, Avdelningen för teknisk databehandlingUppsala universitet, Tillämpad beräkningsvetenskap2010Computational MathematicsBeräkningsmatematikGeneticsGenetikSoftware EngineeringProgramvaruteknikA proper understanding of biological processes requires an understanding of genetics and evolutionary mechanisms. The vast amounts of genetical information that can routinely be extracted with modern technology have so far not been accompanied by an equally extended understanding of the corresponding processes. The relationship between a single gene and the resulting properties, phenotype of an individual is rarely clear. This thesis addresses several computational challenges regarding identifying and assessing the effects of quantitative trait loci (QTL), genomic positions where variation is affecting a trait. The genetic information available for each individual is rarely complete, meaning that the unknown variable of the genotype in the loci modelled also needs to be addressed. This thesis contains the presentation of new tools for employing the information that is available in a way that maximizes the information used, by using hidden Markov models (HMMs), resulting in a change in algorithm runtime complexity from exponential to log-linear, in terms of the number of markers. It also proposes the introduction of inferred haplotypes to further increase the power to assess these unknown variables for pedigrees of related genetically diverse individuals. Modelling consequences of partial genetic information are also treated. Furthermore, genes are not directly affecting traits, but are rather expressed in the environment of and in concordance with other genes. Therefore, significant interactions can be expected within genes, where some combination of genetic variation gives a pronounced, or even opposite, effect, compared to when occurring separately. This thesis addresses how to perform efficient scans for multiple interacting loci, as well as how to derive highly accurate empirical significance tests in these settings. This is done by analyzing the mathematical properties of the objective function describing the quality of model fits, and reformulating it through a simple transformation. Combined with the presented prototype of a problem-solving environment, these developments can make multi-dimensional searches for QTL routine, allowing the pursuit of new biological insight. eSSENCELicentiate thesis, comprehensive summaryinfo:eu-repo/semantics/masterThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-120295IT licentiate theses / Uppsala University, Department of Information Technology, 1404-5117 ; 2010-002application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic Computational Mathematics
Beräkningsmatematik
Genetics
Genetik
Software Engineering
Programvaruteknik
spellingShingle Computational Mathematics
Beräkningsmatematik
Genetics
Genetik
Software Engineering
Programvaruteknik
Nettelblad, Carl
Using Markov models and a stochastic Lipschitz condition for genetic analyses
description A proper understanding of biological processes requires an understanding of genetics and evolutionary mechanisms. The vast amounts of genetical information that can routinely be extracted with modern technology have so far not been accompanied by an equally extended understanding of the corresponding processes. The relationship between a single gene and the resulting properties, phenotype of an individual is rarely clear. This thesis addresses several computational challenges regarding identifying and assessing the effects of quantitative trait loci (QTL), genomic positions where variation is affecting a trait. The genetic information available for each individual is rarely complete, meaning that the unknown variable of the genotype in the loci modelled also needs to be addressed. This thesis contains the presentation of new tools for employing the information that is available in a way that maximizes the information used, by using hidden Markov models (HMMs), resulting in a change in algorithm runtime complexity from exponential to log-linear, in terms of the number of markers. It also proposes the introduction of inferred haplotypes to further increase the power to assess these unknown variables for pedigrees of related genetically diverse individuals. Modelling consequences of partial genetic information are also treated. Furthermore, genes are not directly affecting traits, but are rather expressed in the environment of and in concordance with other genes. Therefore, significant interactions can be expected within genes, where some combination of genetic variation gives a pronounced, or even opposite, effect, compared to when occurring separately. This thesis addresses how to perform efficient scans for multiple interacting loci, as well as how to derive highly accurate empirical significance tests in these settings. This is done by analyzing the mathematical properties of the objective function describing the quality of model fits, and reformulating it through a simple transformation. Combined with the presented prototype of a problem-solving environment, these developments can make multi-dimensional searches for QTL routine, allowing the pursuit of new biological insight. === eSSENCE
author Nettelblad, Carl
author_facet Nettelblad, Carl
author_sort Nettelblad, Carl
title Using Markov models and a stochastic Lipschitz condition for genetic analyses
title_short Using Markov models and a stochastic Lipschitz condition for genetic analyses
title_full Using Markov models and a stochastic Lipschitz condition for genetic analyses
title_fullStr Using Markov models and a stochastic Lipschitz condition for genetic analyses
title_full_unstemmed Using Markov models and a stochastic Lipschitz condition for genetic analyses
title_sort using markov models and a stochastic lipschitz condition for genetic analyses
publisher Uppsala universitet, Avdelningen för teknisk databehandling
publishDate 2010
url http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-120295
work_keys_str_mv AT nettelbladcarl usingmarkovmodelsandastochasticlipschitzconditionforgeneticanalyses
_version_ 1718607498813898752