Modelling dependencies in genetic-marker data and its application to haplotype analysis

The objective of this thesis is to develop new methods to reconstruct haplotypes from phase-unknown genotypes. The need for new methodologies is motivated by the increasing availability of high-resolution marker data for many species. Such markers typically exhibit Linkage Disequilibrium (LD). It is...

Full description

Bibliographic Details
Main Author: Schouten, Michael T.
Published: University of Edinburgh 2008
Subjects:
Online Access:http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.661639
id ndltd-bl.uk-oai-ethos.bl.uk-661639
record_format oai_dc
spelling ndltd-bl.uk-oai-ethos.bl.uk-6616392018-06-26T03:12:18ZModelling dependencies in genetic-marker data and its application to haplotype analysisSchouten, Michael T.2008The objective of this thesis is to develop new methods to reconstruct haplotypes from phase-unknown genotypes. The need for new methodologies is motivated by the increasing availability of high-resolution marker data for many species. Such markers typically exhibit Linkage Disequilibrium (LD). It is believed that reconstructed haplotypes for markers in high LD can be valuable for a variety of application areas in population genetics, including reconstructing population history and identifying genetic disease variants. The thesis begins with a critical assessment of the limitations of existing methods, and then presents a unified statistical framework that can accommodate pedigree data, unrelated individuals and tightly linked markers. The framework makes use of graphical models, where inference entails representing the relevant joint probability distribution as a graph and the using associated algorithms to facilitate computation. The graphical model formalism provides invaluable tools to facilitate model specification, visualization, and inference. Once the unified framework is developed, a broad range of simulation studies are conducted using previously published haplotype data. Important contributions include demonstrating the different ways in which the haplotype frequency distribution can impact the accuracy of both the phase assignments and haplotype frequency estimates; evaluating the effectiveness of using family data to improve accuracy for different frequency profiles; and, assessing the dangers of treating related individuals as unrelated in an association study.576.58University of Edinburghhttp://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.661639http://hdl.handle.net/1842/30729Electronic Thesis or Dissertation
collection NDLTD
sources NDLTD
topic 576.58
spellingShingle 576.58
Schouten, Michael T.
Modelling dependencies in genetic-marker data and its application to haplotype analysis
description The objective of this thesis is to develop new methods to reconstruct haplotypes from phase-unknown genotypes. The need for new methodologies is motivated by the increasing availability of high-resolution marker data for many species. Such markers typically exhibit Linkage Disequilibrium (LD). It is believed that reconstructed haplotypes for markers in high LD can be valuable for a variety of application areas in population genetics, including reconstructing population history and identifying genetic disease variants. The thesis begins with a critical assessment of the limitations of existing methods, and then presents a unified statistical framework that can accommodate pedigree data, unrelated individuals and tightly linked markers. The framework makes use of graphical models, where inference entails representing the relevant joint probability distribution as a graph and the using associated algorithms to facilitate computation. The graphical model formalism provides invaluable tools to facilitate model specification, visualization, and inference. Once the unified framework is developed, a broad range of simulation studies are conducted using previously published haplotype data. Important contributions include demonstrating the different ways in which the haplotype frequency distribution can impact the accuracy of both the phase assignments and haplotype frequency estimates; evaluating the effectiveness of using family data to improve accuracy for different frequency profiles; and, assessing the dangers of treating related individuals as unrelated in an association study.
author Schouten, Michael T.
author_facet Schouten, Michael T.
author_sort Schouten, Michael T.
title Modelling dependencies in genetic-marker data and its application to haplotype analysis
title_short Modelling dependencies in genetic-marker data and its application to haplotype analysis
title_full Modelling dependencies in genetic-marker data and its application to haplotype analysis
title_fullStr Modelling dependencies in genetic-marker data and its application to haplotype analysis
title_full_unstemmed Modelling dependencies in genetic-marker data and its application to haplotype analysis
title_sort modelling dependencies in genetic-marker data and its application to haplotype analysis
publisher University of Edinburgh
publishDate 2008
url http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.661639
work_keys_str_mv AT schoutenmichaelt modellingdependenciesingeneticmarkerdataanditsapplicationtohaplotypeanalysis
_version_ 1718707413617475584