Stochastic modeling of biological sequence evolution

Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2005. === This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. === Includes bibliographical ref...

Full description

Bibliographic Details
Main Author: Xu, Keyuan
Other Authors: George C. Verghese and Peter C. Doerschuk.
Format: Others
Language:English
Published: Massachusetts Institute of Technology 2006
Subjects:
Online Access:http://hdl.handle.net/1721.1/32113
id ndltd-MIT-oai-dspace.mit.edu-1721.1-32113
record_format oai_dc
spelling ndltd-MIT-oai-dspace.mit.edu-1721.1-321132019-05-02T16:25:51Z Stochastic modeling of biological sequence evolution Xu, Keyuan George C. Verghese and Peter C. Doerschuk. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2005. This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. Includes bibliographical references (leaves 81-86). Markov models of sequence evolution are a fundamental building block for making inferences in biological research. This thesis reviews several major techniques developed to estimate parameters of Markov models of sequence evolution and presents a new approach for evaluating and comparing estimation techniques. Current methods for evaluating estimation techniques require sequence data from populations with well-known phylogenetic relationships. Such data is not always available since phylogenetic relationships can never be known with certainty. We propose generating sequence data for the purpose of estimation technique evaluation by simulating sequence evolution in a controlled setting. Our elementary simulator uses a Markov model and a binary branching process, which dynamically builds a phylogenetic tree from an initial seed sequence. The sequences at the leaves of the tree can then be used as input to estimation techniques. We demonstrate our evaluation approach on Arvestad and Bruno's estimation method, and show how our approach can reveal performance variations empirically. The results of our simulation can be used as a guide towards improving estimation techniques. by Keyuan Xu. M.Eng. 2006-03-28T19:52:29Z 2006-03-28T19:52:29Z 2005 2005 Thesis http://hdl.handle.net/1721.1/32113 62558588 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 86 leaves 484054 bytes 479559 bytes application/pdf application/pdf application/pdf Massachusetts Institute of Technology
collection NDLTD
language English
format Others
sources NDLTD
topic Electrical Engineering and Computer Science.
spellingShingle Electrical Engineering and Computer Science.
Xu, Keyuan
Stochastic modeling of biological sequence evolution
description Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2005. === This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. === Includes bibliographical references (leaves 81-86). === Markov models of sequence evolution are a fundamental building block for making inferences in biological research. This thesis reviews several major techniques developed to estimate parameters of Markov models of sequence evolution and presents a new approach for evaluating and comparing estimation techniques. Current methods for evaluating estimation techniques require sequence data from populations with well-known phylogenetic relationships. Such data is not always available since phylogenetic relationships can never be known with certainty. We propose generating sequence data for the purpose of estimation technique evaluation by simulating sequence evolution in a controlled setting. Our elementary simulator uses a Markov model and a binary branching process, which dynamically builds a phylogenetic tree from an initial seed sequence. The sequences at the leaves of the tree can then be used as input to estimation techniques. We demonstrate our evaluation approach on Arvestad and Bruno's estimation method, and show how our approach can reveal performance variations empirically. The results of our simulation can be used as a guide towards improving estimation techniques. === by Keyuan Xu. === M.Eng.
author2 George C. Verghese and Peter C. Doerschuk.
author_facet George C. Verghese and Peter C. Doerschuk.
Xu, Keyuan
author Xu, Keyuan
author_sort Xu, Keyuan
title Stochastic modeling of biological sequence evolution
title_short Stochastic modeling of biological sequence evolution
title_full Stochastic modeling of biological sequence evolution
title_fullStr Stochastic modeling of biological sequence evolution
title_full_unstemmed Stochastic modeling of biological sequence evolution
title_sort stochastic modeling of biological sequence evolution
publisher Massachusetts Institute of Technology
publishDate 2006
url http://hdl.handle.net/1721.1/32113
work_keys_str_mv AT xukeyuan stochasticmodelingofbiologicalsequenceevolution
_version_ 1719040307946848256