Model for Long-range Correlations in DNA Sequences

We address the problem of the DNA sequences developing a "dynamical" method based on the assumption that the statistical properties of DNA paths are determined by the joint action of two processes, one deterministic, with long-range correlations, and the other random and delta correlated....

Full description

Bibliographic Details
Main Author: Allegrini, Paolo
Other Authors: West, Bruce J.
Format: Others
Language:English
Published: University of North Texas 1996
Subjects:
DNA
Online Access:https://digital.library.unt.edu/ark:/67531/metadc279189/
id ndltd-unt.edu-info-ark-67531-metadc279189
record_format oai_dc
spelling ndltd-unt.edu-info-ark-67531-metadc2791892017-03-17T08:40:47Z Model for Long-range Correlations in DNA Sequences Allegrini, Paolo Nucleotide sequence. DNA evolution anomalous diffusion Copy Mistake Map (CMM) We address the problem of the DNA sequences developing a "dynamical" method based on the assumption that the statistical properties of DNA paths are determined by the joint action of two processes, one deterministic, with long-range correlations, and the other random and delta correlated. The generator of the deterministic evolution is a nonlinear map, belonging to a class of maps recently tailored to mimic the processes of weak chaos responsible for the birth of anomalous diffusion. It is assumed that the deterministic process corresponds to unknown biological rules which determine the DNA path, whereas the noise mimics the influence of an infinite-dimensional environment on the biological process under study. We prove that the resulting diffusion process, if the effect of the random process is neglected, is an a-stable Levy process with 1 < a < 2. We also show that, if the diffusion process is determined by the joint action of the deterministic and the random process, the correlation effects of the "deterministic dynamics" are cancelled on the short-range scale, but show up in the long-range one. We denote our prescription to generate statistical sequences as the Copying Mistake Map (CMM). We carry out our analysis of several DNA sequences, and of their CMM realizations, with a variety of techniques, and we especially focus on a method of regression to equilibrium, which we call the Onsager Analysis. With these techniques we establish the statistical equivalence of the real DNA sequences with their CMM realizations. We show that long-range correlations are present in exons as well as in introns, but are difficult to detect, since the exon "dynamics" is shown to be determined by theentaglement of three distinct and independent CMM's. Finally we study the validity of the stationary assumption in DNA sequences and we discuss a biological model for the short-range random process based on a folding mechanism of the nucleic acid in the cell nucleus. University of North Texas West, Bruce J. Grigolini, Paolo Deering, William D. Kowalski, Jacek M. Shanley, Mark Stephen 1996-12 Thesis or Dissertation xii, 128 leaves: ill. Text call-no: 379 N81d no.4378 local-cont-no: 1002726887-allegrini untcat: b2027050 https://digital.library.unt.edu/ark:/67531/metadc279189/ ark: ark:/67531/metadc279189 English Public Copyright Copyright is held by the author, unless otherwise noted. All rights reserved. Allegrini, Paolo
collection NDLTD
language English
format Others
sources NDLTD
topic Nucleotide sequence.
DNA
evolution
anomalous diffusion
Copy Mistake Map (CMM)
spellingShingle Nucleotide sequence.
DNA
evolution
anomalous diffusion
Copy Mistake Map (CMM)
Allegrini, Paolo
Model for Long-range Correlations in DNA Sequences
description We address the problem of the DNA sequences developing a "dynamical" method based on the assumption that the statistical properties of DNA paths are determined by the joint action of two processes, one deterministic, with long-range correlations, and the other random and delta correlated. The generator of the deterministic evolution is a nonlinear map, belonging to a class of maps recently tailored to mimic the processes of weak chaos responsible for the birth of anomalous diffusion. It is assumed that the deterministic process corresponds to unknown biological rules which determine the DNA path, whereas the noise mimics the influence of an infinite-dimensional environment on the biological process under study. We prove that the resulting diffusion process, if the effect of the random process is neglected, is an a-stable Levy process with 1 < a < 2. We also show that, if the diffusion process is determined by the joint action of the deterministic and the random process, the correlation effects of the "deterministic dynamics" are cancelled on the short-range scale, but show up in the long-range one. We denote our prescription to generate statistical sequences as the Copying Mistake Map (CMM). We carry out our analysis of several DNA sequences, and of their CMM realizations, with a variety of techniques, and we especially focus on a method of regression to equilibrium, which we call the Onsager Analysis. With these techniques we establish the statistical equivalence of the real DNA sequences with their CMM realizations. We show that long-range correlations are present in exons as well as in introns, but are difficult to detect, since the exon "dynamics" is shown to be determined by theentaglement of three distinct and independent CMM's. Finally we study the validity of the stationary assumption in DNA sequences and we discuss a biological model for the short-range random process based on a folding mechanism of the nucleic acid in the cell nucleus.
author2 West, Bruce J.
author_facet West, Bruce J.
Allegrini, Paolo
author Allegrini, Paolo
author_sort Allegrini, Paolo
title Model for Long-range Correlations in DNA Sequences
title_short Model for Long-range Correlations in DNA Sequences
title_full Model for Long-range Correlations in DNA Sequences
title_fullStr Model for Long-range Correlations in DNA Sequences
title_full_unstemmed Model for Long-range Correlations in DNA Sequences
title_sort model for long-range correlations in dna sequences
publisher University of North Texas
publishDate 1996
url https://digital.library.unt.edu/ark:/67531/metadc279189/
work_keys_str_mv AT allegrinipaolo modelforlongrangecorrelationsindnasequences
_version_ 1718431947663867904