Organizational heterogeneity of vertebrate genomes.

Genomes of higher eukaryotes are mosaics of segments with various structural, functional, and evolutionary properties. The availability of whole-genome sequences allows the investigation of their structure as "texts" using different statistical and computational methods. One such method, r...

Full description

Bibliographic Details
Main Authors: Svetlana Frenkel, Valery Kirzhner, Abraham Korol
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2012-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3288070?pdf=render
id doaj-4209711d712044339e50a98e3df86852
record_format Article
spelling doaj-4209711d712044339e50a98e3df868522020-11-25T01:11:59ZengPublic Library of Science (PLoS)PLoS ONE1932-62032012-01-0172e3207610.1371/journal.pone.0032076Organizational heterogeneity of vertebrate genomes.Svetlana FrenkelValery KirzhnerAbraham KorolGenomes of higher eukaryotes are mosaics of segments with various structural, functional, and evolutionary properties. The availability of whole-genome sequences allows the investigation of their structure as "texts" using different statistical and computational methods. One such method, referred to as Compositional Spectra (CS) analysis, is based on scoring the occurrences of fixed-length oligonucleotides (k-mers) in the target DNA sequence. CS analysis allows generating species- or region-specific characteristics of the genome, regardless of their length and the presence of coding DNA. In this study, we consider the heterogeneity of vertebrate genomes as a joint effect of regional variation in sequence organization superimposed on the differences in nucleotide composition. We estimated compositional and organizational heterogeneity of genome and chromosome sequences separately and found that both heterogeneity types vary widely among genomes as well as among chromosomes in all investigated taxonomic groups. The high correspondence of heterogeneity scores obtained on three genome fractions, coding, repetitive, and the remaining part of the noncoding DNA (the genome dark matter--GDM) allows the assumption that CS-heterogeneity may have functional relevance to genome regulation. Of special interest for such interpretation is the fact that natural GDM sequences display the highest deviation from the corresponding reshuffled sequences.http://europepmc.org/articles/PMC3288070?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Svetlana Frenkel
Valery Kirzhner
Abraham Korol
spellingShingle Svetlana Frenkel
Valery Kirzhner
Abraham Korol
Organizational heterogeneity of vertebrate genomes.
PLoS ONE
author_facet Svetlana Frenkel
Valery Kirzhner
Abraham Korol
author_sort Svetlana Frenkel
title Organizational heterogeneity of vertebrate genomes.
title_short Organizational heterogeneity of vertebrate genomes.
title_full Organizational heterogeneity of vertebrate genomes.
title_fullStr Organizational heterogeneity of vertebrate genomes.
title_full_unstemmed Organizational heterogeneity of vertebrate genomes.
title_sort organizational heterogeneity of vertebrate genomes.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2012-01-01
description Genomes of higher eukaryotes are mosaics of segments with various structural, functional, and evolutionary properties. The availability of whole-genome sequences allows the investigation of their structure as "texts" using different statistical and computational methods. One such method, referred to as Compositional Spectra (CS) analysis, is based on scoring the occurrences of fixed-length oligonucleotides (k-mers) in the target DNA sequence. CS analysis allows generating species- or region-specific characteristics of the genome, regardless of their length and the presence of coding DNA. In this study, we consider the heterogeneity of vertebrate genomes as a joint effect of regional variation in sequence organization superimposed on the differences in nucleotide composition. We estimated compositional and organizational heterogeneity of genome and chromosome sequences separately and found that both heterogeneity types vary widely among genomes as well as among chromosomes in all investigated taxonomic groups. The high correspondence of heterogeneity scores obtained on three genome fractions, coding, repetitive, and the remaining part of the noncoding DNA (the genome dark matter--GDM) allows the assumption that CS-heterogeneity may have functional relevance to genome regulation. Of special interest for such interpretation is the fact that natural GDM sequences display the highest deviation from the corresponding reshuffled sequences.
url http://europepmc.org/articles/PMC3288070?pdf=render
work_keys_str_mv AT svetlanafrenkel organizationalheterogeneityofvertebrategenomes
AT valerykirzhner organizationalheterogeneityofvertebrategenomes
AT abrahamkorol organizationalheterogeneityofvertebrategenomes
_version_ 1725168340549238784