Organizational heterogeneity of vertebrate genomes.
Genomes of higher eukaryotes are mosaics of segments with various structural, functional, and evolutionary properties. The availability of whole-genome sequences allows the investigation of their structure as "texts" using different statistical and computational methods. One such method, r...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2012-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC3288070?pdf=render |
id |
doaj-4209711d712044339e50a98e3df86852 |
---|---|
record_format |
Article |
spelling |
doaj-4209711d712044339e50a98e3df868522020-11-25T01:11:59ZengPublic Library of Science (PLoS)PLoS ONE1932-62032012-01-0172e3207610.1371/journal.pone.0032076Organizational heterogeneity of vertebrate genomes.Svetlana FrenkelValery KirzhnerAbraham KorolGenomes of higher eukaryotes are mosaics of segments with various structural, functional, and evolutionary properties. The availability of whole-genome sequences allows the investigation of their structure as "texts" using different statistical and computational methods. One such method, referred to as Compositional Spectra (CS) analysis, is based on scoring the occurrences of fixed-length oligonucleotides (k-mers) in the target DNA sequence. CS analysis allows generating species- or region-specific characteristics of the genome, regardless of their length and the presence of coding DNA. In this study, we consider the heterogeneity of vertebrate genomes as a joint effect of regional variation in sequence organization superimposed on the differences in nucleotide composition. We estimated compositional and organizational heterogeneity of genome and chromosome sequences separately and found that both heterogeneity types vary widely among genomes as well as among chromosomes in all investigated taxonomic groups. The high correspondence of heterogeneity scores obtained on three genome fractions, coding, repetitive, and the remaining part of the noncoding DNA (the genome dark matter--GDM) allows the assumption that CS-heterogeneity may have functional relevance to genome regulation. Of special interest for such interpretation is the fact that natural GDM sequences display the highest deviation from the corresponding reshuffled sequences.http://europepmc.org/articles/PMC3288070?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Svetlana Frenkel Valery Kirzhner Abraham Korol |
spellingShingle |
Svetlana Frenkel Valery Kirzhner Abraham Korol Organizational heterogeneity of vertebrate genomes. PLoS ONE |
author_facet |
Svetlana Frenkel Valery Kirzhner Abraham Korol |
author_sort |
Svetlana Frenkel |
title |
Organizational heterogeneity of vertebrate genomes. |
title_short |
Organizational heterogeneity of vertebrate genomes. |
title_full |
Organizational heterogeneity of vertebrate genomes. |
title_fullStr |
Organizational heterogeneity of vertebrate genomes. |
title_full_unstemmed |
Organizational heterogeneity of vertebrate genomes. |
title_sort |
organizational heterogeneity of vertebrate genomes. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2012-01-01 |
description |
Genomes of higher eukaryotes are mosaics of segments with various structural, functional, and evolutionary properties. The availability of whole-genome sequences allows the investigation of their structure as "texts" using different statistical and computational methods. One such method, referred to as Compositional Spectra (CS) analysis, is based on scoring the occurrences of fixed-length oligonucleotides (k-mers) in the target DNA sequence. CS analysis allows generating species- or region-specific characteristics of the genome, regardless of their length and the presence of coding DNA. In this study, we consider the heterogeneity of vertebrate genomes as a joint effect of regional variation in sequence organization superimposed on the differences in nucleotide composition. We estimated compositional and organizational heterogeneity of genome and chromosome sequences separately and found that both heterogeneity types vary widely among genomes as well as among chromosomes in all investigated taxonomic groups. The high correspondence of heterogeneity scores obtained on three genome fractions, coding, repetitive, and the remaining part of the noncoding DNA (the genome dark matter--GDM) allows the assumption that CS-heterogeneity may have functional relevance to genome regulation. Of special interest for such interpretation is the fact that natural GDM sequences display the highest deviation from the corresponding reshuffled sequences. |
url |
http://europepmc.org/articles/PMC3288070?pdf=render |
work_keys_str_mv |
AT svetlanafrenkel organizationalheterogeneityofvertebrategenomes AT valerykirzhner organizationalheterogeneityofvertebrategenomes AT abrahamkorol organizationalheterogeneityofvertebrategenomes |
_version_ |
1725168340549238784 |