Centromeric Satellite DNAs: Hidden Sequence Variation in the Human Population

The central goal of medical genomics is to understand the inherited basis of sequence variation that underlies human physiology, evolution, and disease. Functional association studies currently ignore millions of bases that span each centromeric region and acrocentric short arm. These regions are en...

Full description

Bibliographic Details
Main Author: Karen H. Miga
Format: Article
Language:English
Published: MDPI AG 2019-05-01
Series:Genes
Subjects:
Online Access:https://www.mdpi.com/2073-4425/10/5/352
id doaj-91a2bad606424137befaecb8e2c8a090
record_format Article
spelling doaj-91a2bad606424137befaecb8e2c8a0902020-11-25T00:03:58ZengMDPI AGGenes2073-44252019-05-0110535210.3390/genes10050352genes10050352Centromeric Satellite DNAs: Hidden Sequence Variation in the Human PopulationKaren H. Miga0UC Santa Cruz Genomics Institute, University of California, Santa Cruz, California, CA 95064, USAThe central goal of medical genomics is to understand the inherited basis of sequence variation that underlies human physiology, evolution, and disease. Functional association studies currently ignore millions of bases that span each centromeric region and acrocentric short arm. These regions are enriched in long arrays of tandem repeats, or satellite DNAs, that are known to vary extensively in copy number and repeat structure in the human population. Satellite sequence variation in the human genome is often so large that it is detected cytogenetically, yet due to the lack of a reference assembly and informatics tools to measure this variability, contemporary high-resolution disease association studies are unable to detect causal variants in these regions. Nevertheless, recently uncovered associations between satellite DNA variation and human disease support that these regions present a substantial and biologically important fraction of human sequence variation. Therefore, there is a pressing and unmet need to detect and incorporate this uncharacterized sequence variation into broad studies of human evolution and medical genomics. Here I discuss the current knowledge of satellite DNA variation in the human genome, focusing on centromeric satellites and their potential implications for disease.https://www.mdpi.com/2073-4425/10/5/352satellite DNAcentromeresequence variationstructural variationrepeatalpha satellitehuman satellitesgenome assembly
collection DOAJ
language English
format Article
sources DOAJ
author Karen H. Miga
spellingShingle Karen H. Miga
Centromeric Satellite DNAs: Hidden Sequence Variation in the Human Population
Genes
satellite DNA
centromere
sequence variation
structural variation
repeat
alpha satellite
human satellites
genome assembly
author_facet Karen H. Miga
author_sort Karen H. Miga
title Centromeric Satellite DNAs: Hidden Sequence Variation in the Human Population
title_short Centromeric Satellite DNAs: Hidden Sequence Variation in the Human Population
title_full Centromeric Satellite DNAs: Hidden Sequence Variation in the Human Population
title_fullStr Centromeric Satellite DNAs: Hidden Sequence Variation in the Human Population
title_full_unstemmed Centromeric Satellite DNAs: Hidden Sequence Variation in the Human Population
title_sort centromeric satellite dnas: hidden sequence variation in the human population
publisher MDPI AG
series Genes
issn 2073-4425
publishDate 2019-05-01
description The central goal of medical genomics is to understand the inherited basis of sequence variation that underlies human physiology, evolution, and disease. Functional association studies currently ignore millions of bases that span each centromeric region and acrocentric short arm. These regions are enriched in long arrays of tandem repeats, or satellite DNAs, that are known to vary extensively in copy number and repeat structure in the human population. Satellite sequence variation in the human genome is often so large that it is detected cytogenetically, yet due to the lack of a reference assembly and informatics tools to measure this variability, contemporary high-resolution disease association studies are unable to detect causal variants in these regions. Nevertheless, recently uncovered associations between satellite DNA variation and human disease support that these regions present a substantial and biologically important fraction of human sequence variation. Therefore, there is a pressing and unmet need to detect and incorporate this uncharacterized sequence variation into broad studies of human evolution and medical genomics. Here I discuss the current knowledge of satellite DNA variation in the human genome, focusing on centromeric satellites and their potential implications for disease.
topic satellite DNA
centromere
sequence variation
structural variation
repeat
alpha satellite
human satellites
genome assembly
url https://www.mdpi.com/2073-4425/10/5/352
work_keys_str_mv AT karenhmiga centromericsatellitednashiddensequencevariationinthehumanpopulation
_version_ 1725431720850751488