CAG-encoded polyglutamine length polymorphism in the human genome

<p>Abstract</p> <p>Background</p> <p>Expansion of polyglutamine-encoding CAG trinucleotide repeats has been identified as the pathogenic mutation in nine different genes associated with neurodegenerative disorders. The majority of individuals clinically diagnosed with s...

Full description

Bibliographic Details
Main Authors: Hayden Michael R, Yuen Macaire MS, Yang George S, Wilkinson Anna, Lee Soo, Neal Scott J, Meynert Alison M, Mead Carri-Lyn, Huang Yong, Devon Rebecca S, Butland Stefanie L, Holt Robert A, Leavitt Blair R, Ouellette BF Francis
Format: Article
Language:English
Published: BMC 2007-05-01
Series:BMC Genomics
Online Access:http://www.biomedcentral.com/1471-2164/8/126
id doaj-aa690c9861134106bc0b1f1d175d79d7
record_format Article
spelling doaj-aa690c9861134106bc0b1f1d175d79d72020-11-24T22:58:49ZengBMCBMC Genomics1471-21642007-05-018112610.1186/1471-2164-8-126CAG-encoded polyglutamine length polymorphism in the human genomeHayden Michael RYuen Macaire MSYang George SWilkinson AnnaLee SooNeal Scott JMeynert Alison MMead Carri-LynHuang YongDevon Rebecca SButland Stefanie LHolt Robert ALeavitt Blair ROuellette BF Francis<p>Abstract</p> <p>Background</p> <p>Expansion of polyglutamine-encoding CAG trinucleotide repeats has been identified as the pathogenic mutation in nine different genes associated with neurodegenerative disorders. The majority of individuals clinically diagnosed with spinocerebellar ataxia do not have mutations within known disease genes, and it is likely that additional ataxias or Huntington disease-like disorders will be found to be caused by this common mutational mechanism. We set out to determine the length distributions of CAG-polyglutamine tracts for the entire human genome in a set of healthy individuals in order to characterize the nature of polyglutamine repeat length variation across the human genome, to establish the background against which pathogenic repeat expansions can be detected, and to prioritize candidate genes for repeat expansion disorders.</p> <p>Results</p> <p>We found that repeats, including those in known disease genes, have unique distributions of glutamine tract lengths, as measured by fragment analysis of PCR-amplified repeat regions. This emphasizes the need to characterize each distribution and avoid making generalizations between loci. The best predictors of known disease genes were occurrence of a long CAG-tract uninterrupted by CAA codons in their reference genome sequence, and high glutamine tract length variance in the normal population. We used these parameters to identify eight priority candidate genes for polyglutamine expansion disorders. Twelve CAG-polyglutamine repeats were invariant and these can likely be excluded as candidates. We outline some confusion in the literature about this type of data, difficulties in comparing such data between publications, and its application to studies of disease prevalence in different populations. Analysis of Gene Ontology-based functions of CAG-polyglutamine-containing genes provided a visual framework for interpretation of these genes' functions. All nine known disease genes were involved in DNA-dependent regulation of transcription or in neurogenesis, as were all of the well-characterized priority candidate genes.</p> <p>Conclusion</p> <p>This publication makes freely available the normal distributions of CAG-polyglutamine repeats in the human genome. Using these background distributions, against which pathogenic expansions can be identified, we have begun screening for mutations in individuals clinically diagnosed with novel forms of spinocerebellar ataxia or Huntington disease-like disorders who do not have identified mutations within the known disease-associated genes.</p> http://www.biomedcentral.com/1471-2164/8/126
collection DOAJ
language English
format Article
sources DOAJ
author Hayden Michael R
Yuen Macaire MS
Yang George S
Wilkinson Anna
Lee Soo
Neal Scott J
Meynert Alison M
Mead Carri-Lyn
Huang Yong
Devon Rebecca S
Butland Stefanie L
Holt Robert A
Leavitt Blair R
Ouellette BF Francis
spellingShingle Hayden Michael R
Yuen Macaire MS
Yang George S
Wilkinson Anna
Lee Soo
Neal Scott J
Meynert Alison M
Mead Carri-Lyn
Huang Yong
Devon Rebecca S
Butland Stefanie L
Holt Robert A
Leavitt Blair R
Ouellette BF Francis
CAG-encoded polyglutamine length polymorphism in the human genome
BMC Genomics
author_facet Hayden Michael R
Yuen Macaire MS
Yang George S
Wilkinson Anna
Lee Soo
Neal Scott J
Meynert Alison M
Mead Carri-Lyn
Huang Yong
Devon Rebecca S
Butland Stefanie L
Holt Robert A
Leavitt Blair R
Ouellette BF Francis
author_sort Hayden Michael R
title CAG-encoded polyglutamine length polymorphism in the human genome
title_short CAG-encoded polyglutamine length polymorphism in the human genome
title_full CAG-encoded polyglutamine length polymorphism in the human genome
title_fullStr CAG-encoded polyglutamine length polymorphism in the human genome
title_full_unstemmed CAG-encoded polyglutamine length polymorphism in the human genome
title_sort cag-encoded polyglutamine length polymorphism in the human genome
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2007-05-01
description <p>Abstract</p> <p>Background</p> <p>Expansion of polyglutamine-encoding CAG trinucleotide repeats has been identified as the pathogenic mutation in nine different genes associated with neurodegenerative disorders. The majority of individuals clinically diagnosed with spinocerebellar ataxia do not have mutations within known disease genes, and it is likely that additional ataxias or Huntington disease-like disorders will be found to be caused by this common mutational mechanism. We set out to determine the length distributions of CAG-polyglutamine tracts for the entire human genome in a set of healthy individuals in order to characterize the nature of polyglutamine repeat length variation across the human genome, to establish the background against which pathogenic repeat expansions can be detected, and to prioritize candidate genes for repeat expansion disorders.</p> <p>Results</p> <p>We found that repeats, including those in known disease genes, have unique distributions of glutamine tract lengths, as measured by fragment analysis of PCR-amplified repeat regions. This emphasizes the need to characterize each distribution and avoid making generalizations between loci. The best predictors of known disease genes were occurrence of a long CAG-tract uninterrupted by CAA codons in their reference genome sequence, and high glutamine tract length variance in the normal population. We used these parameters to identify eight priority candidate genes for polyglutamine expansion disorders. Twelve CAG-polyglutamine repeats were invariant and these can likely be excluded as candidates. We outline some confusion in the literature about this type of data, difficulties in comparing such data between publications, and its application to studies of disease prevalence in different populations. Analysis of Gene Ontology-based functions of CAG-polyglutamine-containing genes provided a visual framework for interpretation of these genes' functions. All nine known disease genes were involved in DNA-dependent regulation of transcription or in neurogenesis, as were all of the well-characterized priority candidate genes.</p> <p>Conclusion</p> <p>This publication makes freely available the normal distributions of CAG-polyglutamine repeats in the human genome. Using these background distributions, against which pathogenic expansions can be identified, we have begun screening for mutations in individuals clinically diagnosed with novel forms of spinocerebellar ataxia or Huntington disease-like disorders who do not have identified mutations within the known disease-associated genes.</p>
url http://www.biomedcentral.com/1471-2164/8/126
work_keys_str_mv AT haydenmichaelr cagencodedpolyglutaminelengthpolymorphisminthehumangenome
AT yuenmacairems cagencodedpolyglutaminelengthpolymorphisminthehumangenome
AT yanggeorges cagencodedpolyglutaminelengthpolymorphisminthehumangenome
AT wilkinsonanna cagencodedpolyglutaminelengthpolymorphisminthehumangenome
AT leesoo cagencodedpolyglutaminelengthpolymorphisminthehumangenome
AT nealscottj cagencodedpolyglutaminelengthpolymorphisminthehumangenome
AT meynertalisonm cagencodedpolyglutaminelengthpolymorphisminthehumangenome
AT meadcarrilyn cagencodedpolyglutaminelengthpolymorphisminthehumangenome
AT huangyong cagencodedpolyglutaminelengthpolymorphisminthehumangenome
AT devonrebeccas cagencodedpolyglutaminelengthpolymorphisminthehumangenome
AT butlandstefaniel cagencodedpolyglutaminelengthpolymorphisminthehumangenome
AT holtroberta cagencodedpolyglutaminelengthpolymorphisminthehumangenome
AT leavittblairr cagencodedpolyglutaminelengthpolymorphisminthehumangenome
AT ouellettebffrancis cagencodedpolyglutaminelengthpolymorphisminthehumangenome
_version_ 1725646382003388416