Computational characterization of 3' splice variants in the GFAP isoform family.

Glial fibrillary acidic protein (GFAP) is an intermediate filament (IF) protein specific to central nervous system (CNS) astrocytes. It has been the subject of intense interest due to its association with neurodegenerative diseases, and because of growing evidence that IF proteins not only modulate...

Full description

Bibliographic Details
Main Authors: Sarah E Boyd, Betina Nair, Sze Woei Ng, Jonathan M Keith, Jacqueline M Orian
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2012-01-01
Series:PLoS ONE
Online Access:https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/22479412/?tool=EBI
id doaj-cfc5401563e948259b508298673091b6
record_format Article
spelling doaj-cfc5401563e948259b508298673091b62021-03-04T00:55:08ZengPublic Library of Science (PLoS)PLoS ONE1932-62032012-01-0173e3356510.1371/journal.pone.0033565Computational characterization of 3' splice variants in the GFAP isoform family.Sarah E BoydBetina NairSze Woei NgJonathan M KeithJacqueline M OrianGlial fibrillary acidic protein (GFAP) is an intermediate filament (IF) protein specific to central nervous system (CNS) astrocytes. It has been the subject of intense interest due to its association with neurodegenerative diseases, and because of growing evidence that IF proteins not only modulate cellular structure, but also cellular function. Moreover, GFAP has a family of splicing isoforms apparently more complex than that of other CNS IF proteins, consistent with it possessing a range of functional and structural roles. The gene consists of 9 exons, and to date all isoforms associated with 3' end splicing have been identified from modifications within intron 7, resulting in the generation of exon 7a (GFAPδ/ε) and 7b (GFAPκ). To better understand the nature and functional significance of variation in this region, we used a Bayesian multiple change-point approach to identify conserved regions. This is the first successful application of this method to a single gene--it has previously only been used in whole-genome analyses. We identified several highly or moderately conserved regions throughout the intron 7/7a/7b regions, including untranslated regions and regulatory features, consistent with the biology of GFAP. Several putative unconfirmed features were also identified, including a possible new isoform. We then integrated multiple computational analyses on both the DNA and protein sequences from the mouse, rat and human, showing that the major isoform, GFAPα, has highly conserved structure and features across the three species, whereas the minor isoforms GFAPδ/ε and GFAPκ have low conservation of structure and features at the distal 3' end, both relative to each other and relative to GFAPα. The overall picture suggests distinct and tightly regulated functions for the 3' end isoforms, consistent with complex astrocyte biology. The results illustrate a computational approach for characterising splicing isoform families, using both DNA and protein sequences.https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/22479412/?tool=EBI
collection DOAJ
language English
format Article
sources DOAJ
author Sarah E Boyd
Betina Nair
Sze Woei Ng
Jonathan M Keith
Jacqueline M Orian
spellingShingle Sarah E Boyd
Betina Nair
Sze Woei Ng
Jonathan M Keith
Jacqueline M Orian
Computational characterization of 3' splice variants in the GFAP isoform family.
PLoS ONE
author_facet Sarah E Boyd
Betina Nair
Sze Woei Ng
Jonathan M Keith
Jacqueline M Orian
author_sort Sarah E Boyd
title Computational characterization of 3' splice variants in the GFAP isoform family.
title_short Computational characterization of 3' splice variants in the GFAP isoform family.
title_full Computational characterization of 3' splice variants in the GFAP isoform family.
title_fullStr Computational characterization of 3' splice variants in the GFAP isoform family.
title_full_unstemmed Computational characterization of 3' splice variants in the GFAP isoform family.
title_sort computational characterization of 3' splice variants in the gfap isoform family.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2012-01-01
description Glial fibrillary acidic protein (GFAP) is an intermediate filament (IF) protein specific to central nervous system (CNS) astrocytes. It has been the subject of intense interest due to its association with neurodegenerative diseases, and because of growing evidence that IF proteins not only modulate cellular structure, but also cellular function. Moreover, GFAP has a family of splicing isoforms apparently more complex than that of other CNS IF proteins, consistent with it possessing a range of functional and structural roles. The gene consists of 9 exons, and to date all isoforms associated with 3' end splicing have been identified from modifications within intron 7, resulting in the generation of exon 7a (GFAPδ/ε) and 7b (GFAPκ). To better understand the nature and functional significance of variation in this region, we used a Bayesian multiple change-point approach to identify conserved regions. This is the first successful application of this method to a single gene--it has previously only been used in whole-genome analyses. We identified several highly or moderately conserved regions throughout the intron 7/7a/7b regions, including untranslated regions and regulatory features, consistent with the biology of GFAP. Several putative unconfirmed features were also identified, including a possible new isoform. We then integrated multiple computational analyses on both the DNA and protein sequences from the mouse, rat and human, showing that the major isoform, GFAPα, has highly conserved structure and features across the three species, whereas the minor isoforms GFAPδ/ε and GFAPκ have low conservation of structure and features at the distal 3' end, both relative to each other and relative to GFAPα. The overall picture suggests distinct and tightly regulated functions for the 3' end isoforms, consistent with complex astrocyte biology. The results illustrate a computational approach for characterising splicing isoform families, using both DNA and protein sequences.
url https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/22479412/?tool=EBI
work_keys_str_mv AT saraheboyd computationalcharacterizationof3splicevariantsinthegfapisoformfamily
AT betinanair computationalcharacterizationof3splicevariantsinthegfapisoformfamily
AT szewoeing computationalcharacterizationof3splicevariantsinthegfapisoformfamily
AT jonathanmkeith computationalcharacterizationof3splicevariantsinthegfapisoformfamily
AT jacquelinemorian computationalcharacterizationof3splicevariantsinthegfapisoformfamily
_version_ 1714809892423860224