Identification, characterization and distribution of transposable elements in the flax (<it>Linum usitatissimum</it> L.) genome

<p>Abstract</p> <p>Background</p> <p>Flax (<it>Linum usitatissimum</it> L.) is an important crop for the production of bioproducts derived from its seed and stem fiber. Transposable elements (TEs) are widespread in plant genomes and are a key component of th...

Full description

Bibliographic Details
Main Authors: González Leonardo Galindo, Deyholos Michael K
Format: Article
Language:English
Published: BMC 2012-11-01
Series:BMC Genomics
Subjects:
Online Access:http://www.biomedcentral.com/1471-2164/13/644
id doaj-79032110b830434f8212e99306ee6dce
record_format Article
spelling doaj-79032110b830434f8212e99306ee6dce2020-11-24T20:51:43ZengBMCBMC Genomics1471-21642012-11-0113164410.1186/1471-2164-13-644Identification, characterization and distribution of transposable elements in the flax (<it>Linum usitatissimum</it> L.) genomeGonzález Leonardo GalindoDeyholos Michael K<p>Abstract</p> <p>Background</p> <p>Flax (<it>Linum usitatissimum</it> L.) is an important crop for the production of bioproducts derived from its seed and stem fiber. Transposable elements (TEs) are widespread in plant genomes and are a key component of their evolution. The availability of a genome assembly of flax (<it>Linum usitatissimum</it>) affords new opportunities to explore the diversity of TEs and their relationship to genes and gene expression.</p> <p>Results</p> <p>Four <it>de novo</it> repeat identification algorithms (PILER, RepeatScout, LTR_finder and LTR_STRUC) were applied to the flax genome assembly. The resulting library of flax repeats was combined with the RepBase <it>Viridiplantae</it> division and used with RepeatMasker to identify TEs coverage in the genome. LTR retrotransposons were the most abundant TEs (17.2% genome coverage), followed by Long Interspersed Nuclear Element (LINE) retrotransposons (2.10%) and <it>Mutator</it> DNA transposons (1.99%). Comparison of putative flax TEs to flax transcript databases indicated that TEs are not highly expressed in flax. However, the presence of recent insertions, defined by 100% intra-element LTR similarity, provided evidence for recent TE activity. Spatial analysis showed TE-rich regions, gene-rich regions as well as regions with similar genes and TE density. Monte Carlo simulations for the 71 largest scaffolds (≥ 1 Mb each) did not show any regional differences in the frequency of TE overlap with gene coding sequences. However, differences between TE superfamilies were found in their proximity to genes. Genes within TE-rich regions also appeared to have lower transcript expression, based on EST abundance. When LTR elements were compared, <it>Copia</it> showed more diversity, recent insertions and conserved domains than the <it>Gypsy</it>, demonstrating their importance in genome evolution.</p> <p>Conclusions</p> <p>The calculated 23.06% TE coverage of the flax WGS assembly is at the low end of the range of TE coverages reported in other eudicots, although this estimate does not include TEs likely found in unassembled repetitive regions of the genome. Since enrichment for TEs in genomic regions was associated with reduced expression of neighbouring genes, and many members of the <it>Copia</it> LTR superfamily are inserted close to coding regions, we suggest <it>Copia</it> elements have a greater influence on recent flax genome evolution while <it>Gypsy</it> elements have become residual and highly mutated.</p> http://www.biomedcentral.com/1471-2164/13/644Transposable elementsFlaxGenome evolutionLTR elementsGene expression
collection DOAJ
language English
format Article
sources DOAJ
author González Leonardo Galindo
Deyholos Michael K
spellingShingle González Leonardo Galindo
Deyholos Michael K
Identification, characterization and distribution of transposable elements in the flax (<it>Linum usitatissimum</it> L.) genome
BMC Genomics
Transposable elements
Flax
Genome evolution
LTR elements
Gene expression
author_facet González Leonardo Galindo
Deyholos Michael K
author_sort González Leonardo Galindo
title Identification, characterization and distribution of transposable elements in the flax (<it>Linum usitatissimum</it> L.) genome
title_short Identification, characterization and distribution of transposable elements in the flax (<it>Linum usitatissimum</it> L.) genome
title_full Identification, characterization and distribution of transposable elements in the flax (<it>Linum usitatissimum</it> L.) genome
title_fullStr Identification, characterization and distribution of transposable elements in the flax (<it>Linum usitatissimum</it> L.) genome
title_full_unstemmed Identification, characterization and distribution of transposable elements in the flax (<it>Linum usitatissimum</it> L.) genome
title_sort identification, characterization and distribution of transposable elements in the flax (<it>linum usitatissimum</it> l.) genome
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2012-11-01
description <p>Abstract</p> <p>Background</p> <p>Flax (<it>Linum usitatissimum</it> L.) is an important crop for the production of bioproducts derived from its seed and stem fiber. Transposable elements (TEs) are widespread in plant genomes and are a key component of their evolution. The availability of a genome assembly of flax (<it>Linum usitatissimum</it>) affords new opportunities to explore the diversity of TEs and their relationship to genes and gene expression.</p> <p>Results</p> <p>Four <it>de novo</it> repeat identification algorithms (PILER, RepeatScout, LTR_finder and LTR_STRUC) were applied to the flax genome assembly. The resulting library of flax repeats was combined with the RepBase <it>Viridiplantae</it> division and used with RepeatMasker to identify TEs coverage in the genome. LTR retrotransposons were the most abundant TEs (17.2% genome coverage), followed by Long Interspersed Nuclear Element (LINE) retrotransposons (2.10%) and <it>Mutator</it> DNA transposons (1.99%). Comparison of putative flax TEs to flax transcript databases indicated that TEs are not highly expressed in flax. However, the presence of recent insertions, defined by 100% intra-element LTR similarity, provided evidence for recent TE activity. Spatial analysis showed TE-rich regions, gene-rich regions as well as regions with similar genes and TE density. Monte Carlo simulations for the 71 largest scaffolds (≥ 1 Mb each) did not show any regional differences in the frequency of TE overlap with gene coding sequences. However, differences between TE superfamilies were found in their proximity to genes. Genes within TE-rich regions also appeared to have lower transcript expression, based on EST abundance. When LTR elements were compared, <it>Copia</it> showed more diversity, recent insertions and conserved domains than the <it>Gypsy</it>, demonstrating their importance in genome evolution.</p> <p>Conclusions</p> <p>The calculated 23.06% TE coverage of the flax WGS assembly is at the low end of the range of TE coverages reported in other eudicots, although this estimate does not include TEs likely found in unassembled repetitive regions of the genome. Since enrichment for TEs in genomic regions was associated with reduced expression of neighbouring genes, and many members of the <it>Copia</it> LTR superfamily are inserted close to coding regions, we suggest <it>Copia</it> elements have a greater influence on recent flax genome evolution while <it>Gypsy</it> elements have become residual and highly mutated.</p>
topic Transposable elements
Flax
Genome evolution
LTR elements
Gene expression
url http://www.biomedcentral.com/1471-2164/13/644
work_keys_str_mv AT gonzalezleonardogalindo identificationcharacterizationanddistributionoftransposableelementsintheflaxitlinumusitatissimumitlgenome
AT deyholosmichaelk identificationcharacterizationanddistributionoftransposableelementsintheflaxitlinumusitatissimumitlgenome
_version_ 1716801532791357440