De novo transcriptome sequencing in Anopheles funestus using Illumina RNA-seq technology.

Anopheles funestus is one of the primary vectors of human malaria, which causes a million deaths each year in sub-Saharan Africa. Few scientific resources are available to facilitate studies of this mosquito species and relatively little is known about its basic biology and evolution, making develop...

Full description

Bibliographic Details
Main Authors: Jacob E Crawford, Wamdaogo M Guelbeogo, Antoine Sanou, Alphonse Traoré, Kenneth D Vernick, N'Fale Sagnon, Brian P Lazzaro
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2010-12-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC2996306?pdf=render
id doaj-ee73b87135d8411d876442c09a676fed
record_format Article
spelling doaj-ee73b87135d8411d876442c09a676fed2020-11-25T01:53:32ZengPublic Library of Science (PLoS)PLoS ONE1932-62032010-12-01512e1420210.1371/journal.pone.0014202De novo transcriptome sequencing in Anopheles funestus using Illumina RNA-seq technology.Jacob E CrawfordWamdaogo M GuelbeogoAntoine SanouAlphonse TraoréKenneth D VernickN'Fale SagnonBrian P LazzaroAnopheles funestus is one of the primary vectors of human malaria, which causes a million deaths each year in sub-Saharan Africa. Few scientific resources are available to facilitate studies of this mosquito species and relatively little is known about its basic biology and evolution, making development and implementation of novel disease control efforts more difficult. The An. funestus genome has not been sequenced, so in order to facilitate genome-scale experimental biology, we have sequenced the adult female transcriptome of An. funestus from a newly founded colony in Burkina Faso, West Africa, using the Illumina GAIIx next generation sequencing platform.We assembled short Illumina reads de novo using a novel approach involving iterative de novo assemblies and "target-based" contig clustering. We then selected a conservative set of 15,527 contigs through comparisons to four Dipteran transcriptomes as well as multiple functional and conserved protein domain databases. Comparison to the Anopheles gambiae immune system identified 339 contigs as putative immune genes, thus identifying a large portion of the immune system that can form the basis for subsequent studies of this important malaria vector. We identified 5,434 1:1 orthologues between An. funestus and An. gambiae and found that among these 1:1 orthologues, the protein sequence of those with putative immune function were significantly more diverged than the transcriptome as a whole. Short read alignments to the contig set revealed almost 367,000 genetic polymorphisms segregating in the An. funestus colony and demonstrated the utility of the assembled transcriptome for use in RNA-seq based measurements of gene expression.We developed a pipeline that makes de novo transcriptome sequencing possible in virtually any organism at a very reasonable cost ($6,300 in sequencing costs in our case). We anticipate that our approach could be used to develop genomic resources in a diversity of systems for which full genome sequence is currently unavailable. Our An. funestus contig set and analytical results provide a valuable resource for future studies in this non-model, but epidemiologically critical, vector insect.http://europepmc.org/articles/PMC2996306?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Jacob E Crawford
Wamdaogo M Guelbeogo
Antoine Sanou
Alphonse Traoré
Kenneth D Vernick
N'Fale Sagnon
Brian P Lazzaro
spellingShingle Jacob E Crawford
Wamdaogo M Guelbeogo
Antoine Sanou
Alphonse Traoré
Kenneth D Vernick
N'Fale Sagnon
Brian P Lazzaro
De novo transcriptome sequencing in Anopheles funestus using Illumina RNA-seq technology.
PLoS ONE
author_facet Jacob E Crawford
Wamdaogo M Guelbeogo
Antoine Sanou
Alphonse Traoré
Kenneth D Vernick
N'Fale Sagnon
Brian P Lazzaro
author_sort Jacob E Crawford
title De novo transcriptome sequencing in Anopheles funestus using Illumina RNA-seq technology.
title_short De novo transcriptome sequencing in Anopheles funestus using Illumina RNA-seq technology.
title_full De novo transcriptome sequencing in Anopheles funestus using Illumina RNA-seq technology.
title_fullStr De novo transcriptome sequencing in Anopheles funestus using Illumina RNA-seq technology.
title_full_unstemmed De novo transcriptome sequencing in Anopheles funestus using Illumina RNA-seq technology.
title_sort de novo transcriptome sequencing in anopheles funestus using illumina rna-seq technology.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2010-12-01
description Anopheles funestus is one of the primary vectors of human malaria, which causes a million deaths each year in sub-Saharan Africa. Few scientific resources are available to facilitate studies of this mosquito species and relatively little is known about its basic biology and evolution, making development and implementation of novel disease control efforts more difficult. The An. funestus genome has not been sequenced, so in order to facilitate genome-scale experimental biology, we have sequenced the adult female transcriptome of An. funestus from a newly founded colony in Burkina Faso, West Africa, using the Illumina GAIIx next generation sequencing platform.We assembled short Illumina reads de novo using a novel approach involving iterative de novo assemblies and "target-based" contig clustering. We then selected a conservative set of 15,527 contigs through comparisons to four Dipteran transcriptomes as well as multiple functional and conserved protein domain databases. Comparison to the Anopheles gambiae immune system identified 339 contigs as putative immune genes, thus identifying a large portion of the immune system that can form the basis for subsequent studies of this important malaria vector. We identified 5,434 1:1 orthologues between An. funestus and An. gambiae and found that among these 1:1 orthologues, the protein sequence of those with putative immune function were significantly more diverged than the transcriptome as a whole. Short read alignments to the contig set revealed almost 367,000 genetic polymorphisms segregating in the An. funestus colony and demonstrated the utility of the assembled transcriptome for use in RNA-seq based measurements of gene expression.We developed a pipeline that makes de novo transcriptome sequencing possible in virtually any organism at a very reasonable cost ($6,300 in sequencing costs in our case). We anticipate that our approach could be used to develop genomic resources in a diversity of systems for which full genome sequence is currently unavailable. Our An. funestus contig set and analytical results provide a valuable resource for future studies in this non-model, but epidemiologically critical, vector insect.
url http://europepmc.org/articles/PMC2996306?pdf=render
work_keys_str_mv AT jacobecrawford denovotranscriptomesequencinginanophelesfunestususingilluminarnaseqtechnology
AT wamdaogomguelbeogo denovotranscriptomesequencinginanophelesfunestususingilluminarnaseqtechnology
AT antoinesanou denovotranscriptomesequencinginanophelesfunestususingilluminarnaseqtechnology
AT alphonsetraore denovotranscriptomesequencinginanophelesfunestususingilluminarnaseqtechnology
AT kennethdvernick denovotranscriptomesequencinginanophelesfunestususingilluminarnaseqtechnology
AT nfalesagnon denovotranscriptomesequencinginanophelesfunestususingilluminarnaseqtechnology
AT brianplazzaro denovotranscriptomesequencinginanophelesfunestususingilluminarnaseqtechnology
_version_ 1724990435167830016