A new Plasmodium vivax reference sequence with improved assembly of the subtelomeres reveals an abundance of pir genes [version 1; referees: 2 approved]

Plasmodium vivax is now the predominant cause of malaria in the Asia-Pacific, South America and Horn of Africa. Laboratory studies of this species are constrained by the inability to maintain the parasite in continuous ex vivo culture, but genomic approaches provide an alternative and complementary...

Full description

Bibliographic Details
Main Authors: Sarah Auburn, Ulrike Böhme, Sascha Steinbiss, Hidayat Trimarsanto, Jessica Hostetler, Mandy Sanders, Qi Gao, Francois Nosten, Chris I. Newbold, Matthew Berriman, Ric N. Price, Thomas D. Otto
Format: Article
Language:English
Published: Wellcome 2016-11-01
Series:Wellcome Open Research
Subjects:
Online Access:https://wellcomeopenresearch.org/articles/1-4/v1
id doaj-a94e1b1e62f344ca912e55dd85154047
record_format Article
spelling doaj-a94e1b1e62f344ca912e55dd851540472020-11-24T23:50:17ZengWellcomeWellcome Open Research2398-502X2016-11-01110.12688/wellcomeopenres.9876.110647A new Plasmodium vivax reference sequence with improved assembly of the subtelomeres reveals an abundance of pir genes [version 1; referees: 2 approved]Sarah Auburn0Ulrike Böhme1Sascha Steinbiss2Hidayat Trimarsanto3Jessica Hostetler4Mandy Sanders5Qi Gao6Francois Nosten7Chris I. Newbold8Matthew Berriman9Ric N. Price10Thomas D. Otto11Global and Tropical Health Division, Menzies School of Health Research and Charles Darwin University, Darwin, AustraliaMalaria Programme, Wellcome Trust Sanger Institute, Hinxton, UKMalaria Programme, Wellcome Trust Sanger Institute, Hinxton, UKEijkman Institute for Molecular Biology, Jakarta, IndonesiaLaboratory of Malaria and Vector Research, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Rockville, USAMalaria Programme, Wellcome Trust Sanger Institute, Hinxton, UKJiangsu Institute of Parasitic Diseases, Key Laboratory of Parasitic Disease Control and Prevention (Ministry of Health), Jiangsu Provincial Key Laboratory of Parasite Molecular Biology, Jiangsu, ChinaShoklo Malaria Research Unit, Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Mae Sot, ThailandWeatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, UKMalaria Programme, Wellcome Trust Sanger Institute, Hinxton, UKCentre for Tropical Medicine and Global Health, Nuffield Department of Clinical Medicine, University of Oxford, Oxford, UKMalaria Programme, Wellcome Trust Sanger Institute, Hinxton, UKPlasmodium vivax is now the predominant cause of malaria in the Asia-Pacific, South America and Horn of Africa. Laboratory studies of this species are constrained by the inability to maintain the parasite in continuous ex vivo culture, but genomic approaches provide an alternative and complementary avenue to investigate the parasite’s biology and epidemiology. To date, molecular studies of P. vivax have relied on the Salvador-I reference genome sequence, derived from a monkey-adapted strain from South America. However, the Salvador-I reference remains highly fragmented with over 2500 unassembled scaffolds.  Using high-depth Illumina sequence data, we assembled and annotated a new reference sequence, PvP01, sourced directly from a patient from Papua Indonesia. Draft assemblies of isolates from China (PvC01) and Thailand (PvT01) were also prepared for comparative purposes. The quality of the PvP01 assembly is improved greatly over Salvador-I, with fragmentation reduced to 226 scaffolds. Detailed manual curation has ensured highly comprehensive annotation, with functions attributed to 58% core genes in PvP01 versus 38% in Salvador-I. The assemblies of PvP01, PvC01 and PvT01 are larger than that of Salvador-I (28-30 versus 27 Mb), owing to improved assembly of the subtelomeres.  An extensive repertoire of over 1200 Plasmodium interspersed repeat (pir) genes were identified in PvP01 compared to 346 in Salvador-I, suggesting a vital role in parasite survival or development. The manually curated PvP01 reference and PvC01 and PvT01 draft assemblies are important new resources to study vivax malaria. PvP01 is maintained at GeneDB and ongoing curation will ensure continual improvements in assembly and annotation quality.https://wellcomeopenresearch.org/articles/1-4/v1GenomicsTropical & Travel-Associated Diseases
collection DOAJ
language English
format Article
sources DOAJ
author Sarah Auburn
Ulrike Böhme
Sascha Steinbiss
Hidayat Trimarsanto
Jessica Hostetler
Mandy Sanders
Qi Gao
Francois Nosten
Chris I. Newbold
Matthew Berriman
Ric N. Price
Thomas D. Otto
spellingShingle Sarah Auburn
Ulrike Böhme
Sascha Steinbiss
Hidayat Trimarsanto
Jessica Hostetler
Mandy Sanders
Qi Gao
Francois Nosten
Chris I. Newbold
Matthew Berriman
Ric N. Price
Thomas D. Otto
A new Plasmodium vivax reference sequence with improved assembly of the subtelomeres reveals an abundance of pir genes [version 1; referees: 2 approved]
Wellcome Open Research
Genomics
Tropical & Travel-Associated Diseases
author_facet Sarah Auburn
Ulrike Böhme
Sascha Steinbiss
Hidayat Trimarsanto
Jessica Hostetler
Mandy Sanders
Qi Gao
Francois Nosten
Chris I. Newbold
Matthew Berriman
Ric N. Price
Thomas D. Otto
author_sort Sarah Auburn
title A new Plasmodium vivax reference sequence with improved assembly of the subtelomeres reveals an abundance of pir genes [version 1; referees: 2 approved]
title_short A new Plasmodium vivax reference sequence with improved assembly of the subtelomeres reveals an abundance of pir genes [version 1; referees: 2 approved]
title_full A new Plasmodium vivax reference sequence with improved assembly of the subtelomeres reveals an abundance of pir genes [version 1; referees: 2 approved]
title_fullStr A new Plasmodium vivax reference sequence with improved assembly of the subtelomeres reveals an abundance of pir genes [version 1; referees: 2 approved]
title_full_unstemmed A new Plasmodium vivax reference sequence with improved assembly of the subtelomeres reveals an abundance of pir genes [version 1; referees: 2 approved]
title_sort new plasmodium vivax reference sequence with improved assembly of the subtelomeres reveals an abundance of pir genes [version 1; referees: 2 approved]
publisher Wellcome
series Wellcome Open Research
issn 2398-502X
publishDate 2016-11-01
description Plasmodium vivax is now the predominant cause of malaria in the Asia-Pacific, South America and Horn of Africa. Laboratory studies of this species are constrained by the inability to maintain the parasite in continuous ex vivo culture, but genomic approaches provide an alternative and complementary avenue to investigate the parasite’s biology and epidemiology. To date, molecular studies of P. vivax have relied on the Salvador-I reference genome sequence, derived from a monkey-adapted strain from South America. However, the Salvador-I reference remains highly fragmented with over 2500 unassembled scaffolds.  Using high-depth Illumina sequence data, we assembled and annotated a new reference sequence, PvP01, sourced directly from a patient from Papua Indonesia. Draft assemblies of isolates from China (PvC01) and Thailand (PvT01) were also prepared for comparative purposes. The quality of the PvP01 assembly is improved greatly over Salvador-I, with fragmentation reduced to 226 scaffolds. Detailed manual curation has ensured highly comprehensive annotation, with functions attributed to 58% core genes in PvP01 versus 38% in Salvador-I. The assemblies of PvP01, PvC01 and PvT01 are larger than that of Salvador-I (28-30 versus 27 Mb), owing to improved assembly of the subtelomeres.  An extensive repertoire of over 1200 Plasmodium interspersed repeat (pir) genes were identified in PvP01 compared to 346 in Salvador-I, suggesting a vital role in parasite survival or development. The manually curated PvP01 reference and PvC01 and PvT01 draft assemblies are important new resources to study vivax malaria. PvP01 is maintained at GeneDB and ongoing curation will ensure continual improvements in assembly and annotation quality.
topic Genomics
Tropical & Travel-Associated Diseases
url https://wellcomeopenresearch.org/articles/1-4/v1
work_keys_str_mv AT sarahauburn anewplasmodiumvivaxreferencesequencewithimprovedassemblyofthesubtelomeresrevealsanabundanceofpirgenesversion1referees2approved
AT ulrikebohme anewplasmodiumvivaxreferencesequencewithimprovedassemblyofthesubtelomeresrevealsanabundanceofpirgenesversion1referees2approved
AT saschasteinbiss anewplasmodiumvivaxreferencesequencewithimprovedassemblyofthesubtelomeresrevealsanabundanceofpirgenesversion1referees2approved
AT hidayattrimarsanto anewplasmodiumvivaxreferencesequencewithimprovedassemblyofthesubtelomeresrevealsanabundanceofpirgenesversion1referees2approved
AT jessicahostetler anewplasmodiumvivaxreferencesequencewithimprovedassemblyofthesubtelomeresrevealsanabundanceofpirgenesversion1referees2approved
AT mandysanders anewplasmodiumvivaxreferencesequencewithimprovedassemblyofthesubtelomeresrevealsanabundanceofpirgenesversion1referees2approved
AT qigao anewplasmodiumvivaxreferencesequencewithimprovedassemblyofthesubtelomeresrevealsanabundanceofpirgenesversion1referees2approved
AT francoisnosten anewplasmodiumvivaxreferencesequencewithimprovedassemblyofthesubtelomeresrevealsanabundanceofpirgenesversion1referees2approved
AT chrisinewbold anewplasmodiumvivaxreferencesequencewithimprovedassemblyofthesubtelomeresrevealsanabundanceofpirgenesversion1referees2approved
AT matthewberriman anewplasmodiumvivaxreferencesequencewithimprovedassemblyofthesubtelomeresrevealsanabundanceofpirgenesversion1referees2approved
AT ricnprice anewplasmodiumvivaxreferencesequencewithimprovedassemblyofthesubtelomeresrevealsanabundanceofpirgenesversion1referees2approved
AT thomasdotto anewplasmodiumvivaxreferencesequencewithimprovedassemblyofthesubtelomeresrevealsanabundanceofpirgenesversion1referees2approved
AT sarahauburn newplasmodiumvivaxreferencesequencewithimprovedassemblyofthesubtelomeresrevealsanabundanceofpirgenesversion1referees2approved
AT ulrikebohme newplasmodiumvivaxreferencesequencewithimprovedassemblyofthesubtelomeresrevealsanabundanceofpirgenesversion1referees2approved
AT saschasteinbiss newplasmodiumvivaxreferencesequencewithimprovedassemblyofthesubtelomeresrevealsanabundanceofpirgenesversion1referees2approved
AT hidayattrimarsanto newplasmodiumvivaxreferencesequencewithimprovedassemblyofthesubtelomeresrevealsanabundanceofpirgenesversion1referees2approved
AT jessicahostetler newplasmodiumvivaxreferencesequencewithimprovedassemblyofthesubtelomeresrevealsanabundanceofpirgenesversion1referees2approved
AT mandysanders newplasmodiumvivaxreferencesequencewithimprovedassemblyofthesubtelomeresrevealsanabundanceofpirgenesversion1referees2approved
AT qigao newplasmodiumvivaxreferencesequencewithimprovedassemblyofthesubtelomeresrevealsanabundanceofpirgenesversion1referees2approved
AT francoisnosten newplasmodiumvivaxreferencesequencewithimprovedassemblyofthesubtelomeresrevealsanabundanceofpirgenesversion1referees2approved
AT chrisinewbold newplasmodiumvivaxreferencesequencewithimprovedassemblyofthesubtelomeresrevealsanabundanceofpirgenesversion1referees2approved
AT matthewberriman newplasmodiumvivaxreferencesequencewithimprovedassemblyofthesubtelomeresrevealsanabundanceofpirgenesversion1referees2approved
AT ricnprice newplasmodiumvivaxreferencesequencewithimprovedassemblyofthesubtelomeresrevealsanabundanceofpirgenesversion1referees2approved
AT thomasdotto newplasmodiumvivaxreferencesequencewithimprovedassemblyofthesubtelomeresrevealsanabundanceofpirgenesversion1referees2approved
_version_ 1725479304035303424