Long-read genome sequencing for the molecular diagnosis of neurodevelopmental disorders

Summary: Exome and genome sequencing have proven to be effective tools for the diagnosis of neurodevelopmental disorders (NDDs), but large fractions of NDDs cannot be attributed to currently detectable genetic variation. This is likely, at least in part, a result of the fact that many genetic varian...

Full description

Bibliographic Details
Main Authors: Susan M. Hiatt, James M.J. Lawlor, Lori H. Handley, Ryne C. Ramaker, Brianne B. Rogers, E. Christopher Partridge, Lori Beth Boston, Melissa Williams, Christopher B. Plott, Jerry Jenkins, David E. Gray, James M. Holt, Kevin M. Bowling, E. Martina Bebin, Jane Grimwood, Jeremy Schmutz, Gregory M. Cooper
Format: Article
Language:English
Published: Elsevier 2021-04-01
Series:HGG Advances
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S266624772100004X
id doaj-9e89b02fe99e4ca599239301f8118f70
record_format Article
spelling doaj-9e89b02fe99e4ca599239301f8118f702021-05-04T07:33:08ZengElsevierHGG Advances2666-24772021-04-0122100023Long-read genome sequencing for the molecular diagnosis of neurodevelopmental disordersSusan M. Hiatt0James M.J. Lawlor1Lori H. Handley2Ryne C. Ramaker3Brianne B. Rogers4E. Christopher Partridge5Lori Beth Boston6Melissa Williams7Christopher B. Plott8Jerry Jenkins9David E. Gray10James M. Holt11Kevin M. Bowling12E. Martina Bebin13Jane Grimwood14Jeremy Schmutz15Gregory M. Cooper16HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USAHudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USAHudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USAHudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USAHudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA; Department of Genetics, University of Alabama at Birmingham, Birmingham, AL 35924, USAHudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USAHudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USAHudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USAHudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USAHudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USAHudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USAHudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USAHudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USADepartment of Neurology, University of Alabama at Birmingham, Birmingham, AL 35924, USAHudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USAHudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USAHudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA; Corresponding authorSummary: Exome and genome sequencing have proven to be effective tools for the diagnosis of neurodevelopmental disorders (NDDs), but large fractions of NDDs cannot be attributed to currently detectable genetic variation. This is likely, at least in part, a result of the fact that many genetic variants are difficult or impossible to detect through typical short-read sequencing approaches. Here, we describe a genomic analysis using Pacific Biosciences circular consensus sequencing (CCS) reads, which are both long (>10 kb) and accurate (>99% bp accuracy). We used CCS on six proband-parent trios with NDDs that were unexplained despite extensive testing, including genome sequencing with short reads. We identified variants and created de novo assemblies in each trio, with global metrics indicating these datasets are more accurate and comprehensive than those provided by short-read data. In one proband, we identified a likely pathogenic (LP), de novo L1-mediated insertion in CDKL5 that results in duplication of exon 3, leading to a frameshift. In a second proband, we identified multiple large de novo structural variants, including insertion-translocations affecting DGKB and MLLT3, which we show disrupt MLLT3 transcript levels. We consider this extensive structural variation likely pathogenic. The breadth and quality of variant detection, coupled to finding variants of clinical and research interest in two of six probands with unexplained NDDs, support the hypothesis that long-read genome sequencing can substantially improve rare disease genetic discovery rates.http://www.sciencedirect.com/science/article/pii/S266624772100004Xlong read sequencingclinical sequencingneurodevelopmental disorderstructural variationmobile element insertion
collection DOAJ
language English
format Article
sources DOAJ
author Susan M. Hiatt
James M.J. Lawlor
Lori H. Handley
Ryne C. Ramaker
Brianne B. Rogers
E. Christopher Partridge
Lori Beth Boston
Melissa Williams
Christopher B. Plott
Jerry Jenkins
David E. Gray
James M. Holt
Kevin M. Bowling
E. Martina Bebin
Jane Grimwood
Jeremy Schmutz
Gregory M. Cooper
spellingShingle Susan M. Hiatt
James M.J. Lawlor
Lori H. Handley
Ryne C. Ramaker
Brianne B. Rogers
E. Christopher Partridge
Lori Beth Boston
Melissa Williams
Christopher B. Plott
Jerry Jenkins
David E. Gray
James M. Holt
Kevin M. Bowling
E. Martina Bebin
Jane Grimwood
Jeremy Schmutz
Gregory M. Cooper
Long-read genome sequencing for the molecular diagnosis of neurodevelopmental disorders
HGG Advances
long read sequencing
clinical sequencing
neurodevelopmental disorder
structural variation
mobile element insertion
author_facet Susan M. Hiatt
James M.J. Lawlor
Lori H. Handley
Ryne C. Ramaker
Brianne B. Rogers
E. Christopher Partridge
Lori Beth Boston
Melissa Williams
Christopher B. Plott
Jerry Jenkins
David E. Gray
James M. Holt
Kevin M. Bowling
E. Martina Bebin
Jane Grimwood
Jeremy Schmutz
Gregory M. Cooper
author_sort Susan M. Hiatt
title Long-read genome sequencing for the molecular diagnosis of neurodevelopmental disorders
title_short Long-read genome sequencing for the molecular diagnosis of neurodevelopmental disorders
title_full Long-read genome sequencing for the molecular diagnosis of neurodevelopmental disorders
title_fullStr Long-read genome sequencing for the molecular diagnosis of neurodevelopmental disorders
title_full_unstemmed Long-read genome sequencing for the molecular diagnosis of neurodevelopmental disorders
title_sort long-read genome sequencing for the molecular diagnosis of neurodevelopmental disorders
publisher Elsevier
series HGG Advances
issn 2666-2477
publishDate 2021-04-01
description Summary: Exome and genome sequencing have proven to be effective tools for the diagnosis of neurodevelopmental disorders (NDDs), but large fractions of NDDs cannot be attributed to currently detectable genetic variation. This is likely, at least in part, a result of the fact that many genetic variants are difficult or impossible to detect through typical short-read sequencing approaches. Here, we describe a genomic analysis using Pacific Biosciences circular consensus sequencing (CCS) reads, which are both long (>10 kb) and accurate (>99% bp accuracy). We used CCS on six proband-parent trios with NDDs that were unexplained despite extensive testing, including genome sequencing with short reads. We identified variants and created de novo assemblies in each trio, with global metrics indicating these datasets are more accurate and comprehensive than those provided by short-read data. In one proband, we identified a likely pathogenic (LP), de novo L1-mediated insertion in CDKL5 that results in duplication of exon 3, leading to a frameshift. In a second proband, we identified multiple large de novo structural variants, including insertion-translocations affecting DGKB and MLLT3, which we show disrupt MLLT3 transcript levels. We consider this extensive structural variation likely pathogenic. The breadth and quality of variant detection, coupled to finding variants of clinical and research interest in two of six probands with unexplained NDDs, support the hypothesis that long-read genome sequencing can substantially improve rare disease genetic discovery rates.
topic long read sequencing
clinical sequencing
neurodevelopmental disorder
structural variation
mobile element insertion
url http://www.sciencedirect.com/science/article/pii/S266624772100004X
work_keys_str_mv AT susanmhiatt longreadgenomesequencingforthemoleculardiagnosisofneurodevelopmentaldisorders
AT jamesmjlawlor longreadgenomesequencingforthemoleculardiagnosisofneurodevelopmentaldisorders
AT lorihhandley longreadgenomesequencingforthemoleculardiagnosisofneurodevelopmentaldisorders
AT rynecramaker longreadgenomesequencingforthemoleculardiagnosisofneurodevelopmentaldisorders
AT briannebrogers longreadgenomesequencingforthemoleculardiagnosisofneurodevelopmentaldisorders
AT echristopherpartridge longreadgenomesequencingforthemoleculardiagnosisofneurodevelopmentaldisorders
AT loribethboston longreadgenomesequencingforthemoleculardiagnosisofneurodevelopmentaldisorders
AT melissawilliams longreadgenomesequencingforthemoleculardiagnosisofneurodevelopmentaldisorders
AT christopherbplott longreadgenomesequencingforthemoleculardiagnosisofneurodevelopmentaldisorders
AT jerryjenkins longreadgenomesequencingforthemoleculardiagnosisofneurodevelopmentaldisorders
AT davidegray longreadgenomesequencingforthemoleculardiagnosisofneurodevelopmentaldisorders
AT jamesmholt longreadgenomesequencingforthemoleculardiagnosisofneurodevelopmentaldisorders
AT kevinmbowling longreadgenomesequencingforthemoleculardiagnosisofneurodevelopmentaldisorders
AT emartinabebin longreadgenomesequencingforthemoleculardiagnosisofneurodevelopmentaldisorders
AT janegrimwood longreadgenomesequencingforthemoleculardiagnosisofneurodevelopmentaldisorders
AT jeremyschmutz longreadgenomesequencingforthemoleculardiagnosisofneurodevelopmentaldisorders
AT gregorymcooper longreadgenomesequencingforthemoleculardiagnosisofneurodevelopmentaldisorders
_version_ 1721479849780969472