Telomerecat: A ploidy-agnostic method for estimating telomere length from whole genome sequencing data

Abstract Telomere length is a risk factor in disease and the dynamics of telomere length are crucial to our understanding of cell replication and vitality. The proliferation of whole genome sequencing represents an unprecedented opportunity to glean new insights into telomere biology on a previously...

Full description

Bibliographic Details
Main Authors: James H. R. Farmery, Mike L. Smith, NIHR BioResource - Rare Diseases, Andy G. Lynch
Format: Article
Language:English
Published: Nature Publishing Group 2018-01-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-017-14403-y
id doaj-925eca9b0f364764afe1780662261112
record_format Article
spelling doaj-925eca9b0f364764afe17806622611122020-12-08T04:48:53ZengNature Publishing GroupScientific Reports2045-23222018-01-018111710.1038/s41598-017-14403-yTelomerecat: A ploidy-agnostic method for estimating telomere length from whole genome sequencing dataJames H. R. Farmery0Mike L. Smith1NIHR BioResource - Rare DiseasesAndy G. Lynch2Cancer Research UK Cambridge Institute, University of Cambridge, Li Ka Shing Centre, Robinson WayEuropean Molecular Biology Laboratory (EMBL), Genome Biology UnitCancer Research UK Cambridge Institute, University of Cambridge, Li Ka Shing Centre, Robinson WayAbstract Telomere length is a risk factor in disease and the dynamics of telomere length are crucial to our understanding of cell replication and vitality. The proliferation of whole genome sequencing represents an unprecedented opportunity to glean new insights into telomere biology on a previously unimaginable scale. To this end, a number of approaches for estimating telomere length from whole-genome sequencing data have been proposed. Here we present Telomerecat, a novel approach to the estimation of telomere length. Previous methods have been dependent on the number of telomeres present in a cell being known, which may be problematic when analysing aneuploid cancer data and non-human samples. Telomerecat is designed to be agnostic to the number of telomeres present, making it suited for the purpose of estimating telomere length in cancer studies. Telomerecat also accounts for interstitial telomeric reads and presents a novel approach to dealing with sequencing errors. We show that Telomerecat performs well at telomere length estimation when compared to leading experimental and computational methods. Furthermore, we show that it detects expected patterns in longitudinal data, repeated measurements, and cross-species comparisons. We also apply the method to a cancer cell data, uncovering an interesting relationship with the underlying telomerase genotype.https://doi.org/10.1038/s41598-017-14403-y
collection DOAJ
language English
format Article
sources DOAJ
author James H. R. Farmery
Mike L. Smith
NIHR BioResource - Rare Diseases
Andy G. Lynch
spellingShingle James H. R. Farmery
Mike L. Smith
NIHR BioResource - Rare Diseases
Andy G. Lynch
Telomerecat: A ploidy-agnostic method for estimating telomere length from whole genome sequencing data
Scientific Reports
author_facet James H. R. Farmery
Mike L. Smith
NIHR BioResource - Rare Diseases
Andy G. Lynch
author_sort James H. R. Farmery
title Telomerecat: A ploidy-agnostic method for estimating telomere length from whole genome sequencing data
title_short Telomerecat: A ploidy-agnostic method for estimating telomere length from whole genome sequencing data
title_full Telomerecat: A ploidy-agnostic method for estimating telomere length from whole genome sequencing data
title_fullStr Telomerecat: A ploidy-agnostic method for estimating telomere length from whole genome sequencing data
title_full_unstemmed Telomerecat: A ploidy-agnostic method for estimating telomere length from whole genome sequencing data
title_sort telomerecat: a ploidy-agnostic method for estimating telomere length from whole genome sequencing data
publisher Nature Publishing Group
series Scientific Reports
issn 2045-2322
publishDate 2018-01-01
description Abstract Telomere length is a risk factor in disease and the dynamics of telomere length are crucial to our understanding of cell replication and vitality. The proliferation of whole genome sequencing represents an unprecedented opportunity to glean new insights into telomere biology on a previously unimaginable scale. To this end, a number of approaches for estimating telomere length from whole-genome sequencing data have been proposed. Here we present Telomerecat, a novel approach to the estimation of telomere length. Previous methods have been dependent on the number of telomeres present in a cell being known, which may be problematic when analysing aneuploid cancer data and non-human samples. Telomerecat is designed to be agnostic to the number of telomeres present, making it suited for the purpose of estimating telomere length in cancer studies. Telomerecat also accounts for interstitial telomeric reads and presents a novel approach to dealing with sequencing errors. We show that Telomerecat performs well at telomere length estimation when compared to leading experimental and computational methods. Furthermore, we show that it detects expected patterns in longitudinal data, repeated measurements, and cross-species comparisons. We also apply the method to a cancer cell data, uncovering an interesting relationship with the underlying telomerase genotype.
url https://doi.org/10.1038/s41598-017-14403-y
work_keys_str_mv AT jameshrfarmery telomerecataploidyagnosticmethodforestimatingtelomerelengthfromwholegenomesequencingdata
AT mikelsmith telomerecataploidyagnosticmethodforestimatingtelomerelengthfromwholegenomesequencingdata
AT nihrbioresourcerarediseases telomerecataploidyagnosticmethodforestimatingtelomerelengthfromwholegenomesequencingdata
AT andyglynch telomerecataploidyagnosticmethodforestimatingtelomerelengthfromwholegenomesequencingdata
_version_ 1724392039093633024