The landscape and driver potential of site-specific hotspots across cancer genomes

Abstract Large sets of whole cancer genomes make it possible to study mutation hotspots genome-wide. Here we detect, categorize, and characterize site-specific hotspots using 2279 whole cancer genomes from the Pan-Cancer Analysis of Whole Genomes project and provide a resource of annotated hotspots...

Full description

Bibliographic Details
Main Authors: Randi Istrup Juul, Morten Muhlig Nielsen, Malene Juul, Lars Feuerbach, Jakob Skou Pedersen
Format: Article
Language:English
Published: Nature Publishing Group 2021-05-01
Series:npj Genomic Medicine
Online Access:https://doi.org/10.1038/s41525-021-00197-6
id doaj-d01e75884cdc498f9e404ea996710e9f
record_format Article
spelling doaj-d01e75884cdc498f9e404ea996710e9f2021-05-16T11:32:55ZengNature Publishing Groupnpj Genomic Medicine2056-79442021-05-016111510.1038/s41525-021-00197-6The landscape and driver potential of site-specific hotspots across cancer genomesRandi Istrup Juul0Morten Muhlig Nielsen1Malene Juul2Lars Feuerbach3Jakob Skou Pedersen4Department of Molecular Medicine, Aarhus University HospitalDepartment of Molecular Medicine, Aarhus University HospitalDepartment of Molecular Medicine, Aarhus University HospitalDivision of Applied Bioinformatics, German Cancer Research CenterDepartment of Molecular Medicine, Aarhus University HospitalAbstract Large sets of whole cancer genomes make it possible to study mutation hotspots genome-wide. Here we detect, categorize, and characterize site-specific hotspots using 2279 whole cancer genomes from the Pan-Cancer Analysis of Whole Genomes project and provide a resource of annotated hotspots genome-wide. We investigate the excess of hotspots in both protein-coding and gene regulatory regions and develop measures of positive selection and functional impact for individual hotspots. Using cancer allele fractions, expression aberrations, mutational signatures, and a variety of genomic features, such as potential gain or loss of transcription factor binding sites, we annotate and prioritize all highly mutated hotspots. Genome-wide we find more high-frequency SNV and indel hotspots than expected given mutational background models. Protein-coding regions are generally enriched for SNV hotspots compared to other regions. Gene regulatory hotspots show enrichment of potential same-patient second-hit missense mutations, consistent with enrichment of hotspot driver mutations compared to singletons. For protein-coding regions, splice-sites, promoters, and enhancers, we see an excess of hotspots associated with cancer genes. Interestingly, missense hotspot mutations in tumor suppressors are associated with elevated expression, suggesting localized amino-acid changes with functional impact. For individual non-coding hotspots, only a small number show clear signs of positive selection, including known sites in the TERT promoter and the 5’ UTR of TP53. Most of the new candidates have few mutations and limited driver evidence. However, a hotspot in an enhancer of the oncogene POU2AF1, which may create a transcription factor binding site, presents multiple lines of driver-consistent evidence.https://doi.org/10.1038/s41525-021-00197-6
collection DOAJ
language English
format Article
sources DOAJ
author Randi Istrup Juul
Morten Muhlig Nielsen
Malene Juul
Lars Feuerbach
Jakob Skou Pedersen
spellingShingle Randi Istrup Juul
Morten Muhlig Nielsen
Malene Juul
Lars Feuerbach
Jakob Skou Pedersen
The landscape and driver potential of site-specific hotspots across cancer genomes
npj Genomic Medicine
author_facet Randi Istrup Juul
Morten Muhlig Nielsen
Malene Juul
Lars Feuerbach
Jakob Skou Pedersen
author_sort Randi Istrup Juul
title The landscape and driver potential of site-specific hotspots across cancer genomes
title_short The landscape and driver potential of site-specific hotspots across cancer genomes
title_full The landscape and driver potential of site-specific hotspots across cancer genomes
title_fullStr The landscape and driver potential of site-specific hotspots across cancer genomes
title_full_unstemmed The landscape and driver potential of site-specific hotspots across cancer genomes
title_sort landscape and driver potential of site-specific hotspots across cancer genomes
publisher Nature Publishing Group
series npj Genomic Medicine
issn 2056-7944
publishDate 2021-05-01
description Abstract Large sets of whole cancer genomes make it possible to study mutation hotspots genome-wide. Here we detect, categorize, and characterize site-specific hotspots using 2279 whole cancer genomes from the Pan-Cancer Analysis of Whole Genomes project and provide a resource of annotated hotspots genome-wide. We investigate the excess of hotspots in both protein-coding and gene regulatory regions and develop measures of positive selection and functional impact for individual hotspots. Using cancer allele fractions, expression aberrations, mutational signatures, and a variety of genomic features, such as potential gain or loss of transcription factor binding sites, we annotate and prioritize all highly mutated hotspots. Genome-wide we find more high-frequency SNV and indel hotspots than expected given mutational background models. Protein-coding regions are generally enriched for SNV hotspots compared to other regions. Gene regulatory hotspots show enrichment of potential same-patient second-hit missense mutations, consistent with enrichment of hotspot driver mutations compared to singletons. For protein-coding regions, splice-sites, promoters, and enhancers, we see an excess of hotspots associated with cancer genes. Interestingly, missense hotspot mutations in tumor suppressors are associated with elevated expression, suggesting localized amino-acid changes with functional impact. For individual non-coding hotspots, only a small number show clear signs of positive selection, including known sites in the TERT promoter and the 5’ UTR of TP53. Most of the new candidates have few mutations and limited driver evidence. However, a hotspot in an enhancer of the oncogene POU2AF1, which may create a transcription factor binding site, presents multiple lines of driver-consistent evidence.
url https://doi.org/10.1038/s41525-021-00197-6
work_keys_str_mv AT randiistrupjuul thelandscapeanddriverpotentialofsitespecifichotspotsacrosscancergenomes
AT mortenmuhlignielsen thelandscapeanddriverpotentialofsitespecifichotspotsacrosscancergenomes
AT malenejuul thelandscapeanddriverpotentialofsitespecifichotspotsacrosscancergenomes
AT larsfeuerbach thelandscapeanddriverpotentialofsitespecifichotspotsacrosscancergenomes
AT jakobskoupedersen thelandscapeanddriverpotentialofsitespecifichotspotsacrosscancergenomes
AT randiistrupjuul landscapeanddriverpotentialofsitespecifichotspotsacrosscancergenomes
AT mortenmuhlignielsen landscapeanddriverpotentialofsitespecifichotspotsacrosscancergenomes
AT malenejuul landscapeanddriverpotentialofsitespecifichotspotsacrosscancergenomes
AT larsfeuerbach landscapeanddriverpotentialofsitespecifichotspotsacrosscancergenomes
AT jakobskoupedersen landscapeanddriverpotentialofsitespecifichotspotsacrosscancergenomes
_version_ 1721439404051922944