Abstract
The discovery of driver mutations is one of the key motivations for cancer genome sequencing. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2658 cancers across 38 tumour types, we describe DriverPower, a software package that uses mutational burden and functional impact evidence to identify driver mutations in coding and non-coding sites within cancer whole genomes. Using a total of 1373 genomic features derived from public sources, DriverPower's background mutation model explains up to 93% of the regional variance in the mutation rate across multiple tumour types. By incorporating functional impact scores, we are able to further increase the accuracy of driver discovery. Testing across a collection of 2583 cancer genomes from the PCAWG project, DriverPower identifies 217 coding and 95 non-coding driver candidates. Comparing to six published methods used by the PCAWG Drivers and Functional Interpretation Working Group, DriverPower has the highest F1 score for both coding and non-coding driver discovery. This demonstrates that DriverPower is an effective framework for computational driver discovery.
| Original language | English (US) |
|---|---|
| Pages (from-to) | 734 |
| Journal | Nature communications |
| Volume | 11 |
| Issue number | 1 |
| DOIs | |
| State | Published - Feb 5 2020 |
Keywords
- Algorithms
- Genome, Human
- Genomics/methods
- Humans
- MEF2 Transcription Factors/genetics
- Mutation
- Mutation Rate
- Neoplasms/genetics
- Peptide Elongation Factor 1/genetics
- Receptors, G-Protein-Coupled/genetics
- Software
- Whole Genome Sequencing
PubMed: MeSH publication types
- Journal Article
- Research Support, Non-U.S. Gov't