Statistical inference for the evolutionary history of cancer genomes - Sorbonne Université Accéder directement au contenu
Article Dans Une Revue Statistical Science Année : 2020

Statistical inference for the evolutionary history of cancer genomes

Résumé

Recent years have seen considerable work on inference about cancer evolution from mutations identified in cancer samples. Much of the modeling work has been based on classical models of population genetics , generalized to accommodate time-varying cell population size. Reverse-time, genealogical views of such models, commonly known as coalescents, have been used to infer aspects of the past of growing populations. Another approach is to use branching processes, the simplest scenario being the classical linear birth-death process. Inference from evolutionary models of DNA often exploits summary statistics of the sequence data, a common one being the so-called Site Frequency Spectrum (SFS). In a bulk tumor sequencing experiment we can estimate for each site at which a novel somatic point mutation has arisen, the proportion of cells that carry that mutation. These numbers are then grouped into collections of sites which have similar mutant fractions. We examine how the SFS based on birth-death processes differs from those based on the coalescent model. This may stem from the different sampling mechanisms in the two approaches. However, we also show that despite this, they are quantitatively comparable for the range of parameters typical for tumor cell populations. We also present a model of tumor evolution with selective sweeps, and demonstrate how it may help in understanding the history of a tumor as well as the influence of data pre-processing. We illustrate the theory with applications to
Fichier principal
Vignette du fichier
StatSci_Main_Text.pdf (2.43 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02988174 , version 1 (04-11-2020)

Identifiants

Citer

Khanh N Dinh, Roman Jaksik, Marek Kimmel, Amaury Lambert, Simon Tavare. Statistical inference for the evolutionary history of cancer genomes. Statistical Science, 2020, 35 (1), pp.129-144. ⟨10.1214/19-STS7561⟩. ⟨hal-02988174⟩
49 Consultations
128 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More