Deciphering polymorphism in 61,157 Escherichia coli genomes via epistatic sequence landscapes - Statistical Genomics and Biological Physics
Article Dans Une Revue Nature Communications Année : 2022

Deciphering polymorphism in 61,157 Escherichia coli genomes via epistatic sequence landscapes

Résumé

Characterizing the effect of mutations is key to understand the evolution of protein sequences and to separate neutral amino-acid changes from deleterious ones. Epistatic interactions between residues can lead to a context dependence of mutation effects. Context dependence constrains the amino-acid changes that can contribute to polymorphism in the short term, and the ones that can accumulate between species in the long term. We use computational approaches to accurately predict the polymorphisms segregating in a panel of 61,157 Escherichia coli genomes from the analysis of distant homologues. By comparing a context-aware Direct-Coupling Analysis modelling to a non-epistatic approach, we show that the genetic context strongly constrains the tolerable amino acids in 30% to 50% of amino-acid sites. The study of more distant species suggests the gradual build-up of genetic context over long evolutionary timescales by the accumulation of small epistatic contributions.
Fichier principal
Vignette du fichier
s41467-022-31643-3.pdf (2.16 Mo) Télécharger le fichier
Origine Publication financée par une institution

Dates et versions

hal-03721049 , version 1 (19-03-2024)

Identifiants

Citer

Lucile Vigué, Giancarlo Croce, Marie Petitjean, Etienne Ruppé, Olivier Tenaillon, et al.. Deciphering polymorphism in 61,157 Escherichia coli genomes via epistatic sequence landscapes. Nature Communications, 2022, 13 (1), pp.4030. ⟨10.1038/s41467-022-31643-3⟩. ⟨hal-03721049⟩
173 Consultations
30 Téléchargements

Altmetric

Partager

More