Jointly aligning a group of DNA reads improves accuracy of identifying large deletions

Anish M S Shrestha; Martin C. Frith; Kiyoshi Asai; Hugues Richard

doi:10.1093/nar/gkx1175

Journal article, Nucleic Acids Research, Year: 2018

Anish M S Shrestha

Role: Author

The University of Tokyo

Martin C. Frith

Role: Author
PersonId: 961948

The University of Tokyo

National Institute of Advanced Industrial Science and Technology

Kiyoshi Asai

Role: Author

The University of Tokyo

National Institute of Advanced Industrial Science and Technology

Hugues Richard

Role: Author

Biologie Computationnelle et Quantitative = Laboratory of Computational and Quantitative Biology

Abstract

Performing sequence alignment to identify structural variants, such as large deletions, from genome sequencing data is a fundamental task, but current methods are far from perfect. The current practice is to independently align each DNA read to a reference genome. We show that the propensity of genomic rearrangements to accumulate in repeat-rich regions imposes severe ambiguities in these alignments, and consequently on the variant calls: with current read lengths, this affects more than one third of known large deletions in the C. Venter genome. We present a method to jointly align reads to a genome, whereby alignment ambiguity of one read can be disambiguated by other reads. We show this leads to a significant improvement in the accuracy of identifying large deletions (≥20 bases), while imposing minimal computational overhead and maintaining an overall running time that is at par with current tools. A software implementation is available as an open-source Python program called JRA at https://bitbucket.org/jointreadalignment/jra-src.

Subject areas

Bioinformatics, Systems Biology [q-bio.QM]

Main file

gkx1175.pdf (990.25 KB)

Origin: Publication funded by an institution

https://hal.sorbonne-universite.fr/hal-01727521

Submitted on: Friday, 9 March 2018, 11:30:26

Last modified on: Wednesday, 30 October 2024, 13:10:52

Long-term archiving on: Sunday, 10 June 2018, 13:49:51

Dates and versions

hal-01727521, version 1 (09-03-2018)

License

Attribution

Identifiers

HAL Id: hal-01727521, version 1
DOI: 10.1093/nar/gkx1175

Cite

Anish M S Shrestha, Martin C. Frith, Kiyoshi Asai, Hugues Richard. Jointly aligning a group of DNA reads improves accuracy of identifying large deletions. Nucleic Acids Research, 2018, 46 (3), pp.e18. ⟨10.1093/nar/gkx1175⟩. ⟨hal-01727521⟩

Collections

CNRS LCQB LCQB-AG IBPS SORBONNE-UNIVERSITE SU-SCIENCES
