Deep phenotyping unstructured data mining in an extensive pediatric database to unravel a common KCNA2 variant in neurodevelopmental syndromes

Marie Hully; Tommaso Lo Barco; Anna Kaminska; Giulia Barcia; Claude Cances; Cyril Mignot; Isabelle Desguerre; Nicolas Garcelon; Edor Kabashi; Rima Nabbout

doi:10.1038/s41436-020-01039-z

Article Dans Une Revue Genetics in Medicine Année : 2021

Deep phenotyping unstructured data mining in an extensive pediatric database to unravel a common KCNA2 variant in neurodevelopmental syndromes

(1) , (1) , (1) , (1) , (2) , (3, 4) , (1) , (5, 6) , (5) , (1, 5)

1
2
3
4
5
6

Marie Hully

Fonction : Auteur

Hôpital Necker - Enfants Malades [AP-HP]

Tommaso Lo Barco

Fonction : Auteur

Hôpital Necker - Enfants Malades [AP-HP]

Anna Kaminska

Fonction : Auteur

Hôpital Necker - Enfants Malades [AP-HP]

Giulia Barcia

Fonction : Auteur

Hôpital Necker - Enfants Malades [AP-HP]

Claude Cances

Fonction : Auteur

Centre Hospitalier Universitaire de Toulouse

Cyril Mignot

Fonction : Auteur

CHU Pitié-Salpêtrière [AP-HP]

Sorbonne Université - Faculté de Médecine

Isabelle Desguerre

Fonction : Auteur

Hôpital Necker - Enfants Malades [AP-HP]

Nicolas Garcelon

Fonction : Auteur
PersonId : 764876
ORCID : 0000-0002-3326-2811
IdRef : 234179031

Imagine - Institut des maladies génétiques (IHU)

Centre de Recherche des Cordeliers

Edor Kabashi

Fonction : Auteur
PersonId : 774219
ORCID : 0000-0003-4118-251X

Imagine - Institut des maladies génétiques (IHU)

Rima Nabbout

Fonction : Auteur

Hôpital Necker - Enfants Malades [AP-HP]

Imagine - Institut des maladies génétiques (IHU)

Résumé

Purpose: Electronic health records are gaining popularity to detect and propose interdisciplinary treatments for patients with similar medical histories, diagnoses, and outcomes. These files are compiled by different nonexperts and expert clinicians. Data mining in these unstructured data is a transposable and sustainable methodology to search for patients presenting a high similitude of clinical features. Methods: Exome and targeted next-generation sequencing bioinformatics analyses were performed at the Imagine Institute. Similarity Index (SI), an algorithm based on a vector space model (VSM) that exploits concepts extracted from clinical narrative reports was used to identify patients with highly similar clinical features. Results: Here we describe a case of "automated diagnosis" indicated by Dr. Warehouse, a biomedical data warehouse oriented toward clinical narrative reports, developed at Necker Children's Hospital using around 500,000 patients' records. Through the use of this warehouse, we were able to match and identify two patients sharing very specific clinical neonatal and childhood features harboring the same de novo variant in KCNA2. Conclusion: This innovative application of database clustering clinical features could advance identification of patients with rare and common genetic conditions and detect with high accuracy the natural history of patients harboring similar genetic pathogenic variants.

Domaines

Sciences du Vivant [q-bio]

Fichier principal

s41436-020-01039-z.pdf (746.22 Ko)

Origine	Publication financée par une institution

Gestionnaire HAL 4 Sorbonne Université : Connectez-vous pour contacter le contributeur

https://hal.sorbonne-universite.fr/hal-03967477

Soumis le : lundi 1 février 2021-11:44:26

Dernière modification le : vendredi 19 avril 2024-16:18:59

Archivage à long terme le : dimanche 2 mai 2021-19:07:17

Dates et versions

hal-03967477 , version 1 (01-02-2021)

hal-03967477 , version 2 (27-11-2023)

Identifiants

HAL Id : hal-03967477 , version 1
DOI : 10.1038/s41436-020-01039-z
PUBMED : 33500571

Citer

Marie Hully, Tommaso Lo Barco, Anna Kaminska, Giulia Barcia, Claude Cances, et al.. Deep phenotyping unstructured data mining in an extensive pediatric database to unravel a common KCNA2 variant in neurodevelopmental syndromes. Genetics in Medicine, 2021, ⟨10.1038/s41436-020-01039-z⟩. ⟨hal-03967477v1⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

93 Consultations

76 Téléchargements

Deep phenotyping unstructured data mining in an extensive pediatric database to unravel a common KCNA2 variant in neurodevelopmental syndromes

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Altmetric

Partager