Assessing rare diseases prevalence using literature quantification
Abstract
Introduction
Estimating the prevalence of diseases is crucial for the organization of healthcare. The amount of literature on a rare pathology could help differentiate between rare and very rare diseases. The objective of this work was to evaluate to what extent the number of publications can be used to predict the prevalence of a given pathology.
Methods
We queried Orphanet for the global prevalence class for all conditions for which it was available. For these pathologies, we cross-referenced the Orphanet, MeSH, and OMIM vocabularies to assess the number of publication available on Pubmed using three different query strategies (one proposed in the literature, and two built specifically for this study). We first studied the association of the number of publications obtained by each of these query strategies with the prevalence class, then their predictive ability.
Results
Class prevalence was available for 3128 conditions, 2970 had a prevalence class < 1/1,000,000, 41 of 1–9/1,000,000, 84 of 1–9/100,000, and 33 of 1–9/10,000. We show a significant association and excellent predictive performance of the number of publication, with an AUC over 94% for the best query strategy.
Conclusion
Our study highlights the link and the excellent predictive performance of the number of publications on the prevalence of rare diseases provided by Orphanet.
Domains
Human health and pathologyOrigin | Publication funded by an institution |
---|