A Search Engine to Access PubMed Monolingual Subsets: Proof of Concept and Evaluation in French

Abstract : Background: PubMed contains numerous articles in languages other than English. However, existing solutions to access these articles in the language in which they were written remain unconvincing. Objective: The aim of this study was to propose a practical search engine, called Multilingual PubMed, which will permit access to a PubMed subset in 1 language and to evaluate the precision and coverage for the French version (Multilingual PubMed-French). Methods: To create this tool, translations of MeSH were enriched (eg, adding synonyms and translations in French) and integrated into a terminology portal. PubMed subsets in several European languages were also added to our database using a dedicated parser. The response time for the generic semantic search engine was evaluated for simple queries. BabelMeSH, Multilingual PubMed-French, and 3 different PubMed strategies were compared by searching for literature in French. Precision and coverage were measured for 20 randomly selected queries. The results were evaluated as relevant to title and abstract, the evaluator being blind to search strategy. Results: More than 650,000 PubMed citations in French were integrated into the Multilingual PubMed-French information system. The response times were all below the threshold defined for usability (2 seconds). Two search strategies (Multilingual PubMed-French and 1 PubMed strategy) showed high precision (0.93 and 0.97, respectively), but coverage was 4 times higher for Multilingual PubMed-French. Conclusions: It is now possible to freely access biomedical literature using a practical search tool in French. This tool will be of particular interest for health professionals and other end users who do not read or query sufficiently in English. The information system is theoretically well suited to expand the approach to other European languages, such as German, Spanish, Norwegian, and Portuguese.
Complete list of metadatas

Cited literature [21 references]  Display  Hide  Download

https://hal.sorbonne-universite.fr/hal-01329375
Contributor : Gestionnaire Hal-Upmc <>
Submitted on : Thursday, June 9, 2016 - 10:29:20 AM
Last modification on : Thursday, March 21, 2019 - 2:17:16 PM

File

fc-xsltGalley-3836-48751-7-PB....
Publication funded by an institution

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

Citation

Nicolas Griffon, Matthieu Schuers, Lina Fatima Soualmia, Julien Grosjean, Gaétan Kerdelhué, et al.. A Search Engine to Access PubMed Monolingual Subsets: Proof of Concept and Evaluation in French. Journal of Medical Internet Research, Journal of Medical Internet Research, 2014, 16 (12), pp.e271. ⟨10.2196/jmir.3836⟩. ⟨hal-01329375⟩

Share

Metrics

Record views

959

Files downloads

202