Word embedding for French natural language in healthcare: a comparative study (Preprint)
Résumé
Word embedding technologies, a set of language modeling and feature learning techniques in natural language processing (NLP), are now used in a wide range of applications. However, no formal evaluation and comparison have been made on the ability of each of the 3 current most famous unsupervised implementations (Word2Vec, GloVe, and FastText) to keep track of the semantic similarities existing between words, when trained on the same dataset.
Domaines
Ingénierie biomédicaleOrigine | Fichiers éditeurs autorisés sur une archive ouverte |
---|
Loading...