Vers une vérification automatique des affirmations statistiques

Abstract : Digital content is increasingly produced nowadays in a variety of media such as news and social network sites, personal Web sites, blogs etc. In particular, a large and dynamic part of such content is related to media-worthy events, whether of general interest (e.g., the war in Syria) or of specialized interest to a sub-community of users (e.g., sport events or genetically modified organisms). While such content is primarily meant for the human users (readers), interest is growing in its automatic analysis, understanding and exploitation. Within the ANR project ContentCheck, we are interested in developing textual and semantic tools for analyzing content shared through digital media. The proposed PhD project takes place within this contract, and will be developed based on the interactions with our partner from Le Monde. The PhD project aims at developing algorithms and tools for :Classifying and annotating mixed content (from articles, structured databases, social media etc.) based on an existing set of topics (or ontology) ;Information and relation extraction from a text which may comprise a statement to be fact-checked, with a particular focus on capturing the time dimension ; a sample statement is for instance « VAT on iron in France was the highest in Europe in 2015 ».Building structured queries from extracted information and relations, to be evaluated against reference databases used as trusted information against which facts can be checked.
Complete list of metadatas

Cited literature [150 references]  Display  Hide  Download
Contributor : Abes Star <>
Submitted on : Monday, January 13, 2020 - 3:55:18 PM
Last modification on : Wednesday, January 15, 2020 - 1:24:34 AM


Version validated by the jury (STAR)


  • HAL Id : tel-02437183, version 1


Tien Duc Cao. Vers une vérification automatique des affirmations statistiques. Programming Languages [cs.PL]. Université Paris-Saclay, 2019. English. ⟨NNT : 2019SACLX051⟩. ⟨tel-02437183⟩



Record views


Files downloads