Self-Attention Architectures for Answer-Agnostic Neural Question Generation

Thomas Scialom; Benjamin Piwowarski; Jacopo Staiano

doi:10.18653/v1/P19-1604

Communication Dans Un Congrès Année : 2019

Self-Attention Architectures for Answer-Agnostic Neural Question Generation

(1) , (2) ,

1
2

Thomas Scialom

Fonction : Auteur

Machine Learning and Information Access

Benjamin Piwowarski

Fonction : Auteur
PersonId : 9362
IdHAL : benjamin-piwowarski
ORCID : 0000-0001-6792-3262
IdRef : 226846601

Bases de Données

Jacopo Staiano

Fonction : Auteur

Résumé

Neural architectures based on self-attention, such as Transformers, recently attracted interest from the research community, and obtained significant improvements over the state of the art in several tasks. We explore how Transformers can be adapted to the task of Neural Question Generation without constraining the model to focus on a specific answer passage. We study the effect of several strategies to deal with out-of-vocabulary words such as copy mechanisms, placeholders, and contextual word embeddings. We report improvements obtained over the state-of-the-art on the SQuAD dataset according to automated metrics (BLEU, ROUGE), as well as qualitative human assessments of the system outputs.

Domaines

Recherche d'information [cs.IR] Intelligence artificielle [cs.AI] Apprentissage [cs.LG] Traitement du texte et du document

Benjamin Piwowarski : Connectez-vous pour contacter le contributeur

https://hal.sorbonne-universite.fr/hal-02350993

Soumis le : mercredi 6 novembre 2019-11:19:32

Dernière modification le : mercredi 30 octobre 2024-13:32:37

Dates et versions

hal-02350993 , version 1 (06-11-2019)

Identifiants

HAL Id : hal-02350993 , version 1
DOI : 10.18653/v1/P19-1604

Citer

Thomas Scialom, Benjamin Piwowarski, Jacopo Staiano. Self-Attention Architectures for Answer-Agnostic Neural Question Generation. ACL 2019 - Annual Meeting of the Association for Computational Linguistics, Jul 2019, Florence, Italy. pp.6027-6032, ⟨10.18653/v1/P19-1604⟩. ⟨hal-02350993⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS LIP6 SORBONNE-UNIVERSITE SU-SCIENCES

109 Consultations

0 Téléchargements

Self-Attention Architectures for Answer-Agnostic Neural Question Generation

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager