To Beam Or Not To Beam: That is a Question of Cooperation for Language GANs - Sorbonne Université Accéder directement au contenu
Communication Dans Un Congrès Année : 2021

To Beam Or Not To Beam: That is a Question of Cooperation for Language GANs

Thomas Scialom
  • Fonction : Auteur
  • PersonId : 1109225
Paul-Alexis Dray
  • Fonction : Auteur
  • PersonId : 1112181
Jacopo Staiano
  • Fonction : Auteur
  • PersonId : 1109226
Sylvain Lamprier

Résumé

Due to the discrete nature of words, language GANs require to be optimized from rewards provided by discriminator networks, via reinforcement learning methods. This is a much harder setting than for continuous tasks, which enjoy gradient flows from discriminators to generators, usually leading to dramatic learning instabilities. However, we claim that this can be solved by making discriminator and generator networks cooperate to produce output sequences during training. These cooperative outputs, inherently built to obtain higher discrimination scores, not only provide denser rewards for training, but also form a more compact artificial set for discriminator training, hence improving its accuracy and stability. In this paper, we show that our SelfGAN framework, built on this cooperative principle, outperforms Teacher Forcing and obtains state-of-the-art results on two challenging tasks, Summarization and Question Generation.
Fichier principal
Vignette du fichier
NeurIPS-2021-to-beam-or-not-to-beam-that-is-a-question-of-cooperation-for-language-gans-Paper.pdf (511.93 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03923314 , version 1 (04-01-2023)

Identifiants

  • HAL Id : hal-03923314 , version 1

Citer

Thomas Scialom, Paul-Alexis Dray, Jacopo Staiano, Sylvain Lamprier, Benjamin Piwowarski. To Beam Or Not To Beam: That is a Question of Cooperation for Language GANs. Advances in Neural Information Processing Systems, Dec 2021, Virtual, United States. pp.26585--26597. ⟨hal-03923314⟩
13 Consultations
4 Téléchargements

Partager

Gmail Facebook X LinkedIn More