Which Discriminator for Cooperative Text Generation?

Antoine Chaffin; Thomas Scialom; Sylvain Lamprier; Jacopo Staiano; Benjamin Piwowarski; Ewa Kijak; Vincent Claveau

doi:10.1145/3477495.3531858

Communication Dans Un Congrès Année : 2022

Which Discriminator for Cooperative Text Generation?

(1, 2) , (3, 4) , (4) , (3) , (4) , (2) , (2)

1
2
3
4

Antoine Chaffin

Fonction : Auteur

IMATAG [Rennes]

Creating and exploiting explicit links between multimedia fragments

Thomas Scialom

Fonction : Auteur

reciTAL

Machine Learning and Information Access

Sylvain Lamprier

Fonction : Auteur
PersonId : 740402
IdHAL : sylvain-lamprier
ORCID : 0000-0002-2508-922X
IdRef : 142632201

Machine Learning and Information Access

Jacopo Staiano

Fonction : Auteur

reciTAL

Benjamin Piwowarski

Fonction : Auteur
PersonId : 9362
IdHAL : benjamin-piwowarski
ORCID : 0000-0001-6792-3262
IdRef : 226846601

Machine Learning and Information Access

Ewa Kijak

Fonction : Auteur
PersonId : 20756
IdHAL : ekijak
IdRef : 07598640X

Creating and exploiting explicit links between multimedia fragments

Vincent Claveau

Fonction : Auteur
PersonId : 5270
IdHAL : vincent-claveau
ORCID : 0000-0002-3459-0550
IdRef : 075988216

Creating and exploiting explicit links between multimedia fragments

Résumé

Language models generate texts by successively predicting probability distributions for next tokens given past ones. A growing field of interest tries to leverage external information in the decoding process so that the generated texts have desired properties, such as being more natural, non toxic, faithful, or having a specific writing style. A solution is to use a classifier at each generation step, resulting in a cooperative environment where the classifier guides the decoding of the language model distribution towards relevant texts for the task at hand. In this paper, we examine three families of (transformer-based) discriminators for this specific task of cooperative decoding: bidirectional, left-to-right and generative ones. We evaluate the pros and cons of these different types of discriminators for cooperative generation, exploring respective accuracy on classification tasks along with their impact on the resulting sample quality and computational performances. We also provide the code of a batched implementation of the powerful cooperative decoding strategy used for our experiments, the Monte Carlo Tree Search, working with each discriminator for Natural Language Generation.

Mots clés

natural language generation cooperative discriminator monte carlo tree search attention empirical performance

Domaines

Recherche d'information [cs.IR] Intelligence artificielle [cs.AI] Apprentissage [cs.LG] Traitement du texte et du document

Fichier principal

Which_discrim_arXiv.pdf (749.34 Ko)

Origine	Fichiers produits par l'(les) auteur(s)

Benjamin Piwowarski : Connectez-vous pour contacter le contributeur

https://hal.sorbonne-universite.fr/hal-03718429

Soumis le : vendredi 21 octobre 2022-11:20:24

Dernière modification le : jeudi 7 novembre 2024-14:16:02

Dates et versions

hal-03718429 , version 1 (21-10-2022)

Licence

Paternité

Identifiants

HAL Id : hal-03718429 , version 1
ARXIV : 2204.11586
DOI : 10.1145/3477495.3531858

Citer

Antoine Chaffin, Thomas Scialom, Sylvain Lamprier, Jacopo Staiano, Benjamin Piwowarski, et al.. Which Discriminator for Cooperative Text Generation?. SIGIR 2022 - 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Jul 2022, Madrid, Spain. pp.2360-2365, ⟨10.1145/3477495.3531858⟩. ⟨hal-03718429⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA ISIR CENTRALESUPELEC INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES SORBONNE-UNIVERSITE SU-SCIENCES UR1-MATH-NUM CYBERSCHOOL ISIR_MLIA

130 Consultations

99 Téléchargements

Which Discriminator for Cooperative Text Generation?

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager