Sequence-to-Sequence Predictive Models: From Prosody to Communicative Gestures
Abstract
Communicative gestures and speech prosody are tightly linked. Our aim is to predict when gestures are performed, based on prosody. We develop a model based on a sequence-to-sequence (seq2seq) recurrent neural network with an attention mechanism. The model is trained on a corpus of natural dyadic interaction in which the speech prosody and the gestures have been annotated. Because the output of the model is a sequence, we use a sequence comparison technique to evaluate the model's performance. We find that the model can predict certain gesture classes. In our experiment, we also replace some input features with random values to determine which prosody features are pertinent, and find that F0 is among them. Lastly, we train the model on one speaker and test it on the other speaker to assess whether the model generalises; we find that a model trained on one speaker also works for the other speaker in the same conversation.
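The abstract does not name the sequence comparison technique used for evaluation. One common choice for comparing a predicted gesture-label sequence against a reference sequence is edit (Levenshtein) distance; the sketch below is an illustrative assumption, not the paper's actual metric, and the gesture labels (`"beat"`, `"iconic"`, `"none"`) are hypothetical.

```python
def edit_distance(pred, ref):
    """Levenshtein distance between two gesture-label sequences:
    the minimum number of insertions, deletions, and substitutions
    needed to turn `pred` into `ref`."""
    m, n = len(pred), len(ref)
    # dp[i][j] = distance between pred[:i] and ref[:j]
    dp = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m + 1):
        dp[i][0] = i  # delete all of pred[:i]
    for j in range(n + 1):
        dp[0][j] = j  # insert all of ref[:j]
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            cost = 0 if pred[i - 1] == ref[j - 1] else 1
            dp[i][j] = min(dp[i - 1][j] + 1,        # deletion
                           dp[i][j - 1] + 1,        # insertion
                           dp[i - 1][j - 1] + cost) # substitution / match
    return dp[m][n]

# One substitution ("none" -> "iconic" at position 2) separates the sequences.
pred = ["beat", "none", "iconic", "none"]
ref = ["beat", "iconic", "iconic", "none"]
print(edit_distance(pred, ref))  # -> 1
```

A distance of 0 means the predicted gesture sequence exactly matches the annotation; normalising by the reference length gives a per-sequence error rate comparable across utterances of different lengths.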