Reducing computational cost during robot navigation and human-robot interaction with a human-inspired reinforcement learning architecture

Rémi Dromnelle; Erwan Renaudo; Mohamed Chetouani; Petros Maragos; Raja Chatila; Benoît Girard; Mehdi Khamassi

doi:10.1007/s12369-022-00942-6

Article Dans Une Revue International Journal of Social Robotics Année : 2023

Reducing computational cost during robot navigation and human-robot interaction with a human-inspired reinforcement learning architecture

, , , , , , (1)

Rémi Dromnelle

Fonction : Auteur

Erwan Renaudo

Fonction : Auteur
PersonId : 8523
IdHAL : erwanrenaudo
ORCID : 0000-0003-3282-8972
IdRef : 200378791

Mohamed Chetouani

Fonction : Auteur
PersonId : 179528
IdHAL : mohamed-chetouani
ORCID : 0000-0002-2920-4539
IdRef : 089021916

Petros Maragos

Fonction : Auteur
PersonId : 843146

Raja Chatila

Fonction : Auteur
PersonId : 174618
IdHAL : raja-chatila
ORCID : 0000-0001-7822-0634
IdRef : 05977018X

Benoît Girard

Fonction : Auteur
PersonId : 1537
IdHAL : benoit-girard
ORCID : 0000-0002-8117-7064
IdRef : 089381092

Mehdi Khamassi

Fonction : Auteur
PersonId : 186
IdHAL : mehdi-khamassi
ORCID : 0000-0002-2515-1046
IdRef : 12845072X

Institut des Systèmes Intelligents et de Robotique

Résumé

We present a new neuro-inspired reinforcement learning architecture for robot online learning and decision-making during both social and non-social scenarios. The goal is to take inspiration from the way humans dynamically and autonomously adapt their behavior according to variations in their own performance while minimizing cognitive effort. Following computational neuroscience principles, the architecture combines model-based (MB) and model-free (MF) reinforcement learning (RL). The main novelty here consists in arbitrating with a meta-controller which selects the current learning strategy according to a trade-off between efficiency and computational cost. The MB strategy, which builds a model of the long-term effects of actions and uses this model to decide through dynamic programming, enables flexible adaptation to task changes at the expense of high computation costs. The MF strategy is less flexible but also 1000 times less costly, and learns by observation of MB decisions. We test the architecture in three experiments: a navigation task in a real environment with task changes (wall configuration changes, goal location changes); a simulated object manipulation task under human teaching signals; and a simulated human–robot cooperation task to tidy up objects on a table. We show that our human-inspired strategy coordination method enables the robot to maintain an optimal performance in terms of reward and computational cost compared to an MB expert alone, which achieves the best performance but has the highest computational cost. We also show that the method makes it possible to cope with sudden changes in the environment, goal changes or changes in the behavior of the human partner during interaction tasks. The robots that performed these experiments, whether real or virtual, all used the same set of parameters, thus showing the generality of the method.

Mots clés

Strategy coordination Cognitive monitoring Reinforcement learning Robot cognitive architecture Navigation HRI Neuro-inspiration

Domaines

Intelligence artificielle [cs.AI] Automatique / Robotique

Fichier principal

Dromnelle2022preprint.pdf (16.51 Mo)

Origine	Fichiers produits par l'(les) auteur(s)

Mehdi Khamassi : Connectez-vous pour contacter le contributeur

https://hal.sorbonne-universite.fr/hal-03829879

Soumis le : jeudi 24 novembre 2022-13:44:56

Dernière modification le : mercredi 30 octobre 2024-13:29:06

Archivage à long terme le : samedi 25 février 2023-19:36:29

Dates et versions

hal-03829879 , version 1 (24-11-2022)

Identifiants

HAL Id : hal-03829879 , version 1
DOI : 10.1007/s12369-022-00942-6

Citer

Rémi Dromnelle, Erwan Renaudo, Mohamed Chetouani, Petros Maragos, Raja Chatila, et al.. Reducing computational cost during robot navigation and human-robot interaction with a human-inspired reinforcement learning architecture. International Journal of Social Robotics, 2023, 15, pp.1297-1323. ⟨10.1007/s12369-022-00942-6⟩. ⟨hal-03829879⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS ISIR TDS-MACS SORBONNE-UNIVERSITE SU-SCIENCES ANR ISIR_AMAC ISIR_PIROS

193 Consultations

392 Téléchargements

Reducing computational cost during robot navigation and human-robot interaction with a human-inspired reinforcement learning architecture

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager