Reducing computational cost during robot navigation and human-robot interaction with a human-inspired reinforcement learning architecture - Sorbonne Université Access content directly
Journal Articles International Journal of Social Robotics Year : 2023

Reducing computational cost during robot navigation and human-robot interaction with a human-inspired reinforcement learning architecture

Rémi Dromnelle
  • Function : Author
Erwan Renaudo
Mohamed Chetouani
Petros Maragos
  • Function : Author
  • PersonId : 843146
Raja Chatila
Benoît Girard

Abstract

We present a new neuro-inspired reinforcement learning architecture for robot online learning and decision-making during both social and non-social scenarios. The goal is to take inspiration from the way humans dynamically and autonomously adapt their behavior according to variations in their own performance while minimizing cognitive effort. Following computational neuroscience principles, the architecture combines model-based (MB) and model-free (MF) reinforcement learning (RL). The main novelty here consists in arbitrating with a meta-controller which selects the current learning strategy according to a trade-off between efficiency and computational cost. The MB strategy, which builds a model of the long-term effects of actions and uses this model to decide through dynamic programming, enables flexible adaptation to task changes at the expense of high computation costs. The MF strategy is less flexible but also 1000 times less costly, and learns by observation of MB decisions. We test the architecture in three experiments: a navigation task in a real environment with task changes (wall configuration changes, goal location changes); a simulated object manipulation task under human teaching signals; and a simulated human–robot cooperation task to tidy up objects on a table. We show that our human-inspired strategy coordination method enables the robot to maintain an optimal performance in terms of reward and computational cost compared to an MB expert alone, which achieves the best performance but has the highest computational cost. We also show that the method makes it possible to cope with sudden changes in the environment, goal changes or changes in the behavior of the human partner during interaction tasks. The robots that performed these experiments, whether real or virtual, all used the same set of parameters, thus showing the generality of the method.
Fichier principal
Vignette du fichier
Dromnelle2022preprint.pdf (16.51 Mo) Télécharger le fichier
Origin : Files produced by the author(s)

Dates and versions

hal-03829879 , version 1 (24-11-2022)

Identifiers

Cite

Rémi Dromnelle, Erwan Renaudo, Mohamed Chetouani, Petros Maragos, Raja Chatila, et al.. Reducing computational cost during robot navigation and human-robot interaction with a human-inspired reinforcement learning architecture. International Journal of Social Robotics, 2023, 15, pp.1297-1323. ⟨10.1007/s12369-022-00942-6⟩. ⟨hal-03829879⟩
155 View
239 Download

Altmetric

Share

Gmail Facebook X LinkedIn More