Respective Advantages and Disadvantages of Model-based and Model-free Reinforcement Learning in a Robotics Neuro-inspired Cognitive Architecture
Abstract
Combining model-based and model-free reinforcement learning systems in robotic cognitive architectures appears to be a promising direction for endowing artificial agents with flexibility and decisional autonomy close to that of mammals. In particular, it could enable robots to build an internal model of the environment, to plan within it in response to detected environmental changes, and to avoid the cost and time of planning once the environment is recognized as stable enough for habit learning. However, previously proposed criteria for coordinating these two learning systems do not scale up to the large, partial, and uncertain models that robots learn autonomously. Here we analyze in detail the performance of these two systems in an asynchronous robotic simulation of a cube-pushing task requiring a permanent trade-off between speed and accuracy. We propose solutions that make learning successful under these conditions. We finally discuss possible criteria for efficiently coordinating the two systems within robotic cognitive architectures.
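To make the coordination idea concrete, below is a minimal illustrative sketch (not the paper's architecture or its coordination criterion) of arbitrating between a model-free Q-learner and a model-based planner on a toy tabular task. All names, parameter values, and the uncertainty-based switching rule are assumptions introduced here for illustration only.

```python
"""Illustrative sketch, assuming a toy tabular task and a simple
prediction-error-based arbitration rule; not the authors' method."""

import random
from collections import defaultdict

ALPHA, GAMMA = 0.1, 0.95      # learning rate and discount factor (assumed values)
N_STATES, N_ACTIONS = 5, 2    # toy task size (assumed)

q = defaultdict(float)        # model-free action values Q(s, a)
model = {}                    # learned model: (s, a) -> (next_state, reward)
recent_errors = []            # recent reward-prediction errors, used for arbitration


def mf_update(s, a, r, s2):
    """One-step Q-learning update; also records the prediction error."""
    best_next = max(q[(s2, b)] for b in range(N_ACTIONS))
    delta = r + GAMMA * best_next - q[(s, a)]
    q[(s, a)] += ALPHA * delta
    recent_errors.append(abs(delta))
    del recent_errors[:-20]   # keep only a short sliding window


def mb_plan(s, depth=3):
    """Greedy depth-limited lookahead in the learned model (a simplification
    of the value iteration typically used by model-based planners)."""
    def value(state, d):
        if d == 0:
            return 0.0
        vals = []
        for a in range(N_ACTIONS):
            if (state, a) in model:
                s2, r = model[(state, a)]
                vals.append(r + GAMMA * value(s2, d - 1))
        return max(vals) if vals else 0.0

    scores = []
    for a in range(N_ACTIONS):
        if (s, a) in model:
            s2, r = model[(s, a)]
            scores.append((r + GAMMA * value(s2, depth - 1), a))
    return max(scores)[1] if scores else random.randrange(N_ACTIONS)


def select_action(s, error_threshold=0.05):
    """Assumed arbitration rule: use cheap habitual (model-free) control once
    recent prediction errors are small, i.e. the environment looks stable;
    otherwise pay the cost of model-based planning."""
    stable = recent_errors and sum(recent_errors) / len(recent_errors) < error_threshold
    if stable:
        return max(range(N_ACTIONS), key=lambda a: q[(s, a)])
    return mb_plan(s)
```

The switching rule here captures only the general trade-off discussed in the abstract (planning when the environment changes, habits when it is stable); the criteria actually evaluated in the paper are defined in its main text.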
Origin | Publication funded by an institution
---|---