R. Alami, R. Chatila, S. Fleury, M. Ghallab, and F. Ingrand, An Architecture for Autonomy, The International Journal of Robotics Research, vol.17, issue.4, pp.315-337, 1998.
DOI : 10.1177/027836499801700402
URL : https://hal.archives-ouvertes.fr/hal-00123273

B. W. Balleine and J. P. O'Doherty, Human and Rodent Homologies in Action Control: Corticostriatal Determinants of Goal-Directed and Habitual Action, Neuropsychopharmacology, vol.35, issue.1, pp.48-69, 2010.
DOI : 10.1038/npp.2009.131

K. Caluwaerts, M. Staffa, S. N'Guyen, C. Grand, L. Dollé et al., A biologically inspired meta-control navigation system for the Psikharpax rat robot, Bioinspiration & Biomimetics, vol.7, issue.2, p.025009, 2012.
DOI : 10.1088/1748-3182/7/2/025009
URL : https://hal.archives-ouvertes.fr/hal-01000945

N. D. Daw, Y. Niv, and P. Dayan, Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control, Nature Neuroscience, vol.8, issue.12, pp.1704-1711, 2005.
DOI : 10.1038/nn1560

A. Dezfouli and B. W. Balleine, Habits, action sequences and reinforcement learning, European Journal of Neuroscience, vol.35, issue.7, pp.1036-1051, 2012.
DOI : 10.1111/j.1460-9568.2012.08050.x

Q. J. Huys, N. Eshel, E. O'Nions, L. Sheridan, P. Dayan et al., Bonsai Trees in Your Head: How the Pavlovian System Sculpts Goal-Directed Choices by Pruning Decision Trees, PLoS Computational Biology, vol.8, issue.3, p.e1002410, 2012.
DOI : 10.1371/journal.pcbi.1002410

M. Keramati, A. Dezfouli, and P. Piray, Speed/Accuracy Trade-Off between the Habitual and the Goal-Directed Processes, PLoS Computational Biology, vol.7, issue.5, p.e1002055, 2011.
DOI : 10.1371/journal.pcbi.1002055

M. Khamassi, S. Lallée, P. Enel, E. Procyk, and P. F. Dominey, Robot Cognitive Control with a Neurophysiologically Inspired Reinforcement Learning Model, Frontiers in Neurorobotics, vol.5, issue.1, 2011.
DOI : 10.3389/fnbot.2011.00001
URL : https://hal.archives-ouvertes.fr/hal-00688931

M. Quigley, K. Conley, B. P. Gerkey, J. Faust, T. Foote et al., ROS: an open-source Robot Operating System, ICRA Workshop on Open Source Software, 2009.

E. Renaudo, S. Devin, B. Girard, R. Chatila, R. Alami et al., Learning to interact with humans using goal-directed and habitual behaviors, Workshop on Learning for Human-Robot Collaboration, 2015.

E. Renaudo, B. Girard, R. Chatila, and M. Khamassi, Design of a Control Architecture for Habit Learning in Robots, Biomimetic and Biohybrid Systems, LNAI Proceedings, pp.249-260, 2014.
DOI : 10.1007/978-3-319-09435-9_22
URL : https://hal.archives-ouvertes.fr/hal-01312443

E. Renaudo, B. Girard, R. Chatila, and M. Khamassi, Which criteria for autonomously shifting between goal-directed and habitual behaviors in robots?, 2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob), 2015.
DOI : 10.1109/DEVLRN.2015.7346152
URL : https://hal.archives-ouvertes.fr/hal-01312449

T. Siméon, J. Laumond, and F. Lamiraux, Move3D: A generic platform for path planning, Proceedings of the 2001 IEEE International Symposium on Assembly and Task Planning (ISATP2001), pp.25-30, 2001.
DOI : 10.1109/ISATP.2001.928961

C. Watkins, Learning from Delayed Rewards, PhD thesis, King's College, University of Cambridge, 1989.