Spatial cognition and neuro-mimetic navigation: a model of hippocampal place cell activity, Biological cybernetics, vol.83, issue.3, pp.287-299, 2000. ,
Prioritized sweeping neural DynaQ with multiple predecessors, and hippocampal replays, Conference on Biomimetic and Biohybrid Systems, pp.16-27, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01709275
Adaptive critics and the basal ganglia, Models of Information Processing in the Basal Ganglia, pp.215-232, 1995. ,
Learning to act using real-time dynamic programming, Artificial intelligence, vol.72, issue.1-2, pp.81-138, 1995. ,
Spatial decisions and neuronal activity in hippocampal projection zones in prefrontal cortex and striatum, Hippocampal Place Fields: Relevance to Learning and Memory pp, pp.289-311, 2008. ,
Coherent theta oscillations and reorganization of spike timing in the hippocampal-prefrontal network upon learning, Neuron, vol.66, issue.6, pp.921-936, 2010. ,
URL : https://hal.archives-ouvertes.fr/hal-00554482
Dendrites, deep learning, and sequences in the hippocampus, Hippocampus, vol.29, issue.3, pp.239-251, 2019. ,
Two-stage model of memory trace formation: A role for "noisy" brain states, Neuroscience, vol.31, issue.3, pp.551-570, 1989. ,
A biologically inspired meta-control navigation system for the psikharpax rat robot, Bioinspiration & biomimetics, vol.7, issue.2, p.25009, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-01000945
Hippocampal replays under the scrutiny of reinforcement learning models, Journal of neurophysiology, vol.120, issue.6, pp.2877-2896, 2018. ,
Decisions in changing conditions: the urgency-gating model, Journal of Neuroscience, vol.29, issue.37, pp.11560-11571, 2009. ,
Spatial memory sequence encoding and replay during modeled theta and ripple oscillations, Cognitive Computation, vol.3, issue.4, pp.554-574, 2011. ,
Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control, Nature neuroscience, vol.8, issue.12, p.1704, 2005. ,
Forward and reverse hippocampal place-cell sequences during ripples, Nature neuroscience, vol.10, issue.10, p.1241, 2007. ,
Analyzing interactions between navigation strategies using a computational model of action selection, International Conference on Spatial Cognition, pp.71-86, 2008. ,
Path planning versus cue responding: a bio-inspired model of switching between navigation strategies, Biological cybernetics, vol.103, issue.4, pp.299-317, 2010. ,
Interactions of spatial strategies producing generalization gradient and blocking: A computational approach, PLoS computational biology, vol.14, issue.4, p.1006092, 2018. ,
A model of hippocampally dependent navigation, using the temporal difference learning rule, Hippocampus, vol.10, issue.1, pp.1-16, 2000. ,
Replay comes of age, Annual review of neuroscience, vol.40, pp.581-602, 2017. ,
Reverse replay of behavioural sequences in hippocampal place cells during the awake state, Nature, vol.440, issue.7084, pp.680-683, 2006. ,
Anatomy of a decision: striato-orbitofrontal interactions in reinforcement learning, decision making, and reversal, Psychological review, vol.113, issue.2, p.300, 2006. ,
The organization of recent and remote memories, Nature reviews Neuroscience, vol.6, issue.2, pp.119-130, 2005. ,
Selective suppression of hippocampal ripples impairs spatial memory, Nature neuroscience, vol.12, issue.10, pp.1222-1223, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-00599372
Affordances. motivations, and the world graph theory, Adaptive Behavior, vol.6, issue.3-4, pp.435-471, 1998. ,
Hippocampal Replay Is Not a Simple Function of Experience, Neuron, vol.65, issue.5, pp.695-705, 2010. ,
Awake hippocampal sharpwave ripples support spatial memory, Science, vol.336, issue.6087, pp.1454-1458, 2012. ,
A unified dynamic model for learning, replay, and sharp-wave/ripples, Journal of Neuroscience, vol.35, issue.49, pp.16236-16258, 2015. ,
Hippocampal replay contributes to within session learning in a temporal difference reinforcement learning model, Neural Networks, vol.18, issue.9, pp.1163-1171, 2005. ,
Neural ensembles in CA3 transiently encode paths forward of the animal at a decision point, Journal of Neuroscience, vol.27, issue.45, pp.12176-12189, 2007. ,
Integrating hippocampus and striatum in decision-making, Current opinion in neurobiology, vol.17, issue.6, pp.692-697, 2007. ,
Orbitofrontal cortex supports behavior and learning using inferred but not cached values, Science, vol.338, issue.6109, pp.953-956, 2012. ,
Awake replay of remote experiences in the hippocampus, Nature neuroscience, vol.12, issue.7, p.913, 2009. ,
Integrating cortico-limbic-basal ganglia architectures for learning model-based and model-free navigation strategies, Frontiers in Behavioral Neuroscience, vol.6, p.79, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-01219958
Behavioral regulation and the modulation of information coding in the lateral prefrontal and cingulate cortex, Cerebral Cortex, vol.25, issue.9, pp.3197-3218, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01219972
Segregated encoding of reward-identity and stimulus-reward associations in human orbitofrontal cortex, Journal of Neuroscience, vol.33, issue.7, pp.3202-3211, 2013. ,
Hippocampus leads ventral striatum in replay of place-reward information, PLoS Biology, vol.7, issue.8, 2009. ,
Explicit memory creation during sleep demonstrates a causal role of place cells in navigation, Nature neuroscience, vol.18, issue.4, pp.493-495, 2015. ,
Memory of sequential experience in the hippocampus during slow wave sleep, Neuron, vol.36, issue.6, pp.1183-1194, 2002. ,
A sequence predicting ca3 is a flexible associator that learns and uses context to solve hippocampal-like tasks, Hippocampus, vol.6, issue.6, pp.579-590, 1996. ,
Self-improving reactive agents based on reinforcement learning, planning and teaching, Machine learning, vol.8, issue.3/4, pp.69-97, 1992. ,
Hippocampo-cortical coupling mediates memory consolidation during sleep, Nature neuroscience, vol.19, issue.7, pp.959-964, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-02365552
Prioritized memory access explains planning and hippocampal replay, Nature Neuroscience, vol.21, issue.11, p.1609, 2018. ,
Information processing in decision-making systems, The Neuroscientist, vol.18, issue.4, pp.342-359, 2012. ,
An integrative theory of prefrontal cortex function, Annual review of neuroscience, vol.24, issue.1, pp.167-202, 2001. ,
Prioritized sweeping: Reinforcement learning with less data and less time, Machine learning, vol.13, issue.1, pp.103-130, 1993. ,
The hippocampus as a spatial map: Preliminary evidence from unit activity in the freely-moving rat, Brain research, vol.34, issue.1, pp.171-175, 1971. ,
Hippocampal place cells construct reward related sequences through unexplored space, vol.4, p.6063, 2015. ,
The role of hippocampal replay in memory and planning, Current Biology, vol.28, issue.1, pp.37-50, 2018. ,
Confirmation bias in human reinforcement learning: Evidence from counterfactual feedback processing, PLoS computational biology, vol.13, issue.8, p.1005684, 2017. ,
Interplay between Hippocampal Sharp-Wave-Ripple Events and Vicarious Trial and Error Behaviors in Decision Making, Neuron, vol.92, issue.5, pp.1-8, 2016. ,
Map making: Constructing, combining, and navigating abstract cognitive maps, p.810051, 2019. ,
Different time courses of learning-related activity in the prefrontal cortex and striatum, Nature, vol.433, issue.7028, p.873, 2005. ,
Efficient learning and planning within the Dyna framework, Adaptive Behavior, vol.1, issue.4, pp.437-454, 1993. ,
Replay of rule-learning related neural patterns in the prefrontal cortex during sleep, Nature Neuroscience, vol.12, issue.7, pp.919-926, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-00551868
The mixed instrumental controller: using value of information to combine habitual choice and mental simulation, Frontiers in psychology, vol.4, 2013. ,
Internally generated sequences in learning and executing goal-directed behavior, Trends in Cognitive Sciences, vol.18, issue.12, pp.647-657, 2014. ,
Internally generated hippocampal sequences as a vantage point to probe future-oriented cognition, Annals of the New York Academy of Sciences, vol.1396, issue.1, pp.144-165, 2017. ,
Hippocampal place-cell sequences depict future paths to remembered goals, Nature, vol.497, issue.7447, p.74, 2013. ,
Bi-directional search, Machine intelligence, vol.6, p.10, 1971. ,
Vicarious trial and error, Nature Reviews Neuroscience, vol.17, issue.3, pp.147-159, 2016. ,
Design of a control architecture for habit learning in robots, Conference on Biomimetic and Biohybrid Systems, pp.249-260, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01312443
Computational models of memory consolidation and long-term synaptic plasticity during sleep, Neurobiology of learning and memory, vol.160, pp.32-47, 2019. ,
Hippocampal sharp-wave ripples in waking and sleeping states, Current opinion in neurobiology, vol.35, pp.6-12, 2015. ,
Transition between encoding and consolidation/replay dynamics via cholinergic modulation of can current: a modeling study, Hippocampus, vol.25, issue.9, pp.1052-1070, 2015. ,
A neural substrate of prediction and reward, Science, vol.275, pp.1593-1599, 1997. ,
The hippocampus as a predictive map, Nature neuroscience, vol.20, issue.11, p.1643, 2017. ,
Integrated architectures for learning, planning, and reacting based on approximating dynamic programming, Proceedings of the seventh international conference on machine learning, pp.216-224, 1990. ,
Reinforcement Learning: An Introduction. Cambridge, 1998. ,
Modeling choice and reaction time during arbitrary visuomotor learning through the coordination of adaptive working memory and reinforcement learning, Frontiers in behavioral neuroscience 9, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01215419
Over the river, through the woods: cognitive maps in the hippocampus and orbitofrontal cortex, Nature Reviews Neuroscience, vol.17, issue.8, pp.513-523, 2016. ,
Reactivation of hippocampal ensemble memories during sleep, Science, vol.265, issue.5172, pp.676-679, 1994. ,
Complementary task structure representations in hippocampus and orbitofrontal cortex during an odor sequence task, Current Biology, vol.29, issue.20, pp.3402-3409, 2019. ,