H. G. Wu, Y. R. Miyamoto, L. N. Gonzalez-Castro, B. P. Ölveczky, and M. A. Smith, Temporal structure of motor variability is dynamically regulated and predicts motor learning ability, Nat. Neurosci, vol.17, pp.312-321, 2014.

D. Aronov, A. S. Andalman, and M. S. Fee, A specialized forebrain circuit for vocal babbling in the juvenile songbird, Science, vol.320, pp.630-634, 2008.

P. M. Driver and D. A. Humphries, Protean Behaviour: The Biology of Unpredictability, 1988.

A. Rapoport and D. V. Budescu, Generation of random series in two-person strictly competitive games, J. Exp. Psychol. Gen, vol.121, pp.352-363, 1992.

R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, 1998.
URL : https://hal.archives-ouvertes.fr/hal-00764281

W. Schultz, Getting formal with dopamine and reward, Neuron, vol.36, pp.241-263, 2002.

J. D. Cohen, S. M. McClure, and A. J. Yu, Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration, Philos. Trans. R. Soc. Lond., B, Biol. Sci, vol.362, pp.933-942, 2007.

R. P. Rao, Decision making under uncertainty: a neural model based on partially observable Markov decision processes, Front. Comput. Neurosci, vol.4, p.146, 2010.

R. C. Wilson, A. Geana, J. M. White, E. A. Ludvig, and J. D. Cohen, Humans use directed and random exploration to solve the explore-exploit dilemma, J. Exp. Psychol. Gen, vol.143, pp.2074-2081, 2014.

F. A. Mansouri, E. Koechlin, M. G. Rosa, and M. J. Buckley, Managing competing goals - a key role for the frontopolar cortex, Nat. Rev. Neurosci, vol.18, pp.645-657, 2017.

A. Grunow and A. Neuringer, Learning to vary and varying to learn, Psychonomic Bull. Rev, vol.9, pp.250-258, 2002.

G. A. Kane, Increased locus coeruleus tonic activity causes disengagement from a patch-foraging task, Cogn. Affect. Behav. Neurosci, vol.17, pp.1-11, 2017.

N. D. Daw, J. P. O'Doherty, P. Dayan, B. Seymour, and R. J. Dolan, Cortical substrates for exploratory decisions in humans, Nature, vol.441, pp.876-879, 2006.

M. P. Karlsson, D. G. Tervo, and A. Y. Karpova, Network resets in medial prefrontal cortex mark the onset of behavioral uncertainty, Science, vol.338, pp.135-139, 2012.

C. Findling, V. Skvortsova, R. Dromnelle, S. Palminteri, and V. Wyart, Computational noise in reward-guided learning drives behavioral variability in volatile environments, Nat. Neurosci, vol.22, pp.2066-2077, 2019.

J. Naudé, Nicotinic receptors in the ventral tegmental area promote uncertainty-seeking, Nat. Neurosci, vol.19, pp.471-478, 2016.

F. Cinotti, Dopamine regulates the exploration-exploitation trade-off in rats, pp.1-36, 2019.

D. Lee, M. L. Conroy, B. P. McGreevy, and D. J. Barraclough, Reinforcement learning and decision making in monkeys during a competitive game, Cogn. Brain Res, vol.22, pp.45-58, 2004.

D. G. Tervo, Behavioral variability through stochastic choice and its gating by anterior cingulate cortex, Cell, vol.159, pp.21-32, 2014.

D. J. Barraclough, M. L. Conroy, and D. Lee, Prefrontal cortex and decision making in a mixed-strategy game, Nat. Neurosci, vol.7, pp.404-410, 2004.

A. Lempel and J. Ziv, On the complexity of finite sequences, IEEE Trans. Inf. Theory, vol.22, pp.75-81, 1976.

R. A. Rescorla and A. R. Wagner, A Theory of Pavlovian Conditioning: Variations in the Effectiveness of Reinforcement and Nonreinforcement, Classical conditioning II: current research and theory, pp.64-99, 1972.

P. W. Glimcher, Indeterminacy in brain and behavior, Annu. Rev. Psychol, vol.56, pp.25-56, 2005.

J. N. Towse and A. Cheshire, Random number generation and working memory, Eur. J. Cogn. Psychol, vol.19, pp.374-394, 2007.

W. Oomens, J. H. Maes, F. Hasselman, and J. I. Egger, A time series approach to random number generation: using recurrence quantification analysis to capture executive behavior, Front. Hum. Neurosci, vol.9, p.319, 2015.

W. Wagenaar, Generation of random sequences by human subjects: a critical survey of literature, Psychological Bull, vol.77, pp.65-72, 1972.

J. H. Maes, P. A. Eling, M. F. Reelick, and R. P. Kessels, Assessing executive functioning: on the validity, reliability, and sensitivity of a click/point random number generation task in healthy adults and patients with cognitive decline, J. Clin. Exp. Neuropsychol, vol.33, pp.366-378, 2011.

N. Marwan, M. C. Romano, M. Thiel, and J. Kurths, Recurrence plots for the analysis of complex systems, Phys. Rep, vol.438, pp.237-329, 2007.

P. Faure and A. Lesne, Recurrence plots for symbolic sequences, Int. J. Bifur. Chaos, vol.20, pp.1731-1749, 2010.

J. Bergstra and Y. Bengio, Random search for hyper-parameter optimization, J. Mach. Learn. Res, vol.13, pp.281-305, 2012.

M. Belkaid, Code for basic Q-learning model fitting, 2019.

R. E. Kass and A. E. Raftery, Bayes factors, J. Am. Stat. Assoc, vol.90, pp.773-795, 1995.

M. Belkaid, Mice adaptively generate choice variability in a deterministic task - behavioral data, 2019.