E. Agichtein, E. Brill, and S. Dumais, Improving web search ranking by incorporating user behavior information, SIGIR '06, pp.19-26, 2006.

J. Baxter, L. Weaver, and P. Bartlett, Direct gradient-based reinforcement learning: II. Gradient ascent algorithms and experiments, 1999.

P. Bojanowski, E. Grave, A. Joulin, and T. Mikolov, Enriching word vectors with subword information, Transactions of the Association for Computational Linguistics, vol.5, pp.135-146, 2017.

A. Bordes and J. Weston, Learning end-to-end goal-oriented dialog, 2016.

M. Burtsev, A. Chuklin, J. Kiseleva, and A. Borisov, Search-oriented conversational AI (SCAI), ICTIR '17, pp.333-334, 2017.

B. Dhingra, L. Li, X. Li, J. Gao, Y. Chen et al., Towards end-to-end reinforcement learning of dialogue agents for information access, ACL '17, pp.484-495, 2017.

S. Hochreiter and J. Schmidhuber, Long short-term memory, Neural Comput, vol.9, issue.8, pp.1735-1780, 1997.

R. Hoffmann, C. Zhang, X. Ling, L. Zettlemoyer, and D. S. Weld, Knowledge-based weak supervision for information extraction of overlapping relations, HLT '11, pp.541-550, 2011.

T. Joachims, Optimizing search engines using clickthrough data, SIGKDD '02, pp.133-142, 2002.

H. Joho, L. Cavedon, J. Arguello, M. Shokouhi, and F. Radlinski, CAIR'17: First international workshop on conversational approaches to information retrieval at SIGIR 2017, SIGIR Forum, vol.51, pp.114-121, 2018.

D. P. Kingma and J. Ba, Adam: A method for stochastic optimization, 2014.

G. Lample, L. Denoyer, and M. Ranzato, Unsupervised machine translation using monolingual corpora only, 2017.

J. Li, M. Galley, C. Brockett, J. Gao, and B. Dolan, A diversity-promoting objective function for neural conversation models, HLT '16, pp.110-119, 2016.

Y. Lin, Z. Liu, M. Sun, Y. Liu, and X. Zhu, Learning entity and relation embeddings for knowledge graph completion, AAAI, pp.2181-2187, 2015.

R. Nogueira and K. Cho, Task-oriented query reformulation with reinforcement learning, SCAI Workshop, ICTIR, 2017.

A. Ritter, C. Cherry, and W. B. Dolan, Data-driven response generation in social media, EMNLP '11, 2011.

H. Song, A. Kim, and S. Park, Translation of natural language query into keyword query using an RNN encoder-decoder, SIGIR '17, pp.965-968, 2017.

S. Vakulenko, I. Markov, and M. de Rijke, Conversational exploratory search via interactive storytelling, NEUIR SIGIR '17, 2017.

Z. Wang and O. Lemon, A simple and generic belief tracking mechanism for the dialog state tracking challenge: On the believability of observed information, SIGDIAL '13, pp.423-432, 2013.

Z. Yin, K. Chang, and R. Zhang, DeepProbe: Information directed sequence understanding and chatbot design via recurrent neural networks, SIGKDD '17, pp.2131-2139, 2017.