Csaba Szepesvári


Csaba Szepesvári [1] is a Hungarian computer scientist with research interests in applications of statistical techniques in AI and in reinforcement learning [2]. He worked at the Computer and Automation Research Institute of the Hungarian Academy of Sciences, and is a professor at the Department of Computing Science, University of Alberta, and principal investigator of the RLAI [3] group, currently on leave at DeepMind.


In 2006, along with Levente Kocsis, Csaba Szepesvári introduced UCT (Upper Confidence bounds applied to Trees), a new algorithm that applies bandit ideas to guide Monte-Carlo planning [4]. UCT accelerated the Monte-Carlo revolution in computer Go [5] and other domains.
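The bandit idea behind UCT can be illustrated with a small sketch of the UCB1 selection rule, which UCT applies at each node of the search tree to choose the next move to sample. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name `ucb1_select`, the list-based bookkeeping, and the default exploration constant `c` are all hypothetical choices made for this example.

```python
import math


def ucb1_select(counts, totals, c=math.sqrt(2)):
    """Pick the index of the arm (child move) to sample next.

    counts[i] -- how often arm i has been sampled so far
    totals[i] -- sum of rewards observed for arm i
    c         -- exploration constant (assumed default; tuned in practice)
    """
    # Sample every arm once before the confidence bound is defined.
    for i, n in enumerate(counts):
        if n == 0:
            return i

    total_visits = sum(counts)

    def upper_bound(i):
        mean = totals[i] / counts[i]
        # Exploration bonus shrinks as arm i is sampled more often.
        bonus = c * math.sqrt(math.log(total_visits) / counts[i])
        return mean + bonus

    return max(range(len(counts)), key=upper_bound)
```

With equal visit counts the rule prefers the arm with the higher average reward, while a rarely sampled arm receives a large bonus and is revisited even if its average is low. That balance between exploitation and exploration is what lets the search concentrate samples on promising moves.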

Selected Publications

[6] [7]

1994 …

  • Csaba Szepesvári, László Balázs, András Lőrincz (1994). Topology learning solved by extended objects: a neural network model. pdf
  • Csaba Szepesvári (1998). Reinforcement Learning: Theory and Practice. in Proceedings of the 2nd Slovak Conference on Artificial Neural Networks, zipped ps

2005 …

2010 …

2015 …

  • Tor Lattimore, Csaba Szepesvári (2017). The End of Optimism? An Asymptotic Analysis of Finite-Armed Linear Bandits. AISTATS, pdf
  • Tor Lattimore, Csaba Szepesvári (2018). Cleaning up the neighborhood: A full classification for adversarial partial monitoring. arXiv:1805.09247
  • Tor Lattimore, Csaba Szepesvári (2019). Bandit Algorithms. Cambridge University Press (draft), pdf


  1. Homepage of Csaba Szepesvári
  2. Research Interests of Csaba Szepesvári
  3. Reinforcement Learning and Artificial Intelligence (RLAI)
  4. Levente Kocsis, Csaba Szepesvári (2006). Bandit based Monte-Carlo Planning
  5. Sylvain Gelly, Marc Schoenauer, Michèle Sebag, Olivier Teytaud, Levente Kocsis, David Silver, Csaba Szepesvári (2012). The Grand Challenge of Computer Go: Monte Carlo Tree Search and Extensions. Communications of the ACM, Vol. 55, No. 3, pdf preprint
  6. Publications of Csaba Szepesvári
  7. dblp: Csaba Szepesvári
