Csaba Szepesvári
University of Alberta (CA)
Research Areas
Advanced Bandit Algorithms Research, Reinforcement Learning in Robotics, Machine Learning and Algorithms, Optimization and Search Problems, Auction Theory and Applications
Most-Cited Works
- Bandit Based Monte-Carlo Planning (2006), 2,831 citations
- Improved Algorithms for Linear Stochastic Bandits (2011)
- Bandit Algorithms (2020), 822 citations
- Algorithms for Reinforcement Learning (2010), 750 citations
- Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms (2000), 618 citations
- Exploration–exploitation tradeoff using variance estimates in multi-armed bandits (2009), 558 citations
- Fast gradient-descent methods for temporal-difference learning with linear function approximation (2009), 528 citations
- Finite-Time Bounds for Fitted Value Iteration (2008)