Lihong Li
Hebei University of Engineering(CN)North China University of Science and Technology(CN)TED University(TR)Hangzhou Dianzi University(CN)Shanxi Datong University(CN)Shenyang Jianzhu University(CN)
Publications by Year
Research Areas
Advanced Bandit Algorithms Research, Reinforcement Learning in Robotics, Machine Learning and Algorithms, Topic Modeling, Optimization and Search Problems
Most-Cited Works
- → A contextual-bandit approach to personalized news article recommendation(2010)2,460 cited
- Parallelized Stochastic Gradient Descent(2010)
- An Empirical Evaluation of Thompson Sampling(2011)
- → Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms(2011)461 cited
- → Neural Approaches to Conversational AI(2018)383 cited
- → Doubly Robust Policy Evaluation and Learning(2011)302 cited
- → Sparse Online Learning via Truncated Gradient(2008)185 cited
- → Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep Reinforcement Learning