Tom Everitt
Google (United Kingdom)(GB)
Publications by Year
Research Areas
Reinforcement Learning in Robotics, Bayesian Modeling and Causal Inference, Ethics and Social Impacts of AI, Computability, Logic, AI Algorithms, Logic, Reasoning, and Knowledge
Most-Cited Works
- → Scalable agent alignment via reward modeling: a research direction(2018)124 cited
- → AI Safety Gridworlds(2017)117 cited
- → Universal Artificial Intelligence(2018)66 cited
- → Alignment of Language Agents(2021)41 cited
- → Count-Based Exploration in Feature Space for Reinforcement Learning(2017)32 cited
- → Avoiding Wireheading with Value Reinforcement Learning(2016)28 cited
- → AGI Safety Literature Review(2018)26 cited
- → Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective(2021)22 cited
- → Self-Modification of Policy and Utility Function in Rational Agents(2016)20 cited
- → Shaking the foundations: delusions in sequence models for interaction\n and control(2021)19 cited