a public good project by the
Synthesis
Company
of California

© 2026

Tom Everitt | doi.page

0 works0 citations0 h-index

Google Scholar OpenAlex

Tom Everitt

Google (United Kingdom)(GB)

Publications by Year

Research Areas

Reinforcement Learning in Robotics, Bayesian Modeling and Causal Inference, Ethics and Social Impacts of AI, Computability, Logic, AI Algorithms, Logic, Reasoning, and Knowledge

Most-Cited Works

→ Scalable agent alignment via reward modeling: a research direction(2018)124 cited
→ AI Safety Gridworlds(2017)117 cited
→ Universal Artificial Intelligence(2018)66 cited
→ Alignment of Language Agents(2021)41 cited
→ Count-Based Exploration in Feature Space for Reinforcement Learning(2017)32 cited
→ Avoiding Wireheading with Value Reinforcement Learning(2016)28 cited
→ AGI Safety Literature Review(2018)26 cited
→ Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective(2021)22 cited
→ Self-Modification of Policy and Utility Function in Rational Agents(2016)20 cited
→ Shaking the foundations: delusions in sequence models for interaction\n and control(2021)19 cited