Miljan Martic
Research Areas
Reinforcement Learning in Robotics, Explainable Artificial Intelligence (XAI), Adversarial Robustness in Machine Learning, Data Stream Mining Techniques, Artificial Intelligence in Games
Most-Cited Works
- Deep reinforcement learning from human preferences (2017), 508 cited
- Scalable agent alignment via reward modeling: a research direction (2018), 124 cited
- AI Safety Gridworlds (2017), 117 cited
- Meta-trained agents implement Bayes-optimal agents (2020), 24 cited
- Penalizing side effects using stepwise relative reachability (2018), 23 cited
- Measuring and avoiding side effects using relative reachability (2018)
- Avoiding Side Effects By Considering Future Tasks (2020), 11 cited
- Algorithms for Causal Reasoning in Probability Trees (2020), 10 cited
- Causal Analysis of Agent Behavior for AI Safety (2021), 6 cited