Jens Tuyls
Research Areas
Topic Modeling, Adversarial Robustness in Machine Learning, Explainable Artificial Intelligence (XAI), Multi-Agent Systems and Negotiation, Reinforcement Learning in Robotics
Most-Cited Works
- AllenNLP Interpret: A Framework for Explaining Predictions of NLP Models (2019), 125 citations
- Gradient-based Analysis of NLP Models is Manipulable (2020), 41 citations
- Multi-Stage Episodic Control for Strategic Exploration in Text Games (2022), 4 citations
- Differentially Private Language Models Benefit from Public Pre-training (2020), 3 citations
- Language-guided World Models: A Model-based Approach to AI Control (2024)
- Scaling Laws for Imitation Learning in Single-Agent Games (2023)