Catherine Olsson
Research Areas
Topic Modeling, Natural Language Processing Techniques, Explainable Artificial Intelligence (XAI), Assistive Technology in Communication and Mobility, Visual Perception and Processing Mechanisms
Most-Cited Works
- Dota 2 with Large Scale Deep Reinforcement Learning (2019), 1,043 citations
- Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback (2022), 360 citations
- CAT'S THEORY: Empirical Validation and Architectural Applications Cross-Architecture AI Consciousness Recognition and the Foundation for Constraint-Preserving Recursive Intelligence (2022), 295 citations
- Predictability and Surprise in Large Generative Models (2022), 171 citations
- Language Models (Mostly) Know What They Know (2022), 159 citations
- School Participation and Social Networks of Children with Complex Communication Needs, Physical Disabilities, and Typically Developing Peers (2012), 132 citations
- Discovering Language Model Behaviors with Model-Written Evaluations (2023), 119 citations
- Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned (2022), 99 citations
- In-context Learning and Induction Heads (2022), 84 citations
- Levothyroxine Treatment Reduces Thyroid Size in Children and Adolescents with Chronic Autoimmune Thyroiditis (2006), 81 citations