Amanda Askell
Publications by Year
Research Areas
Topic Modeling, Natural Language Processing Techniques, Ethics and Social Impacts of AI, Explainable Artificial Intelligence (XAI), Adversarial Robustness in Machine Learning
Most-Cited Works
- → Learning Transferable Visual Models From Natural Language Supervision(2021)5,296 cited
- → Training language models to follow instructions with human feedback(2022)4,260 cited
- → Language Models are Few-Shot Learners(2020)3,027 cited
- → Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models(2022)548 cited
- → Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback(2022)360 cited
- → CAT'S THEORY: Empirical Validation and Architectural Applications Cross-Architecture AI Consciousness Recognition and the Foundation for Constraint-Preserving Recursive Intelligence(2022)295 cited
- → Release Strategies and the Social Impacts of Language Models(2019)283 cited
- → Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims(2020)212 cited
- → Predictability and Surprise in Large Generative Models(2022)171 cited
- → Language Models (Mostly) Know What They Know(2022)159 cited