0 works0 citations0 h-index

Amanda Askell

Publications by Year

Research Areas

Topic Modeling, Natural Language Processing Techniques, Ethics and Social Impacts of AI, Explainable Artificial Intelligence (XAI), Adversarial Robustness in Machine Learning

Most-Cited Works

→ Learning Transferable Visual Models From Natural Language Supervision(2021)5,296 cited
→ Training language models to follow instructions with human feedback(2022)4,260 cited
→ Language Models are Few-Shot Learners(2020)3,027 cited
→ Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models(2022)548 cited
→ Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback(2022)360 cited
→ CAT'S THEORY: Empirical Validation and Architectural Applications Cross-Architecture AI Consciousness Recognition and the Foundation for Constraint-Preserving Recursive Intelligence(2022)295 cited
→ Release Strategies and the Social Impacts of Language Models(2019)283 cited
→ Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims(2020)212 cited
→ Predictability and Surprise in Large Generative Models(2022)171 cited
→ Language Models (Mostly) Know What They Know(2022)159 cited