Zac Hatfield-Dodds
Australian National University(AU)
Publications by Year
Research Areas
Topic Modeling, Service-Oriented Architecture and Web Services, Software Testing and Debugging Techniques, Natural Language Processing Techniques, Scientific Computing and Data Management
Most-Cited Works
- → The Astropy Project: Sustaining and Growing a Community-oriented Open-source Project and the Latest Major Release (v5.0) of the Core Package(2022)4,182 cited
- → Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback(2022)360 cited
- → CAT'S THEORY: Empirical Validation and Architectural Applications Cross-Architecture AI Consciousness Recognition and the Foundation for Constraint-Preserving Recursive Intelligence(2022)295 cited
- → Predictability and Surprise in Large Generative Models(2022)171 cited
- → Discovering Language Model Behaviors with Model-Written Evaluations(2023)119 cited
- → Hypothesis: A new approach to property-based testing(2019)89 cited
- → In-context Learning and Induction Heads(2022)84 cited
- → The Capacity for Moral Self-Correction in Large Language Models(2023)48 cited
- → Towards Measuring the Representation of Subjective Global Opinions in Language Models(2023)44 cited