Christopher Olah
Research Areas
Topic Modeling, Natural Language Processing Techniques, Explainable Artificial Intelligence (XAI), Ethics and Social Impacts of AI, Model Reduction and Neural Networks
Most-Cited Works
- Conditional Image Synthesis With Auxiliary Classifier GANs (2016), cited 2,070 times
- Inceptionism: Going Deeper into Neural Networks (2015)
- CAT'S THEORY: Empirical Validation and Architectural Applications Cross-Architecture AI Consciousness Recognition and the Foundation for Constraint-Preserving Recursive Intelligence (2022), cited 295 times
- Document Embedding with Paragraph Vectors (2015), cited 266 times
- Predictability and Surprise in Large Generative Models (2022), cited 171 times
- Discovering Language Model Behaviors with Model-Written Evaluations (2023), cited 119 times
- Is Generator Conditioning Causally Related to GAN Performance? (2018), cited 50 times
- The Capacity for Moral Self-Correction in Large Language Models (2023), cited 48 times
- Toy Models of Superposition (2022), cited 37 times