Nova DasSarma
Research Areas
Topic Modeling, Natural Language Processing Techniques, Multimodal Machine Learning Applications, Explainable Artificial Intelligence (XAI), Neural Networks and Applications
Most-Cited Works
- Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback (2022), cited 360 times
- In-context Learning and Induction Heads (2022), cited 84 times
- A General Language Assistant as a Laboratory for Alignment (2021), cited 27 times
- Scaling Laws and Interpretability of Learning from Repeated Data (2022), cited 22 times
- Predictability and Surprise in Large Generative Models (2022), cited 21 times