Ariel Herbert-Voss
Publications by Year
Research Areas
Topic Modeling, Natural Language Processing Techniques, Ethics and Social Impacts of AI, Adversarial Robustness in Machine Learning, Parallel Computing and Optimization Techniques
Most-Cited Works
- → Language Models are Few-Shot Learners(2020)3,027 cited
- → Evaluating Large Language Models Trained on Code(2021)1,403 cited
- → Release Strategies and the Social Impacts of Language Models(2019)283 cited
- → Extracting Training Data from Large Language Models(2020)274 cited
- → Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims(2020)212 cited
- → Proceedings of the First Workshop on Intelligent and Interactive Writing Assistants (In2Writing 2022)(2022)15 cited
- → The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning(2024)13 cited
- → Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Tutorials(2021)1 cited
- → Ninth Workshop on Speech and Language Processing for Assistive Technologies (SLPAT-2022)(2022)1 cited