Red Teaming Language Models with Language Models
2022pp. 3419–3448
Citations Over TimeTop 1% of 2022 papers
Ethan Perez, Saffron Huang, Francis Song, Trevor Cai, Roman Ring, John Aslanides, Amelia Glaese, Nat McAleese, Geoffrey Irving
Abstract
Ethan Perez, Saffron Huang, Francis Song, Trevor Cai, Roman Ring, John Aslanides, Amelia Glaese, Nat McAleese, Geoffrey Irving. Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. 2022.
Related Papers
- → JastAdd—an aspect-oriented compiler construction system(2002)169 cited
- → Prolog - the language and its implementation compared with Lisp(1977)241 cited
- → On objects and events(2001)10 cited
- → Practical Verification for the Working Programmer with CodeContracts and Abstract Interpretation(2011)9 cited
- → A Survey on Large Scale Corpora and Emotion Corpora(2014)