Valentin Hofmann
Publications by Year
Research Areas
Natural Language Processing Techniques, Topic Modeling, Language and cultural evolution, Social Media and Politics, Speech and dialogue systems
Most-Cited Works
- → AI generates covertly racist decisions about people based on their dialect(2024)133 cited
- → Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research(2024)33 cited
- → Dialect prejudice predicts AI decisions about people's character, employability, and criminality(2024)30 cited
- → Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models(2024)26 cited
- → An Embarrassingly Simple Method to Mitigate Undesirable Properties of Pretrained Language Model Tokenizers(2022)18 cited
- → Predicting the Growth of Morphological Families from Social and Linguistic Factors(2020)13 cited
- → The better your Syntax, the better your Semantics? Probing Pretrained Language Models for the English Comparative Correlative(2022)13 cited
- → The Reddit Politosphere: A Large-Scale Text and Network Resource of Online Political Discourse(2022)11 cited
- → Counting the Bugs in ChatGPT’s Wugs: A Multilingual Investigation into the Morphological Capabilities of a Large Language Model(2023)10 cited
- → A Graph Auto-encoder Model of Derivational Morphology(2020)10 cited