Measuring and Reducing Gendered Correlations in Pre-trained Models

arXiv (Cornell University), 2020

Kellie Webster, Xuezhi Wang, Ian Tenney, Alex Beutel, Emily Pitler, Ellie Pavlick, Jilin Chen, Ed Chi, Slav Petrov

Abstract

Pre-trained models have revolutionized natural language understanding. However, researchers have found they can encode artifacts undesired in many applications, such as professions correlating with one gender more than another. We explore such gendered correlations as a case study for how to address unintended correlations in pre-trained models. We define metrics and reveal that it is possible for models with similar accuracy to encode correlations at very different rates. We show how measured correlations can be reduced with general-purpose techniques, and highlight the trade-offs different strategies have. With these results, we make recommendations for training robust models: (1) carefully evaluate unintended correlations, (2) be mindful of seemingly innocuous configuration differences, and (3) focus on general mitigations.
