TY - GEN
T1 - Social-Group-Agnostic Bias Mitigation via the Stereotype Content Model
AU - Omrani, Ali
AU - Ziabari, Alireza S.
AU - Yu, Charles
AU - Golazizian, Preni
AU - Kennedy, Brendan
AU - Atari, Mohammad
AU - Ji, Heng
AU - Dehghani, Morteza
N1 - We would like to thank our anonymous reviewers for their feedback. This research was supported by NSF CAREER BCS-1846531 and DARPA INCAS HR001121C0165. The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies, either expressed or implied, of DARPA, or the U.S. Government. The U.S. Government is authorized to reproduce and distribute reprints for governmental purposes notwithstanding any copyright annotation therein.
PY - 2023
Y1 - 2023
N2 - Existing bias mitigation methods require social-group-specific word pairs (e.g., “man” - “woman”) for each social attribute (e.g., gender), restricting the bias mitigation to only one specified social attribute. Further, this constraint renders such methods impractical and costly for mitigating bias in understudied and/or unmarked social groups. We propose that the Stereotype Content Model (SCM) - a theoretical framework developed in social psychology for understanding the content of stereotyping - can help debiasing efforts to become social-group-agnostic by capturing the underlying connection between bias and stereotypes. SCM proposes that the content of stereotypes map to two psychological dimensions of warmth and competence. Using only pairs of terms for these two dimensions (e.g., warmth: “genuine” - “fake”; competence: “smart” - “stupid”), we perform debiasing with established methods on both pretrained word embeddings and large language models. We demonstrate that our social-group-agnostic, SCM-based debiasing technique performs comparably to group-specific debiasing on multiple bias benchmarks, but has theoretical and practical advantages over existing approaches.
AB - Existing bias mitigation methods require social-group-specific word pairs (e.g., “man” - “woman”) for each social attribute (e.g., gender), restricting the bias mitigation to only one specified social attribute. Further, this constraint renders such methods impractical and costly for mitigating bias in understudied and/or unmarked social groups. We propose that the Stereotype Content Model (SCM) - a theoretical framework developed in social psychology for understanding the content of stereotyping - can help debiasing efforts to become social-group-agnostic by capturing the underlying connection between bias and stereotypes. SCM proposes that the content of stereotypes map to two psychological dimensions of warmth and competence. Using only pairs of terms for these two dimensions (e.g., warmth: “genuine” - “fake”; competence: “smart” - “stupid”), we perform debiasing with established methods on both pretrained word embeddings and large language models. We demonstrate that our social-group-agnostic, SCM-based debiasing technique performs comparably to group-specific debiasing on multiple bias benchmarks, but has theoretical and practical advantages over existing approaches.
UR - http://www.scopus.com/inward/record.url?scp=85170264123&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85170264123&partnerID=8YFLogxK
U2 - 10.18653/v1/2023.acl-long.227
DO - 10.18653/v1/2023.acl-long.227
M3 - Conference contribution
AN - SCOPUS:85170264123
T3 - Proceedings of the Annual Meeting of the Association for Computational Linguistics
SP - 4123
EP - 4139
BT - Long Papers
PB - Association for Computational Linguistics (ACL)
T2 - 61st Annual Meeting of the Association for Computational Linguistics, ACL 2023
Y2 - 9 July 2023 through 14 July 2023
ER -