Bias Silhouette Analysis: Towards Assessing the Quality of Bias Metrics for Word Embedding Models

Maximilian Spliethöver; Henning Wachsmuth

doi:10.24963/ijcai.2021/77

Details

Original language	English
Title of host publication	Proceedings of the 30th International Joint Conference on Artificial Intelligence, IJCAI 2021
Editors	Zhi-Hua Zhou
Pages	552-559
Number of pages	8
Publication status	Published - 2021
Published externally	Yes
Event	30th International Joint Conference on Artificial Intelligence (IJCAI-21) - Online, Canada. Duration: 19 Aug 2021 → 26 Aug 2021

Publication series

Name	IJCAI International Joint Conference on Artificial Intelligence
ISSN (Print)	1045-0823

Abstract

Word embedding models reflect bias towards genders, ethnicities, and other social groups present in the underlying training data. Metrics such as ECT, RNSB, and WEAT quantify bias in these models based on predefined word lists representing social groups and bias-conveying concepts. How suitable these lists actually are to reveal bias (let alone the bias metrics in general) remains unclear, though. In this paper, we study how to assess the quality of bias metrics for word embedding models. In particular, we present a generic method, Bias Silhouette Analysis (BSA), that quantifies the accuracy and robustness of such a metric and of the word lists used. Given a biased and an unbiased reference embedding model, BSA applies the metric systematically for several subsets of the lists to the models. The variance and rate of convergence of the bias values of each model then entail the robustness of the word lists, whereas the distance between the models' values gives indications of the general accuracy of the metric with the word lists. We demonstrate the behavior of BSA on two standard embedding models for the three mentioned metrics with several word lists from existing research.
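
The procedure outlined in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy association score below merely stands in for a real metric such as ECT, RNSB, or WEAT, and all names (`association_gap`, `bias_silhouette`, the two tiny 2-D "models", the word lists) are hypothetical. The sketch assumes that subsets are drawn at random for each list size, then reports the per-size variance of the scores for each model and the distance between the two models' mean scores.

```python
import math
import random
import statistics


def cosine(u, v):
    """Cosine similarity of two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def association_gap(model, group_a, group_b, attributes):
    """Toy bias score (an assumption, not ECT/RNSB/WEAT): mean similarity of
    the attribute words to group A minus their mean similarity to group B."""
    def mean_sim(group):
        return statistics.fmean(
            cosine(model[g], model[t]) for g in group for t in attributes
        )
    return mean_sim(group_a) - mean_sim(group_b)


def bias_silhouette(biased, unbiased, group_a, group_b, attributes,
                    samples=50, seed=0):
    """For every subset size, score random attribute subsets on both models."""
    rng = random.Random(seed)
    rows = []
    for size in range(1, len(attributes) + 1):
        b_scores, u_scores = [], []
        for _ in range(samples):
            subset = rng.sample(attributes, size)
            b_scores.append(association_gap(biased, group_a, group_b, subset))
            u_scores.append(association_gap(unbiased, group_a, group_b, subset))
        rows.append({
            "size": size,
            # Low variance that shrinks quickly suggests a robust word list.
            "biased_var": statistics.pvariance(b_scores),
            "unbiased_var": statistics.pvariance(u_scores),
            # A large gap suggests the metric separates the two models.
            "gap": abs(statistics.fmean(b_scores) - statistics.fmean(u_scores)),
        })
    return rows


# Hypothetical 2-D embeddings: attributes lean towards "he" in BIASED only.
ATTRIBUTES = ["engineer", "pilot", "doctor", "chef"]
BIASED = {"he": (1.0, 0.0), "she": (0.0, 1.0), "engineer": (0.9, 0.1),
          "pilot": (0.8, 0.2), "doctor": (0.7, 0.3), "chef": (0.95, 0.05)}
UNBIASED = {"he": (1.0, 0.0), "she": (0.0, 1.0), "engineer": (1.0, 1.0),
            "pilot": (2.0, 2.0), "doctor": (0.5, 0.5), "chef": (3.0, 3.0)}

rows = bias_silhouette(BIASED, UNBIASED, ["he"], ["she"], ATTRIBUTES)
for row in rows:
    print(row)
```

In this toy setting, the variance columns play the role of the robustness signal and the `gap` column the role of the accuracy signal; how the paper actually samples subsets and aggregates the values may differ.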

ASJC Scopus subject areas

General Computer Science
Artificial Intelligence

Sustainable Development Goals

SDG 10 – Reduced Inequalities

Cite this

Bias Silhouette Analysis: Towards Assessing the Quality of Bias Metrics for Word Embedding Models. / Spliethöver, Maximilian; Wachsmuth, Henning.
Proceedings of the 30th International Joint Conference on Artificial Intelligence, IJCAI 2021. ed. / Zhi-Hua Zhou. 2021. pp. 552-559 (IJCAI International Joint Conference on Artificial Intelligence).

Research output: Chapter in book/report/conference proceeding › Conference contribution › Research › Peer-reviewed

Spliethöver, M & Wachsmuth, H 2021, Bias Silhouette Analysis: Towards Assessing the Quality of Bias Metrics for Word Embedding Models. in Z-H Zhou (ed.), Proceedings of the 30th International Joint Conference on Artificial Intelligence, IJCAI 2021. IJCAI International Joint Conference on Artificial Intelligence, pp. 552-559, 30th International Joint Conference on Artificial Intelligence (IJCAI-21), Canada, 19 Aug. 2021. https://doi.org/10.24963/ijcai.2021/77

Spliethöver, M., & Wachsmuth, H. (2021). Bias Silhouette Analysis: Towards Assessing the Quality of Bias Metrics for Word Embedding Models. In Z.-H. Zhou (Ed.), Proceedings of the 30th International Joint Conference on Artificial Intelligence, IJCAI 2021 (pp. 552-559). (IJCAI International Joint Conference on Artificial Intelligence). https://doi.org/10.24963/ijcai.2021/77

Spliethöver M, Wachsmuth H. Bias Silhouette Analysis: Towards Assessing the Quality of Bias Metrics for Word Embedding Models. In Zhou ZH, editor, Proceedings of the 30th International Joint Conference on Artificial Intelligence, IJCAI 2021. 2021. pp. 552-559. (IJCAI International Joint Conference on Artificial Intelligence). doi: 10.24963/ijcai.2021/77

Spliethöver, Maximilian; Wachsmuth, Henning. / Bias Silhouette Analysis: Towards Assessing the Quality of Bias Metrics for Word Embedding Models. Proceedings of the 30th International Joint Conference on Artificial Intelligence, IJCAI 2021. ed. / Zhi-Hua Zhou. 2021. pp. 552-559 (IJCAI International Joint Conference on Artificial Intelligence).


@inproceedings{a09c84551954464f9ede03099395838e,
  title = "Bias Silhouette Analysis: Towards Assessing the Quality of Bias Metrics for Word Embedding Models",
  abstract = "Word embedding models reflect bias towards genders, ethnicities, and other social groups present in the underlying training data. Metrics such as ECT, RNSB, and WEAT quantify bias in these models based on predefined word lists representing social groups and bias-conveying concepts. How suitable these lists actually are to reveal bias (let alone the bias metrics in general) remains unclear, though. In this paper, we study how to assess the quality of bias metrics for word embedding models. In particular, we present a generic method, Bias Silhouette Analysis (BSA), that quantifies the accuracy and robustness of such a metric and of the word lists used. Given a biased and an unbiased reference embedding model, BSA applies the metric systematically for several subsets of the lists to the models. The variance and rate of convergence of the bias values of each model then entail the robustness of the word lists, whereas the distance between the models' values gives indications of the general accuracy of the metric with the word lists. We demonstrate the behavior of BSA on two standard embedding models for the three mentioned metrics with several word lists from existing research.",
  author = "Maximilian Splieth{\"o}ver and Henning Wachsmuth",
  year = "2021",
  doi = "10.24963/ijcai.2021/77",
  language = "English",
  isbn = "9780999241196",
  series = "IJCAI International Joint Conference on Artificial Intelligence",
  pages = "552--559",
  editor = "Zhi-Hua Zhou",
  booktitle = "Proceedings of the 30th International Joint Conference on Artificial Intelligence, IJCAI 2021",
  note = "30th International Joint Conference on Artificial Intelligence, IJCAI 2021 ; Conference date: 19-08-2021 Through 26-08-2021",
}


TY  - GEN
T1  - Bias Silhouette Analysis
T2  - 30th International Joint Conference on Artificial Intelligence, IJCAI 2021
AU  - Spliethöver, Maximilian
AU  - Wachsmuth, Henning
PY  - 2021
Y1  - 2021
N2  - Word embedding models reflect bias towards genders, ethnicities, and other social groups present in the underlying training data. Metrics such as ECT, RNSB, and WEAT quantify bias in these models based on predefined word lists representing social groups and bias-conveying concepts. How suitable these lists actually are to reveal bias (let alone the bias metrics in general) remains unclear, though. In this paper, we study how to assess the quality of bias metrics for word embedding models. In particular, we present a generic method, Bias Silhouette Analysis (BSA), that quantifies the accuracy and robustness of such a metric and of the word lists used. Given a biased and an unbiased reference embedding model, BSA applies the metric systematically for several subsets of the lists to the models. The variance and rate of convergence of the bias values of each model then entail the robustness of the word lists, whereas the distance between the models' values gives indications of the general accuracy of the metric with the word lists. We demonstrate the behavior of BSA on two standard embedding models for the three mentioned metrics with several word lists from existing research.
AB  - Word embedding models reflect bias towards genders, ethnicities, and other social groups present in the underlying training data. Metrics such as ECT, RNSB, and WEAT quantify bias in these models based on predefined word lists representing social groups and bias-conveying concepts. How suitable these lists actually are to reveal bias (let alone the bias metrics in general) remains unclear, though. In this paper, we study how to assess the quality of bias metrics for word embedding models. In particular, we present a generic method, Bias Silhouette Analysis (BSA), that quantifies the accuracy and robustness of such a metric and of the word lists used. Given a biased and an unbiased reference embedding model, BSA applies the metric systematically for several subsets of the lists to the models. The variance and rate of convergence of the bias values of each model then entail the robustness of the word lists, whereas the distance between the models' values gives indications of the general accuracy of the metric with the word lists. We demonstrate the behavior of BSA on two standard embedding models for the three mentioned metrics with several word lists from existing research.
UR  - http://www.scopus.com/inward/record.url?scp=85125489618&partnerID=8YFLogxK
U2  - 10.24963/ijcai.2021/77
DO  - 10.24963/ijcai.2021/77
M3  - Conference contribution
AN  - SCOPUS:85125489618
SN  - 9780999241196
T3  - IJCAI International Joint Conference on Artificial Intelligence
SP  - 552
EP  - 559
BT  - Proceedings of the 30th International Joint Conference on Artificial Intelligence, IJCAI 2021
A2  - Zhou, Zhi-Hua
Y2  - 19 August 2021 through 26 August 2021
ER  -
