XPROAX: Local explanations for text classification with progressive neighborhood approximation

Yi Cai; Arthur Zimek; Eirini Ntoutsi

doi:10.48550/arXiv.2109.15004

Details

Original language	English
Title of host publication	2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021
Publisher	Institute of Electrical and Electronics Engineers Inc.
Number of pages	10
ISBN (electronic)	9781665420990
ISBN (print)	978-1-6654-2100-3
Publication status	Published - 2021
Event	8th IEEE International Conference on Data Science and Advanced Analytics, DSAA 2021 - Virtual, Online, Portugal. Duration: 6 Oct 2021 → 9 Oct 2021

Publication series

Name	2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021

Abstract

The importance of the neighborhood for training a local surrogate model to approximate the local decision boundary of a black-box classifier has already been highlighted in the literature. Several attempts have been made to construct a better neighborhood for high-dimensional data, such as text, by using generative autoencoders. However, existing approaches mainly generate neighbors by sampling purely at random from the latent space and, under the curse of dimensionality, struggle to learn a good local decision boundary. To overcome this problem, we propose a progressive approximation of the neighborhood that uses counterfactual instances as initial landmarks and a careful two-stage sampling approach to refine counterfactuals and generate factuals in the neighborhood of the input instance to be explained. Our work focuses on textual data, and our explanations consist of both word-level explanations from the original instance (intrinsic) and the neighborhood (extrinsic), and factual and counterfactual instances discovered during the neighborhood generation process that further reveal the effect of altering certain parts of the input text. Our experiments on real-world datasets demonstrate that our method outperforms the competitors in terms of usefulness and stability (for the qualitative part) and completeness, compactness and correctness (for the quantitative part).
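
The idea of using counterfactuals as landmarks and then sampling around them can be illustrated with a toy sketch. This is not the authors' implementation. It assumes that 2-D vectors stand in for autoencoder latent codes, that a simple linear rule stands in for the black-box classifier, and that bisection plus jittered interpolation stand in for the two sampling stages. All function names (`black_box`, `interpolate`, `refine_counterfactual`, `build_neighborhood`) are hypothetical.

```python
import random


def black_box(z):
    # Toy stand-in for a black-box classifier over a 2-D "latent" vector.
    return 1 if z[0] + z[1] > 1.0 else 0


def interpolate(a, b, t):
    # Point at fraction t along the segment from a to b.
    return [ai + t * (bi - ai) for ai, bi in zip(a, b)]


def refine_counterfactual(x, landmark, predict, steps=20):
    # Assumed stage 1: bisect the segment x -> landmark so the returned
    # point keeps the landmark's (different) label but lies close to x.
    y = predict(x)
    lo, hi = 0.0, 1.0  # t=0 is x itself, t=1 is the landmark
    for _ in range(steps):
        mid = (lo + hi) / 2.0
        if predict(interpolate(x, landmark, mid)) == y:
            lo = mid  # still the original label: move away from x
        else:
            hi = mid  # label differs: move toward x
    return interpolate(x, landmark, hi)


def build_neighborhood(x, pool, predict, n_per_landmark=20, noise=0.02, seed=0):
    rng = random.Random(seed)
    y = predict(x)
    # Initial landmarks: pool points the classifier labels differently from x.
    landmarks = [p for p in pool if predict(p) != y]
    neighbors = [list(x)]
    for lm in landmarks:
        cf = refine_counterfactual(x, lm, predict)
        neighbors.append(cf)
        # Assumed stage 2: jittered samples along and slightly beyond x -> cf.
        for _ in range(n_per_landmark):
            point = interpolate(x, cf, rng.uniform(0.0, 1.5))
            neighbors.append([v + rng.gauss(0.0, noise) for v in point])
    # Split the neighborhood by the black-box label.
    factuals = [z for z in neighbors if predict(z) == y]
    counterfactuals = [z for z in neighbors if predict(z) != y]
    return factuals, counterfactuals


if __name__ == "__main__":
    x = [0.2, 0.2]
    pool = [[1.0, 1.0], [0.0, 0.1], [0.9, 0.8]]
    factuals, counterfactuals = build_neighborhood(x, pool, black_box)
    print(len(factuals), "factuals,", len(counterfactuals), "counterfactuals")
```

In a setting like the one the abstract describes, a local surrogate model would then be trained on such a labeled neighborhood. The sketch stops before that step.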

ASJC Scopus subject areas

Computer Science (all)
Computer Networks and Communications
Computer Science (all)
Signal Processing
Decision Sciences (all)
Information Systems and Management
Decision Sciences (all)
Statistics, Probability and Uncertainty

Cite this

XPROAX: Local explanations for text classification with progressive neighborhood approximation. / Cai, Yi; Zimek, Arthur; Ntoutsi, Eirini.
2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021. Institute of Electrical and Electronics Engineers Inc., 2021. (2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021).

Publication: Chapter in book/report/anthology/conference proceedings › Conference contribution › Research › Peer-reviewed

Cai, Y, Zimek, A & Ntoutsi, E 2021, XPROAX: Local explanations for text classification with progressive neighborhood approximation. in 2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021. 2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021, Institute of Electrical and Electronics Engineers Inc., 8th IEEE International Conference on Data Science and Advanced Analytics, DSAA 2021, Virtual, Online, Portugal, 6 Oct 2021. https://doi.org/10.48550/arXiv.2109.15004, https://doi.org/10.1109/DSAA53316.2021.9564153

Cai, Y., Zimek, A., & Ntoutsi, E. (2021). XPROAX: Local explanations for text classification with progressive neighborhood approximation. In 2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021 (2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.48550/arXiv.2109.15004, https://doi.org/10.1109/DSAA53316.2021.9564153

Cai Y, Zimek A, Ntoutsi E. XPROAX: Local explanations for text classification with progressive neighborhood approximation. in 2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021. Institute of Electrical and Electronics Engineers Inc. 2021. (2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021). doi: 10.48550/arXiv.2109.15004, 10.1109/DSAA53316.2021.9564153

Cai, Yi ; Zimek, Arthur ; Ntoutsi, Eirini. / XPROAX : Local explanations for text classification with progressive neighborhood approximation. 2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021. Institute of Electrical and Electronics Engineers Inc., 2021. (2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021).

Download

@inproceedings{f80bed28556d4802b4a887a9249c65fb,

title = "XPROAX: Local explanations for text classification with progressive neighborhood approximation",

abstract = "The importance of the neighborhood for training a local surrogate model to approximate the local decision boundary of a black box classifier has been already highlighted in the literature. Several attempts have been made to construct a better neighborhood for high dimensional data, like texts, by using generative autoencoders. However, existing approaches mainly generate neighbors by selecting purely at random from the latent space and struggle under the curse of dimensionality to learn a good local decision boundary. To overcome this problem, we propose a progressive approximation of the neighborhood using counterfactual instances as initial landmarks and a careful 2-stage sampling approach to refine counterfactuals and generate factuals in the neighborhood of the input instance to be explained. Our work focuses on textual data and our explanations consist of both word-level explanations from the original instance (intrinsic) and the neighborhood (extrinsic) and factual- and counterfactual-instances discovered during the neighborhood generation process that further reveal the effect of altering certain parts in the input text. Our experiments on real-world datasets demonstrate that our method outperforms the competitors in terms of usefulness and stability (for the qualitative part) and completeness, compactness and correctness (for the quantitative part).",

keywords = "Counterfactuals, Explainable AI, Local explanations, Neighborhood approximation, Text classification",

author = "Yi Cai and Arthur Zimek and Eirini Ntoutsi",

note = "Funding Information: The first author is supported by the State Ministry of Science and Culture of Lower Saxony, within the PhD program “Responsible Artificial Intelligence in the Digital Society”. We also thank Philip Naumann for the insightful discussions. ; 8th IEEE International Conference on Data Science and Advanced Analytics, DSAA 2021 ; Conference date: 06-10-2021 Through 09-10-2021",

year = "2021",

doi = "10.48550/arXiv.2109.15004",

language = "English",

isbn = "978-1-6654-2100-3",

series = "2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

booktitle = "2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021",

address = "United States",

}

Download

TY - GEN

T1 - XPROAX

T2 - 8th IEEE International Conference on Data Science and Advanced Analytics, DSAA 2021

AU - Cai, Yi

AU - Zimek, Arthur

AU - Ntoutsi, Eirini

N1 - Funding Information: The first author is supported by the State Ministry of Science and Culture of Lower Saxony, within the PhD program “Responsible Artificial Intelligence in the Digital Society”. We also thank Philip Naumann for the insightful discussions.

PY - 2021

Y1 - 2021

N2 - The importance of the neighborhood for training a local surrogate model to approximate the local decision boundary of a black box classifier has been already highlighted in the literature. Several attempts have been made to construct a better neighborhood for high dimensional data, like texts, by using generative autoencoders. However, existing approaches mainly generate neighbors by selecting purely at random from the latent space and struggle under the curse of dimensionality to learn a good local decision boundary. To overcome this problem, we propose a progressive approximation of the neighborhood using counterfactual instances as initial landmarks and a careful 2-stage sampling approach to refine counterfactuals and generate factuals in the neighborhood of the input instance to be explained. Our work focuses on textual data and our explanations consist of both word-level explanations from the original instance (intrinsic) and the neighborhood (extrinsic) and factual- and counterfactual-instances discovered during the neighborhood generation process that further reveal the effect of altering certain parts in the input text. Our experiments on real-world datasets demonstrate that our method outperforms the competitors in terms of usefulness and stability (for the qualitative part) and completeness, compactness and correctness (for the quantitative part).

AB - The importance of the neighborhood for training a local surrogate model to approximate the local decision boundary of a black box classifier has been already highlighted in the literature. Several attempts have been made to construct a better neighborhood for high dimensional data, like texts, by using generative autoencoders. However, existing approaches mainly generate neighbors by selecting purely at random from the latent space and struggle under the curse of dimensionality to learn a good local decision boundary. To overcome this problem, we propose a progressive approximation of the neighborhood using counterfactual instances as initial landmarks and a careful 2-stage sampling approach to refine counterfactuals and generate factuals in the neighborhood of the input instance to be explained. Our work focuses on textual data and our explanations consist of both word-level explanations from the original instance (intrinsic) and the neighborhood (extrinsic) and factual- and counterfactual-instances discovered during the neighborhood generation process that further reveal the effect of altering certain parts in the input text. Our experiments on real-world datasets demonstrate that our method outperforms the competitors in terms of usefulness and stability (for the qualitative part) and completeness, compactness and correctness (for the quantitative part).

KW - Counterfactuals

KW - Explainable AI

KW - Local explanations

KW - Neighborhood approximation

KW - Text classification

UR - http://www.scopus.com/inward/record.url?scp=85126082401&partnerID=8YFLogxK

U2 - 10.48550/arXiv.2109.15004

DO - 10.48550/arXiv.2109.15004

M3 - Conference contribution

AN - SCOPUS:85126082401

SN - 978-1-6654-2100-3

T3 - 2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021

BT - 2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021

PB - Institute of Electrical and Electronics Engineers Inc.

Y2 - 6 October 2021 through 9 October 2021

ER -
