XPROAX: Local explanations for text classification with progressive neighborhood approximation

Yi Cai; Arthur Zimek; Eirini Ntoutsi

doi:10.48550/arXiv.2109.15004

Details

Original language	English
Title of host publication	2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021
Publisher	Institute of Electrical and Electronics Engineers Inc.
Number of pages	10
ISBN (electronic)	9781665420990
ISBN (print)	978-1-6654-2100-3
Publication status	Published - 2021
Event	8th IEEE International Conference on Data Science and Advanced Analytics, DSAA 2021 - Virtual, Online, Portugal Duration: 6 Oct 2021 → 9 Oct 2021

Publication series

Name	2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021

Abstract

The importance of the neighborhood for training a local surrogate model to approximate the local decision boundary of a black box classifier has been already highlighted in the literature. Several attempts have been made to construct a better neighborhood for high dimensional data, like texts, by using generative autoencoders. However, existing approaches mainly generate neighbors by selecting purely at random from the latent space and struggle under the curse of dimensionality to learn a good local decision boundary. To overcome this problem, we propose a progressive approximation of the neighborhood using counterfactual instances as initial landmarks and a careful 2-stage sampling approach to refine counterfactuals and generate factuals in the neighborhood of the input instance to be explained. Our work focuses on textual data and our explanations consist of both word-level explanations from the original instance (intrinsic) and the neighborhood (extrinsic) and factual- and counterfactual-instances discovered during the neighborhood generation process that further reveal the effect of altering certain parts in the input text. Our experiments on real-world datasets demonstrate that our method outperforms the competitors in terms of usefulness and stability (for the qualitative part) and completeness, compactness and correctness (for the quantitative part).

Keywords

Counterfactuals, Explainable AI, Local explanations, Neighborhood approximation, Text classification

ASJC Scopus subject areas

Computer Science(all)
Computer Networks and Communications
Computer Science(all)
Signal Processing
Decision Sciences(all)
Information Systems and Management
Decision Sciences(all)
Statistics, Probability and Uncertainty

Cite this

XPROAX: Local explanations for text classification with progressive neighborhood approximation. / Cai, Yi; Zimek, Arthur; Ntoutsi, Eirini.
2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021. Institute of Electrical and Electronics Engineers Inc., 2021. (2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021).

Research output: Chapter in book/report/conference proceeding › Conference contribution › Research › peer review

Cai, Y, Zimek, A & Ntoutsi, E 2021, XPROAX: Local explanations for text classification with progressive neighborhood approximation. in 2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021. 2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021, Institute of Electrical and Electronics Engineers Inc., 8th IEEE International Conference on Data Science and Advanced Analytics, DSAA 2021, Virtual, Online, Portugal, 6 Oct 2021. https://doi.org/10.48550/arXiv.2109.15004, https://doi.org/10.1109/DSAA53316.2021.9564153

Cai, Y., Zimek, A., & Ntoutsi, E. (2021). XPROAX: Local explanations for text classification with progressive neighborhood approximation. In 2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021 (2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.48550/arXiv.2109.15004, https://doi.org/10.1109/DSAA53316.2021.9564153

Cai Y, Zimek A, Ntoutsi E. XPROAX: Local explanations for text classification with progressive neighborhood approximation. In 2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021. Institute of Electrical and Electronics Engineers Inc. 2021. (2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021). doi: 10.48550/arXiv.2109.15004, 10.1109/DSAA53316.2021.9564153

Cai, Yi ; Zimek, Arthur ; Ntoutsi, Eirini. / XPROAX : Local explanations for text classification with progressive neighborhood approximation. 2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021. Institute of Electrical and Electronics Engineers Inc., 2021. (2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021).

Download

@inproceedings{f80bed28556d4802b4a887a9249c65fb,

title = "XPROAX: Local explanations for text classification with progressive neighborhood approximation",

abstract = "The importance of the neighborhood for training a local surrogate model to approximate the local decision boundary of a black box classifier has been already highlighted in the literature. Several attempts have been made to construct a better neighborhood for high dimensional data, like texts, by using generative autoencoders. However, existing approaches mainly generate neighbors by selecting purely at random from the latent space and struggle under the curse of dimensionality to learn a good local decision boundary. To overcome this problem, we propose a progressive approximation of the neighborhood using counterfactual instances as initial landmarks and a careful 2-stage sampling approach to refine counterfactuals and generate factuals in the neighborhood of the input instance to be explained. Our work focuses on textual data and our explanations consist of both word-level explanations from the original instance (intrinsic) and the neighborhood (extrinsic) and factual- and counterfactual-instances discovered during the neighborhood generation process that further reveal the effect of altering certain parts in the input text. Our experiments on real-world datasets demonstrate that our method outperforms the competitors in terms of usefulness and stability (for the qualitative part) and completeness, compactness and correctness (for the quantitative part).",

keywords = "Counterfactuals, Explainable AI, Local explanations, Neighborhood approximation, Text classification",

author = "Yi Cai and Arthur Zimek and Eirini Ntoutsi",

note = "Funding Information: The first author is supported by the State Ministry of Science and Culture of Lower Saxony, within the PhD program “Responsible Artificial Intelligence in the Digital Society”. We also thank Philip Naumann for the insightful discussions. ; 8th IEEE International Conference on Data Science and Advanced Analytics, DSAA 2021 ; Conference date: 06-10-2021 Through 09-10-2021",

year = "2021",

doi = "10.48550/arXiv.2109.15004",

language = "English",

isbn = "978-1-6654-2100-3",

series = "2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

booktitle = "2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021",

address = "United States",

}

Download

TY - GEN

T1 - XPROAX

T2 - 8th IEEE International Conference on Data Science and Advanced Analytics, DSAA 2021

AU - Cai, Yi

AU - Zimek, Arthur

AU - Ntoutsi, Eirini

N1 - Funding Information: The first author is supported by the State Ministry of Science and Culture of Lower Saxony, within the PhD program “Responsible Artificial Intelligence in the Digital Society”. We also thank Philip Naumann for the insightful discussions.

PY - 2021

Y1 - 2021

N2 - The importance of the neighborhood for training a local surrogate model to approximate the local decision boundary of a black box classifier has been already highlighted in the literature. Several attempts have been made to construct a better neighborhood for high dimensional data, like texts, by using generative autoencoders. However, existing approaches mainly generate neighbors by selecting purely at random from the latent space and struggle under the curse of dimensionality to learn a good local decision boundary. To overcome this problem, we propose a progressive approximation of the neighborhood using counterfactual instances as initial landmarks and a careful 2-stage sampling approach to refine counterfactuals and generate factuals in the neighborhood of the input instance to be explained. Our work focuses on textual data and our explanations consist of both word-level explanations from the original instance (intrinsic) and the neighborhood (extrinsic) and factual- and counterfactual-instances discovered during the neighborhood generation process that further reveal the effect of altering certain parts in the input text. Our experiments on real-world datasets demonstrate that our method outperforms the competitors in terms of usefulness and stability (for the qualitative part) and completeness, compactness and correctness (for the quantitative part).

AB - The importance of the neighborhood for training a local surrogate model to approximate the local decision boundary of a black box classifier has been already highlighted in the literature. Several attempts have been made to construct a better neighborhood for high dimensional data, like texts, by using generative autoencoders. However, existing approaches mainly generate neighbors by selecting purely at random from the latent space and struggle under the curse of dimensionality to learn a good local decision boundary. To overcome this problem, we propose a progressive approximation of the neighborhood using counterfactual instances as initial landmarks and a careful 2-stage sampling approach to refine counterfactuals and generate factuals in the neighborhood of the input instance to be explained. Our work focuses on textual data and our explanations consist of both word-level explanations from the original instance (intrinsic) and the neighborhood (extrinsic) and factual- and counterfactual-instances discovered during the neighborhood generation process that further reveal the effect of altering certain parts in the input text. Our experiments on real-world datasets demonstrate that our method outperforms the competitors in terms of usefulness and stability (for the qualitative part) and completeness, compactness and correctness (for the quantitative part).

KW - Counterfactuals

KW - Explainable AI

KW - Local explanations

KW - Neighborhood approximation

KW - Text classification

UR - http://www.scopus.com/inward/record.url?scp=85126082401&partnerID=8YFLogxK

U2 - 10.48550/arXiv.2109.15004

DO - 10.48550/arXiv.2109.15004

M3 - Conference contribution

AN - SCOPUS:85126082401

SN - 978-1-6654-2100-3

T3 - 2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021

BT - 2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021

PB - Institute of Electrical and Electronics Engineers Inc.

Y2 - 6 October 2021 through 9 October 2021

ER -

Research@Leibniz University

XPROAX: Local explanations for text classification with progressive neighborhood approximation

Authors

Research Organisations

External Research Organisations