SparCAssist: A Model Risk Assessment Assistant Based on Sparse Generated Counterfactuals

Zijian Zhang; Vinay Setty; Avishek Anand

doi:10.48550/arXiv.2205.01588

Details

Original language	English
Title of host publication	SIGIR 2022
Subtitle	Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval
Pages	3219-3223
Number of pages	5
ISBN (electronic)	9781450387323
Publication status	Published - 7 July 2022
Event	45th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2022 - Madrid, Spain. Duration: 11 July 2022 → 15 July 2022

Abstract

We introduce SparCAssist, a general-purpose risk assessment tool for the machine learning models trained for language tasks. It evaluates models' risk by inspecting their behavior on counterfactuals, namely out-of-distribution instances generated based on the given data instance. The counterfactuals are generated by replacing tokens in rational subsequences identified by ExPred, while the replacements are retrieved using HotFlip or the Masked-Language-Model-based algorithms. The main purpose of our system is to help the human annotators to assess the model's risk on deployment. The counterfactual instances generated during the assessment are the by-product and can be used to train more robust NLP models in the future.

ASJC Scopus subject areas

Computer Science (all)
Computer Graphics and Computer-Aided Design
Computer Science (all)
Information Systems
Computer Science (all)
Software

Cite this

SparCAssist: A Model Risk Assessment Assistant Based on Sparse Generated Counterfactuals. / Zhang, Zijian; Setty, Vinay; Anand, Avishek.
SIGIR 2022 : Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2022. pp. 3219-3223.

Publication: Chapter in book/report/conference proceeding › Conference contribution › Research › Peer-reviewed

Zhang, Z, Setty, V & Anand, A 2022, SparCAssist: A Model Risk Assessment Assistant Based on Sparse Generated Counterfactuals. in SIGIR 2022 : Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. pp. 3219-3223, 45th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2022, Madrid, Spain, 11 July 2022. https://doi.org/10.48550/arXiv.2205.01588, https://doi.org/10.1145/3477495.3531677

Zhang, Z., Setty, V., & Anand, A. (2022). SparCAssist: A Model Risk Assessment Assistant Based on Sparse Generated Counterfactuals. In SIGIR 2022 : Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 3219-3223) https://doi.org/10.48550/arXiv.2205.01588, https://doi.org/10.1145/3477495.3531677

Zhang Z, Setty V, Anand A. SparCAssist: A Model Risk Assessment Assistant Based on Sparse Generated Counterfactuals. in SIGIR 2022 : Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2022. pp. 3219-3223 doi: 10.48550/arXiv.2205.01588, 10.1145/3477495.3531677

Zhang, Zijian ; Setty, Vinay ; Anand, Avishek. / SparCAssist : A Model Risk Assessment Assistant Based on Sparse Generated Counterfactuals. SIGIR 2022 : Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2022. pp. 3219-3223


@inproceedings{c4365e9f870b46f184a001972087d438,

title = "SparCAssist: A Model Risk Assessment Assistant Based on Sparse Generated Counterfactuals",

abstract = "We introduce SparCAssist, a general-purpose risk assessment tool for the machine learning models trained for language tasks. It evaluates models' risk by inspecting their behavior on counterfactuals, namely out-of-distribution instances generated based on the given data instance. The counterfactuals are generated by replacing tokens in rational subsequences identified by ExPred, while the replacements are retrieved using HotFlip or the Masked-Language-Model-based algorithms. The main purpose of our system is to help the human annotators to assess the model's risk on deployment. The counterfactual instances generated during the assessment are the by-product and can be used to train more robust NLP models in the future.",

keywords = "counterfactual interpretation, data-annotation tools, human-in-the-loop machine learning, interpretable machine learning",

author = "Zijian Zhang and Vinay Setty and Avishek Anand",

note = "Funding Information: This work is partially funded by project MIRROR under grant agreement No. 832921 (project MIRROR from the European Commission: Migration-Related Risks caused by misconceptions of Opportunities and Requirement) and project ROXANNE, the European Union{\textquoteright}s Horizon 2020 research and innovation program under grant agreement No. 833635.; 45th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2022 ; Conference date: 11-07-2022 Through 15-07-2022",

year = "2022",

month = jul,

day = "7",

doi = "10.48550/arXiv.2205.01588",

language = "English",

pages = "3219--3223",

booktitle = "SIGIR 2022",

}


TY - GEN

T1 - SparCAssist

T2 - 45th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2022

AU - Zhang, Zijian

AU - Setty, Vinay

AU - Anand, Avishek

N1 - Funding Information: This work is partially funded by project MIRROR under grant agreement No. 832921 (project MIRROR from the European Commission: Migration-Related Risks caused by misconceptions of Opportunities and Requirement) and project ROXANNE, the European Union’s Horizon 2020 research and innovation program under grant agreement No. 833635.

PY - 2022/7/7

Y1 - 2022/7/7

N2 - We introduce SparCAssist, a general-purpose risk assessment tool for the machine learning models trained for language tasks. It evaluates models' risk by inspecting their behavior on counterfactuals, namely out-of-distribution instances generated based on the given data instance. The counterfactuals are generated by replacing tokens in rational subsequences identified by ExPred, while the replacements are retrieved using HotFlip or the Masked-Language-Model-based algorithms. The main purpose of our system is to help the human annotators to assess the model's risk on deployment. The counterfactual instances generated during the assessment are the by-product and can be used to train more robust NLP models in the future.

AB - We introduce SparCAssist, a general-purpose risk assessment tool for the machine learning models trained for language tasks. It evaluates models' risk by inspecting their behavior on counterfactuals, namely out-of-distribution instances generated based on the given data instance. The counterfactuals are generated by replacing tokens in rational subsequences identified by ExPred, while the replacements are retrieved using HotFlip or the Masked-Language-Model-based algorithms. The main purpose of our system is to help the human annotators to assess the model's risk on deployment. The counterfactual instances generated during the assessment are the by-product and can be used to train more robust NLP models in the future.

KW - counterfactual interpretation

KW - data-annotation tools

KW - human-in-the-loop machine learning

KW - interpretable machine learning

UR - http://www.scopus.com/inward/record.url?scp=85135007122&partnerID=8YFLogxK

U2 - 10.48550/arXiv.2205.01588

DO - 10.48550/arXiv.2205.01588

M3 - Conference contribution

AN - SCOPUS:85135007122

SP - 3219

EP - 3223

BT - SIGIR 2022

Y2 - 11 July 2022 through 15 July 2022

ER -
