Details
Originalsprache | Englisch |
---|---|
Titel des Sammelwerks | SIGIR 2022 |
Untertitel | Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval |
Seiten | 3219-3223 |
Seitenumfang | 5 |
ISBN (elektronisch) | 9781450387323 |
Publikationsstatus | Veröffentlicht - 7 Juli 2022 |
Veranstaltung | 45th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2022 - Madrid, Spanien Dauer: 11 Juli 2022 → 15 Juli 2022 |
Abstract
We introduce SparCAssist, a general-purpose risk assessment tool for the machine learning models trained for language tasks. It evaluates models' risk by inspecting their behavior on counterfactuals, namely out-of-distribution instances generated based on the given data instance. The counterfactuals are generated by replacing tokens in rational subsequences identified by ExPred, while the replacements are retrieved using HotFlip or the Masked-Language-Model-based algorithms. The main purpose of our system is to help the human annotators to assess the model's risk on deployment. The counterfactual instances generated during the assessment are the by-product and can be used to train more robust NLP models in the future.
ASJC Scopus Sachgebiete
- Informatik (insg.)
- Computergrafik und computergestütztes Design
- Informatik (insg.)
- Information systems
- Informatik (insg.)
- Software
Zitieren
- Standard
- Harvard
- Apa
- Vancouver
- BibTex
- RIS
SIGIR 2022 : Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2022. S. 3219-3223.
Publikation: Beitrag in Buch/Bericht/Sammelwerk/Konferenzband › Aufsatz in Konferenzband › Forschung › Peer-Review
}
TY - GEN
T1 - SparCAssist
T2 - 45th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2022
AU - Zhang, Zijian
AU - Setty, Vinay
AU - Anand, Avishek
N1 - Funding Information: This work is partially funded by project MIRROR under grant agreement No. 832921 (project MIRROR from the European Commission: Migration-Related Risks caused by misconceptions of Opportunities and Requirement) and project ROXANNE, the European Union’s Horizon 2020 research and innovation program under grant agreement No. 833635.
PY - 2022/7/7
Y1 - 2022/7/7
N2 - We introduce SparCAssist, a general-purpose risk assessment tool for the machine learning models trained for language tasks. It evaluates models' risk by inspecting their behavior on counterfactuals, namely out-of-distribution instances generated based on the given data instance. The counterfactuals are generated by replacing tokens in rational subsequences identified by ExPred, while the replacements are retrieved using HotFlip or the Masked-Language-Model-based algorithms. The main purpose of our system is to help the human annotators to assess the model's risk on deployment. The counterfactual instances generated during the assessment are the by-product and can be used to train more robust NLP models in the future.
AB - We introduce SparCAssist, a general-purpose risk assessment tool for the machine learning models trained for language tasks. It evaluates models' risk by inspecting their behavior on counterfactuals, namely out-of-distribution instances generated based on the given data instance. The counterfactuals are generated by replacing tokens in rational subsequences identified by ExPred, while the replacements are retrieved using HotFlip or the Masked-Language-Model-based algorithms. The main purpose of our system is to help the human annotators to assess the model's risk on deployment. The counterfactual instances generated during the assessment are the by-product and can be used to train more robust NLP models in the future.
KW - counterfactual interpretation
KW - data-annotation tools
KW - human-in-the-loop machine learning
KW - interpretable machine learning
UR - http://www.scopus.com/inward/record.url?scp=85135007122&partnerID=8YFLogxK
U2 - 10.48550/arXiv.2205.01588
DO - 10.48550/arXiv.2205.01588
M3 - Conference contribution
AN - SCOPUS:85135007122
SP - 3219
EP - 3223
BT - SIGIR 2022
Y2 - 11 July 2022 through 15 July 2022
ER -