Explain and Predict, and then Predict Again

Publication: Contribution to book/report/anthology/conference proceedings › Paper in conference proceedings › Research › Peer-reviewed

Authors

  • Zijian Zhang
  • Koustav Rudra
  • Avishek Anand

Details

Original language: English
Title of host publication: WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining
Pages: 418-426
Number of pages: 9
ISBN (electronic): 9781450382977
Publication status: Published - 8 March 2021
Event: 14th ACM International Conference on Web Search and Data Mining - Virtual, Online, Israel
Duration: 8 March 2021 - 12 March 2021

Abstract

A desirable property of learning systems is to be both effective and interpretable. Towards this goal, recent models have been proposed that first generate an extractive explanation from the input text and then generate a prediction based only on that explanation; these are called explain-then-predict models. These models primarily consider the task input as a supervision signal for learning an extractive explanation and do not effectively integrate rationale data as an additional inductive bias to improve task performance. We propose a novel yet simple approach, ExPred, which uses multi-task learning in the explanation-generation phase to effectively trade off explanation and prediction losses. Next, we use another prediction network on just the extracted explanations to optimize task performance. We conduct an extensive evaluation of our approach on three diverse language datasets - sentiment classification, fact-checking, and question answering - and find that we substantially outperform existing approaches.
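The following is a minimal sketch of the two-phase setup the abstract describes, assuming PyTorch; the BiLSTM encoders, hidden sizes, and the loss weight `lam` are illustrative assumptions, not the paper's exact architecture. Phase one tags rationale tokens while also making an auxiliary task prediction (the multi-task trade-off), and phase two predicts from the extracted rationale alone.

import torch
import torch.nn as nn
import torch.nn.functional as F

class ExplanationGenerator(nn.Module):
    # Phase 1: score each token as rationale/non-rationale and also make an
    # auxiliary task prediction, so both losses can shape the explanation.
    def __init__(self, vocab_size, emb_dim=100, hid=128, num_classes=2):
        super().__init__()
        self.emb = nn.Embedding(vocab_size, emb_dim)
        self.enc = nn.LSTM(emb_dim, hid, batch_first=True, bidirectional=True)
        self.tok_head = nn.Linear(2 * hid, 1)            # per-token rationale logit
        self.cls_head = nn.Linear(2 * hid, num_classes)  # auxiliary task prediction

    def forward(self, tokens):                  # tokens: (batch, seq_len) int ids
        h, _ = self.enc(self.emb(tokens))       # (batch, seq_len, 2*hid)
        tok_logits = self.tok_head(h).squeeze(-1)    # (batch, seq_len)
        cls_logits = self.cls_head(h.mean(dim=1))    # (batch, num_classes)
        return tok_logits, cls_logits

class ExplanationPredictor(nn.Module):
    # Phase 2: a separate network that sees only the extracted explanation.
    def __init__(self, vocab_size, emb_dim=100, hid=128, num_classes=2):
        super().__init__()
        self.emb = nn.Embedding(vocab_size, emb_dim)
        self.enc = nn.LSTM(emb_dim, hid, batch_first=True, bidirectional=True)
        self.cls_head = nn.Linear(2 * hid, num_classes)

    def forward(self, tokens, rationale_mask):  # mask: (batch, seq_len), values in {0, 1}
        # Zero out non-rationale tokens so the predictor sees only the explanation.
        x = self.emb(tokens) * rationale_mask.float().unsqueeze(-1)
        h, _ = self.enc(x)
        return self.cls_head(h.mean(dim=1))

def multitask_loss(tok_logits, cls_logits, rationale_labels, task_labels, lam=0.5):
    # Trade off the explanation loss (supervised by human rationale annotations)
    # against the auxiliary prediction loss; `lam` is an assumed hyperparameter.
    exp_loss = F.binary_cross_entropy_with_logits(tok_logits, rationale_labels.float())
    pred_loss = F.cross_entropy(cls_logits, task_labels)
    return lam * exp_loss + (1.0 - lam) * pred_loss

At inference time, the generator's token logits would be thresholded into a hard rationale mask, which becomes the only input the second predictor receives; that restriction is what makes the pipeline interpretable by design.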

ASJC Scopus subject areas

Cite

Explain and Predict, and then Predict Again. / Zhang, Zijian; Rudra, Koustav; Anand, Avishek.
WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining. 2021. pp. 418-426.

Zhang, Z, Rudra, K & Anand, A 2021, Explain and Predict, and then Predict Again. in WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining. pp. 418-426, 14th ACM International Conference on Web Search and Data Mining, Virtual, Online, Israel, 8 March 2021. https://doi.org/10.1145/3437963.3441758
Zhang, Z., Rudra, K., & Anand, A. (2021). Explain and Predict, and then Predict Again. In WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining (pp. 418-426). https://doi.org/10.1145/3437963.3441758
Zhang Z, Rudra K, Anand A. Explain and Predict, and then Predict Again. In WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining. 2021. pp. 418-426. doi: 10.1145/3437963.3441758
Zhang, Zijian ; Rudra, Koustav ; Anand, Avishek. / Explain and Predict, and then Predict Again. WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining. 2021. pp. 418-426
BibTeX
@inproceedings{2250670cb9aa4997bb9925f651b2eebb,
title = "Explain and Predict, and then Predict Again",
abstract = "A desirable property of learning systems is to be both effective and interpretable. Towards this goal, recent models have been proposed that first generate an extractive explanation from the input text and then generate a prediction on just the explanation called explain-then-predict models. These models primarily consider the task input as a supervision signal in learning an extractive explanation and do not effectively integrate rationales data as an additional inductive bias to improve task performance. We propose a novel yet simple approach ExPred, which uses multi-task learning in the explanation generation phase effectively trading-off explanation and prediction losses. Next, we use another prediction network on just the extracted explanations for optimizing the task performance. We conduct an extensive evaluation of our approach on three diverse language datasets - sentiment classification, fact-checking, and question answering - and find that we substantially outperform existing approaches.",
keywords = "explanation, interpretable by design, learning with rationales, machine learning interpretation, multitask learning",
author = "Zijian Zhang and Koustav Rudra and Avishek Anand",
note = "Funding Information: Funding for this project was in part provided by the European Union's Horizon 2020 research and innovation program under grant agreement No 832921, and No 871042.; 14th ACM International Conference on Web Search and Data Mining, WSDM 2021 ; Conference date: 08-03-2021 Through 12-03-2021",
year = "2021",
month = mar,
day = "8",
doi = "10.1145/3437963.3441758",
language = "English",
pages = "418--426",
booktitle = "WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining",

}

RIS

TY - GEN

T1 - Explain and Predict, and then Predict Again

AU - Zhang, Zijian

AU - Rudra, Koustav

AU - Anand, Avishek

N1 - Funding Information: Funding for this project was in part provided by the European Union's Horizon 2020 research and innovation program under grant agreement No 832921, and No 871042.

PY - 2021/3/8

Y1 - 2021/3/8

N2 - A desirable property of learning systems is to be both effective and interpretable. Towards this goal, recent models have been proposed that first generate an extractive explanation from the input text and then generate a prediction on just the explanation called explain-then-predict models. These models primarily consider the task input as a supervision signal in learning an extractive explanation and do not effectively integrate rationales data as an additional inductive bias to improve task performance. We propose a novel yet simple approach ExPred, which uses multi-task learning in the explanation generation phase effectively trading-off explanation and prediction losses. Next, we use another prediction network on just the extracted explanations for optimizing the task performance. We conduct an extensive evaluation of our approach on three diverse language datasets - sentiment classification, fact-checking, and question answering - and find that we substantially outperform existing approaches.

AB - A desirable property of learning systems is to be both effective and interpretable. Towards this goal, recent models have been proposed that first generate an extractive explanation from the input text and then generate a prediction on just the explanation called explain-then-predict models. These models primarily consider the task input as a supervision signal in learning an extractive explanation and do not effectively integrate rationales data as an additional inductive bias to improve task performance. We propose a novel yet simple approach ExPred, which uses multi-task learning in the explanation generation phase effectively trading-off explanation and prediction losses. Next, we use another prediction network on just the extracted explanations for optimizing the task performance. We conduct an extensive evaluation of our approach on three diverse language datasets - sentiment classification, fact-checking, and question answering - and find that we substantially outperform existing approaches.

KW - explanation

KW - interpretable by design

KW - learning with rationales

KW - machine learning interpretation

KW - multitask learning

UR - http://www.scopus.com/inward/record.url?scp=85103007189&partnerID=8YFLogxK

U2 - 10.1145/3437963.3441758

DO - 10.1145/3437963.3441758

M3 - Conference contribution

AN - SCOPUS:85103007189

SP - 418

EP - 426

BT - WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining

T2 - 14th ACM International Conference on Web Search and Data Mining, WSDM 2021

Y2 - 8 March 2021 through 12 March 2021

ER -