Towards an Interpretable Approach to Classify and Summarize Crisis Events from Microblogs

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Autoren

  • Thi Huyen Nguyen
  • Koustav Rudra

Organisationseinheiten

Externe Organisationen

  • Indian School of Mines University
Forschungs-netzwerk anzeigen

Details

OriginalspracheEnglisch
Titel des SammelwerksWWW '22
UntertitelProceedings of the ACM Web Conference 2022
Seiten3641-3650
Seitenumfang10
ISBN (elektronisch)9781450390965
PublikationsstatusVeröffentlicht - 25 Apr. 2022
Veranstaltung31st ACM World Wide Web Conference, WWW 2022 - Virtual, Online, Frankreich
Dauer: 25 Apr. 202229 Apr. 2022

Abstract

Microblogging platforms like Twitter have been heavily leveraged to report and exchange information about natural disasters. The real-time data on these sites is highly helpful in gaining situational awareness and planning aid efforts. However, disaster-related messages are immersed in a high volume of irrelevant information. The situational data of disaster events also vary greatly in terms of information types ranging from general situational awareness (caution, infrastructure damage, casualties) to individual needs or not related to the crisis. It thus requires efficient methods to handle data overload and prioritize various types of information. This paper proposes an interpretable classification-summarization framework that first classifies tweets into different disaster-related categories and then summarizes those tweets. Unlike existing work, our classification model can provide explanations or rationales for its decisions. In the summarization phase, we employ an Integer Linear Programming (ILP) based optimization technique along with the help of rationales to generate summaries of event categories. Extensive evaluation on large-scale disaster events shows (a). our model can classify tweets into disaster-related categories with an 85% Macro F1 score and high interpretability (b). the summarizer achieves (5-25%) improvement in terms of ROUGE-1 F-score over most state-of-the-art approaches.

ASJC Scopus Sachgebiete

Zitieren

Towards an Interpretable Approach to Classify and Summarize Crisis Events from Microblogs. / Nguyen, Thi Huyen; Rudra, Koustav.
WWW '22: Proceedings of the ACM Web Conference 2022. 2022. S. 3641-3650.

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Nguyen, TH & Rudra, K 2022, Towards an Interpretable Approach to Classify and Summarize Crisis Events from Microblogs. in WWW '22: Proceedings of the ACM Web Conference 2022. S. 3641-3650, 31st ACM World Wide Web Conference, WWW 2022, Virtual, Online, Frankreich, 25 Apr. 2022. https://doi.org/10.1145/3485447.3512259
Nguyen, T. H., & Rudra, K. (2022). Towards an Interpretable Approach to Classify and Summarize Crisis Events from Microblogs. In WWW '22: Proceedings of the ACM Web Conference 2022 (S. 3641-3650) https://doi.org/10.1145/3485447.3512259
Nguyen TH, Rudra K. Towards an Interpretable Approach to Classify and Summarize Crisis Events from Microblogs. in WWW '22: Proceedings of the ACM Web Conference 2022. 2022. S. 3641-3650 doi: 10.1145/3485447.3512259
Nguyen, Thi Huyen ; Rudra, Koustav. / Towards an Interpretable Approach to Classify and Summarize Crisis Events from Microblogs. WWW '22: Proceedings of the ACM Web Conference 2022. 2022. S. 3641-3650
Download
@inproceedings{f07c5ac09813436fa06b5949c732d6e3,
title = "Towards an Interpretable Approach to Classify and Summarize Crisis Events from Microblogs",
abstract = "Microblogging platforms like Twitter have been heavily leveraged to report and exchange information about natural disasters. The real-time data on these sites is highly helpful in gaining situational awareness and planning aid efforts. However, disaster-related messages are immersed in a high volume of irrelevant information. The situational data of disaster events also vary greatly in terms of information types ranging from general situational awareness (caution, infrastructure damage, casualties) to individual needs or not related to the crisis. It thus requires efficient methods to handle data overload and prioritize various types of information. This paper proposes an interpretable classification-summarization framework that first classifies tweets into different disaster-related categories and then summarizes those tweets. Unlike existing work, our classification model can provide explanations or rationales for its decisions. In the summarization phase, we employ an Integer Linear Programming (ILP) based optimization technique along with the help of rationales to generate summaries of event categories. Extensive evaluation on large-scale disaster events shows (a). our model can classify tweets into disaster-related categories with an 85% Macro F1 score and high interpretability (b). the summarizer achieves (5-25%) improvement in terms of ROUGE-1 F-score over most state-of-the-art approaches.",
keywords = "Classification, Crisis Events, Interpretability, Summarization",
author = "Nguyen, {Thi Huyen} and Koustav Rudra",
note = "Funding Information: This work was partially funded by the DFG Grant NI-1760/1-1 Funding Information: This material is based upon work supported by ONR (N00014-21-1-4002), and the U.S. Department of Homeland Security under Grant Award Number 17STQAC00001-05-003. Kai Shu is supported by the NSF award #2109316.; 31st ACM World Wide Web Conference, WWW 2022 ; Conference date: 25-04-2022 Through 29-04-2022",
year = "2022",
month = apr,
day = "25",
doi = "10.1145/3485447.3512259",
language = "English",
pages = "3641--3650",
booktitle = "WWW '22",

}

Download

TY - GEN

T1 - Towards an Interpretable Approach to Classify and Summarize Crisis Events from Microblogs

AU - Nguyen, Thi Huyen

AU - Rudra, Koustav

N1 - Funding Information: This work was partially funded by the DFG Grant NI-1760/1-1 Funding Information: This material is based upon work supported by ONR (N00014-21-1-4002), and the U.S. Department of Homeland Security under Grant Award Number 17STQAC00001-05-003. Kai Shu is supported by the NSF award #2109316.

PY - 2022/4/25

Y1 - 2022/4/25

N2 - Microblogging platforms like Twitter have been heavily leveraged to report and exchange information about natural disasters. The real-time data on these sites is highly helpful in gaining situational awareness and planning aid efforts. However, disaster-related messages are immersed in a high volume of irrelevant information. The situational data of disaster events also vary greatly in terms of information types ranging from general situational awareness (caution, infrastructure damage, casualties) to individual needs or not related to the crisis. It thus requires efficient methods to handle data overload and prioritize various types of information. This paper proposes an interpretable classification-summarization framework that first classifies tweets into different disaster-related categories and then summarizes those tweets. Unlike existing work, our classification model can provide explanations or rationales for its decisions. In the summarization phase, we employ an Integer Linear Programming (ILP) based optimization technique along with the help of rationales to generate summaries of event categories. Extensive evaluation on large-scale disaster events shows (a). our model can classify tweets into disaster-related categories with an 85% Macro F1 score and high interpretability (b). the summarizer achieves (5-25%) improvement in terms of ROUGE-1 F-score over most state-of-the-art approaches.

AB - Microblogging platforms like Twitter have been heavily leveraged to report and exchange information about natural disasters. The real-time data on these sites is highly helpful in gaining situational awareness and planning aid efforts. However, disaster-related messages are immersed in a high volume of irrelevant information. The situational data of disaster events also vary greatly in terms of information types ranging from general situational awareness (caution, infrastructure damage, casualties) to individual needs or not related to the crisis. It thus requires efficient methods to handle data overload and prioritize various types of information. This paper proposes an interpretable classification-summarization framework that first classifies tweets into different disaster-related categories and then summarizes those tweets. Unlike existing work, our classification model can provide explanations or rationales for its decisions. In the summarization phase, we employ an Integer Linear Programming (ILP) based optimization technique along with the help of rationales to generate summaries of event categories. Extensive evaluation on large-scale disaster events shows (a). our model can classify tweets into disaster-related categories with an 85% Macro F1 score and high interpretability (b). the summarizer achieves (5-25%) improvement in terms of ROUGE-1 F-score over most state-of-the-art approaches.

KW - Classification

KW - Crisis Events

KW - Interpretability

KW - Summarization

UR - http://www.scopus.com/inward/record.url?scp=85129805838&partnerID=8YFLogxK

U2 - 10.1145/3485447.3512259

DO - 10.1145/3485447.3512259

M3 - Conference contribution

AN - SCOPUS:85129805838

SP - 3641

EP - 3650

BT - WWW '22

T2 - 31st ACM World Wide Web Conference, WWW 2022

Y2 - 25 April 2022 through 29 April 2022

ER -