Understanding the Diversity of Tweets in the Time of Outbreaks

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

Research Organisations

View graph of relations

Details

Original languageEnglish
Title of host publicationWWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web
Pages1335-1342
Number of pages8
Publication statusPublished - 13 May 2013
Event22nd International Conference on World Wide Web - Rio de Janeiro, Brazil
Duration: 13 May 201317 May 2013
Conference number: 22

Publication series

NameWWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web

Abstract

A microblogging service like Twitter continues to surge in importance as a means of sharing information in social networks. In the medical domain, several works have shown the potential of detecting public health events (i.e., infectious disease outbreaks) using Twitter messages or tweets. Given its real-time nature, Twitter can enhance early outbreak warning for public health authorities in order that a rapid response can take place. Most of previous works on detecting outbreaks in Twitter simply analyze tweets matched disease names and/or locations of interests. However, the effectiveness of such method is limited for two main reasons. First, disease names are highly ambiguous, i.e., referring slangs or non health-related contexts. Second, the characteristics of infectious diseases are highly dynamic in time and place, namely, strongly time-dependent and vary greatly among different regions. In this paper, we propose to analyze the temporal diversity of tweets during the known periods of real-world outbreaks in order to gain insight into a temporary focus on specific events. More precisely, our objective is to understand whether the temporal diversity of tweets can be used as indicators of outbreak events, and to which extent. We employ an efficient algorithm based on sampling to compute the diversity statistics of tweets at particular time. To this end, we conduct experiments by correlating temporal diversity with the estimated event magnitude of 14 real-world outbreak events manually created as ground truth. Our analysis shows that correlation results are diverse among different outbreaks, which can reflect the characteristics (severity and duration) of outbreaks.

Keywords

    Event detection, Outbreak events, Temporal diversity, Twitter, Web observatory

ASJC Scopus subject areas

Sustainable Development Goals

Cite this

Understanding the Diversity of Tweets in the Time of Outbreaks. / Kanhabua, Nattiya; Nejdl, Wolfgang.
WWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web. 2013. p. 1335-1342 (WWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Kanhabua, N & Nejdl, W 2013, Understanding the Diversity of Tweets in the Time of Outbreaks. in WWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web. WWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web, pp. 1335-1342, 22nd International Conference on World Wide Web, Rio de Janeiro, Brazil, 13 May 2013. <https://dl.acm.org/doi/pdf/10.1145/2487788.2488172>
Kanhabua, N., & Nejdl, W. (2013). Understanding the Diversity of Tweets in the Time of Outbreaks. In WWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web (pp. 1335-1342). (WWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web). https://dl.acm.org/doi/pdf/10.1145/2487788.2488172
Kanhabua N, Nejdl W. Understanding the Diversity of Tweets in the Time of Outbreaks. In WWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web. 2013. p. 1335-1342. (WWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web).
Kanhabua, Nattiya ; Nejdl, Wolfgang. / Understanding the Diversity of Tweets in the Time of Outbreaks. WWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web. 2013. pp. 1335-1342 (WWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web).
Download
@inproceedings{3ca4a4a9ccee4dfbba48b8596294e35d,
title = "Understanding the Diversity of Tweets in the Time of Outbreaks",
abstract = "A microblogging service like Twitter continues to surge in importance as a means of sharing information in social networks. In the medical domain, several works have shown the potential of detecting public health events (i.e., infectious disease outbreaks) using Twitter messages or tweets. Given its real-time nature, Twitter can enhance early outbreak warning for public health authorities in order that a rapid response can take place. Most of previous works on detecting outbreaks in Twitter simply analyze tweets matched disease names and/or locations of interests. However, the effectiveness of such method is limited for two main reasons. First, disease names are highly ambiguous, i.e., referring slangs or non health-related contexts. Second, the characteristics of infectious diseases are highly dynamic in time and place, namely, strongly time-dependent and vary greatly among different regions. In this paper, we propose to analyze the temporal diversity of tweets during the known periods of real-world outbreaks in order to gain insight into a temporary focus on specific events. More precisely, our objective is to understand whether the temporal diversity of tweets can be used as indicators of outbreak events, and to which extent. We employ an efficient algorithm based on sampling to compute the diversity statistics of tweets at particular time. To this end, we conduct experiments by correlating temporal diversity with the estimated event magnitude of 14 real-world outbreak events manually created as ground truth. Our analysis shows that correlation results are diverse among different outbreaks, which can reflect the characteristics (severity and duration) of outbreaks.",
keywords = "Event detection, Outbreak events, Temporal diversity, Twitter, Web observatory",
author = "Nattiya Kanhabua and Wolfgang Nejdl",
year = "2013",
month = may,
day = "13",
language = "English",
isbn = "9781450320382",
series = "WWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web",
pages = "1335--1342",
booktitle = "WWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web",
note = "22nd International Conference on World Wide Web, WWW 2013 ; Conference date: 13-05-2013 Through 17-05-2013",

}

Download

TY - GEN

T1 - Understanding the Diversity of Tweets in the Time of Outbreaks

AU - Kanhabua, Nattiya

AU - Nejdl, Wolfgang

N1 - Conference code: 22

PY - 2013/5/13

Y1 - 2013/5/13

N2 - A microblogging service like Twitter continues to surge in importance as a means of sharing information in social networks. In the medical domain, several works have shown the potential of detecting public health events (i.e., infectious disease outbreaks) using Twitter messages or tweets. Given its real-time nature, Twitter can enhance early outbreak warning for public health authorities in order that a rapid response can take place. Most of previous works on detecting outbreaks in Twitter simply analyze tweets matched disease names and/or locations of interests. However, the effectiveness of such method is limited for two main reasons. First, disease names are highly ambiguous, i.e., referring slangs or non health-related contexts. Second, the characteristics of infectious diseases are highly dynamic in time and place, namely, strongly time-dependent and vary greatly among different regions. In this paper, we propose to analyze the temporal diversity of tweets during the known periods of real-world outbreaks in order to gain insight into a temporary focus on specific events. More precisely, our objective is to understand whether the temporal diversity of tweets can be used as indicators of outbreak events, and to which extent. We employ an efficient algorithm based on sampling to compute the diversity statistics of tweets at particular time. To this end, we conduct experiments by correlating temporal diversity with the estimated event magnitude of 14 real-world outbreak events manually created as ground truth. Our analysis shows that correlation results are diverse among different outbreaks, which can reflect the characteristics (severity and duration) of outbreaks.

AB - A microblogging service like Twitter continues to surge in importance as a means of sharing information in social networks. In the medical domain, several works have shown the potential of detecting public health events (i.e., infectious disease outbreaks) using Twitter messages or tweets. Given its real-time nature, Twitter can enhance early outbreak warning for public health authorities in order that a rapid response can take place. Most of previous works on detecting outbreaks in Twitter simply analyze tweets matched disease names and/or locations of interests. However, the effectiveness of such method is limited for two main reasons. First, disease names are highly ambiguous, i.e., referring slangs or non health-related contexts. Second, the characteristics of infectious diseases are highly dynamic in time and place, namely, strongly time-dependent and vary greatly among different regions. In this paper, we propose to analyze the temporal diversity of tweets during the known periods of real-world outbreaks in order to gain insight into a temporary focus on specific events. More precisely, our objective is to understand whether the temporal diversity of tweets can be used as indicators of outbreak events, and to which extent. We employ an efficient algorithm based on sampling to compute the diversity statistics of tweets at particular time. To this end, we conduct experiments by correlating temporal diversity with the estimated event magnitude of 14 real-world outbreak events manually created as ground truth. Our analysis shows that correlation results are diverse among different outbreaks, which can reflect the characteristics (severity and duration) of outbreaks.

KW - Event detection

KW - Outbreak events

KW - Temporal diversity

KW - Twitter

KW - Web observatory

UR - http://www.scopus.com/inward/record.url?scp=84893154676&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84893154676

SN - 9781450320382

T3 - WWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web

SP - 1335

EP - 1342

BT - WWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web

T2 - 22nd International Conference on World Wide Web

Y2 - 13 May 2013 through 17 May 2013

ER -

By the same author(s)