Details
Original language | English |
---|---|
Title of host publication | WWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web |
Pages | 1335-1342 |
Number of pages | 8 |
Publication status | Published - 13 May 2013 |
Event | 22nd International Conference on World Wide Web - Rio de Janeiro, Brazil Duration: 13 May 2013 → 17 May 2013 Conference number: 22 |
Publication series
Name | WWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web |
---|
Abstract
A microblogging service like Twitter continues to surge in importance as a means of sharing information in social networks. In the medical domain, several works have shown the potential of detecting public health events (i.e., infectious disease outbreaks) using Twitter messages or tweets. Given its real-time nature, Twitter can enhance early outbreak warning for public health authorities in order that a rapid response can take place. Most of previous works on detecting outbreaks in Twitter simply analyze tweets matched disease names and/or locations of interests. However, the effectiveness of such method is limited for two main reasons. First, disease names are highly ambiguous, i.e., referring slangs or non health-related contexts. Second, the characteristics of infectious diseases are highly dynamic in time and place, namely, strongly time-dependent and vary greatly among different regions. In this paper, we propose to analyze the temporal diversity of tweets during the known periods of real-world outbreaks in order to gain insight into a temporary focus on specific events. More precisely, our objective is to understand whether the temporal diversity of tweets can be used as indicators of outbreak events, and to which extent. We employ an efficient algorithm based on sampling to compute the diversity statistics of tweets at particular time. To this end, we conduct experiments by correlating temporal diversity with the estimated event magnitude of 14 real-world outbreak events manually created as ground truth. Our analysis shows that correlation results are diverse among different outbreaks, which can reflect the characteristics (severity and duration) of outbreaks.
Keywords
- Event detection, Outbreak events, Temporal diversity, Twitter, Web observatory
ASJC Scopus subject areas
- Computer Science(all)
- Computer Networks and Communications
Sustainable Development Goals
Cite this
- Standard
- Harvard
- Apa
- Vancouver
- BibTeX
- RIS
WWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web. 2013. p. 1335-1342 (WWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web).
Research output: Chapter in book/report/conference proceeding › Conference contribution › Research › peer review
}
TY - GEN
T1 - Understanding the Diversity of Tweets in the Time of Outbreaks
AU - Kanhabua, Nattiya
AU - Nejdl, Wolfgang
N1 - Conference code: 22
PY - 2013/5/13
Y1 - 2013/5/13
N2 - A microblogging service like Twitter continues to surge in importance as a means of sharing information in social networks. In the medical domain, several works have shown the potential of detecting public health events (i.e., infectious disease outbreaks) using Twitter messages or tweets. Given its real-time nature, Twitter can enhance early outbreak warning for public health authorities in order that a rapid response can take place. Most of previous works on detecting outbreaks in Twitter simply analyze tweets matched disease names and/or locations of interests. However, the effectiveness of such method is limited for two main reasons. First, disease names are highly ambiguous, i.e., referring slangs or non health-related contexts. Second, the characteristics of infectious diseases are highly dynamic in time and place, namely, strongly time-dependent and vary greatly among different regions. In this paper, we propose to analyze the temporal diversity of tweets during the known periods of real-world outbreaks in order to gain insight into a temporary focus on specific events. More precisely, our objective is to understand whether the temporal diversity of tweets can be used as indicators of outbreak events, and to which extent. We employ an efficient algorithm based on sampling to compute the diversity statistics of tweets at particular time. To this end, we conduct experiments by correlating temporal diversity with the estimated event magnitude of 14 real-world outbreak events manually created as ground truth. Our analysis shows that correlation results are diverse among different outbreaks, which can reflect the characteristics (severity and duration) of outbreaks.
AB - A microblogging service like Twitter continues to surge in importance as a means of sharing information in social networks. In the medical domain, several works have shown the potential of detecting public health events (i.e., infectious disease outbreaks) using Twitter messages or tweets. Given its real-time nature, Twitter can enhance early outbreak warning for public health authorities in order that a rapid response can take place. Most of previous works on detecting outbreaks in Twitter simply analyze tweets matched disease names and/or locations of interests. However, the effectiveness of such method is limited for two main reasons. First, disease names are highly ambiguous, i.e., referring slangs or non health-related contexts. Second, the characteristics of infectious diseases are highly dynamic in time and place, namely, strongly time-dependent and vary greatly among different regions. In this paper, we propose to analyze the temporal diversity of tweets during the known periods of real-world outbreaks in order to gain insight into a temporary focus on specific events. More precisely, our objective is to understand whether the temporal diversity of tweets can be used as indicators of outbreak events, and to which extent. We employ an efficient algorithm based on sampling to compute the diversity statistics of tweets at particular time. To this end, we conduct experiments by correlating temporal diversity with the estimated event magnitude of 14 real-world outbreak events manually created as ground truth. Our analysis shows that correlation results are diverse among different outbreaks, which can reflect the characteristics (severity and duration) of outbreaks.
KW - Event detection
KW - Outbreak events
KW - Temporal diversity
KW - Twitter
KW - Web observatory
UR - http://www.scopus.com/inward/record.url?scp=84893154676&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84893154676
SN - 9781450320382
T3 - WWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web
SP - 1335
EP - 1342
BT - WWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web
T2 - 22nd International Conference on World Wide Web
Y2 - 13 May 2013 through 17 May 2013
ER -