Using subspace analysis for event detection from web click-through data

Research output: Chapter in book/report/conference proceeding; Conference contribution; Research; peer review

Authors

Research Organisations

External Research Organisations

  • Nanyang Technological University (NTU)

Details

Original language: English
Title of host publication: Proceeding of the 17th International Conference on World Wide Web 2008, WWW'08
Publisher: Association for Computing Machinery (ACM)
Pages: 1067-1068
Number of pages: 2
ISBN (print): 9781605580852
Publication status: Published - 21 Apr 2008
Event: 17th International Conference on World Wide Web 2008, WWW'08 - Beijing, China
Duration: 21 Apr 2008 - 25 Apr 2008

Publication series

Name: Proceeding of the 17th International Conference on World Wide Web 2008, WWW'08

Abstract

Although most existing research detects events by analyzing the content or structural information of Web documents, a recent direction is to study usage data. In this paper, we focus on detecting events from Web click-through data generated by Web search engines. We propose a novel approach that effectively detects events from click-through data based on robust subspace analysis. We first transform click-through data to the 2D polar space. Next, an algorithm based on Generalized Principal Component Analysis (GPCA) is used to estimate subspaces of the transformed data such that each subspace contains query sessions of similar topics. Then, we prune uninteresting subspaces, which do not contain query sessions corresponding to real events, by considering both the semantic certainty and the temporal certainty of query sessions in each subspace. Finally, various events are detected from the interesting subspaces using a nonparametric clustering technique. Experimental results on real-life click-through data show that, compared with existing approaches, the proposed approach is more accurate in detecting real events and more effective in determining the number of events.
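
The first two steps of the abstract (polar transform, then GPCA-style subspace estimation) can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not specify the transform, so the mapping below (a topic score in [0, 1) becomes the angle, a timestamp becomes the radius) is an assumption, as are all function names. The sketch uses the basic noise-free GPCA idea for lines through the origin in the plane: fit one vanishing polynomial through a Veronese embedding, then read each point's subspace normal from the polynomial's gradient. The number of subspaces n is assumed known here, and the robust estimation, pruning, and clustering steps are not covered.

```python
import numpy as np

def to_polar_plane(topic_score, timestamp):
    """Hypothetical transform (an assumption, not taken from the paper):
    topic score in [0, 1) becomes the angle, timestamp becomes the radius."""
    theta = np.pi * np.asarray(topic_score, dtype=float)
    radius = np.asarray(timestamp, dtype=float)
    return np.column_stack((radius * np.cos(theta), radius * np.sin(theta)))

def veronese(points, n):
    """Degree-n Veronese embedding of 2D points: [x^n, x^(n-1) y, ..., y^n]."""
    x, y = points[:, 0], points[:, 1]
    return np.column_stack([x ** (n - k) * y ** k for k in range(n + 1)])

def gpca_lines(points, n, tol=1e-6):
    """Basic noise-free GPCA for n lines through the origin in the plane.
    Returns one unit normal per point and an integer subspace label per point."""
    x, y = points[:, 0], points[:, 1]
    # Coefficients of the polynomial that vanishes on all n lines:
    # the right singular vector with the smallest singular value.
    _, _, vt = np.linalg.svd(veronese(points, n))
    c = vt[-1]
    # The gradient of the polynomial at a point is normal to that point's line.
    dx = sum(c[k] * (n - k) * x ** (n - k - 1) * y ** k for k in range(n))
    dy = sum(c[k] * k * x ** (n - k) * y ** (k - 1) for k in range(1, n + 1))
    normals = np.column_stack((dx, dy))
    normals /= np.linalg.norm(normals, axis=1, keepdims=True)
    # Group points whose normals agree up to sign.
    labels = np.full(len(points), -1)
    for i in range(len(points)):
        if labels[i] == -1:
            same = (np.abs(normals @ normals[i]) > 1 - tol) & (labels == -1)
            labels[same] = labels.max() + 1
    return normals, labels

# Synthetic query sessions: three topics, five timestamps each (made-up data).
scores = np.repeat([0.1, 0.4, 0.8], 5)
times = np.tile(np.arange(1.0, 6.0), 3)
points = to_polar_plane(scores, times)
normals, labels = gpca_lines(points, n=3)
print(labels)
```

Under this assumed mapping, sessions on the same topic share an angle and therefore lie on one line through the origin, which is why one-dimensional subspaces are the natural model in the sketch. Real click-through data is noisy, so a practical version would need a robust fit and a way to choose n.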

Keywords

    Click-through data, Event detection, GPCA, Subspace estimation

Cite this

Using subspace analysis for event detection from web click-through data. / Ling, Chen; Yiqun, Hu; Nejdl, Wolfgang.
Proceeding of the 17th International Conference on World Wide Web 2008, WWW'08. Association for Computing Machinery (ACM), 2008. p. 1067-1068 (Proceeding of the 17th International Conference on World Wide Web 2008, WWW'08).

Ling, C., Yiqun, H., & Nejdl, W. (2008). Using subspace analysis for event detection from web click-through data. In Proceeding of the 17th International Conference on World Wide Web 2008, WWW'08 (pp. 1067-1068). (Proceeding of the 17th International Conference on World Wide Web 2008, WWW'08). Association for Computing Machinery (ACM). https://doi.org/10.1145/1367497.1367659
@inproceedings{543bf65d7013415d8cfcbd9d36ecc27b,
title = "Using subspace analysis for event detection from web click-through data",
abstract = "Although most of existing research usually detects events by analyzing the content or structural information of Web documents, a recent direction is to study the usage data. In this paper, we focus on detecting events from Web click-through data generated by Web search engines. We propose a novel approach which effectively detects events from click-through data based on robust subspace analysis. We first transform click-through data to the 2D polar space. Next, an algorithm based on Generalized Principal Component Analysis (GPCA) is used to estimate subspaces of transformed data such that each subspace contains query sessions of similar topics. Then, we prune uninteresting subspaces which do not contain query sessions corresponding to real events by considering both the semantic certainty and the temporal certainty of query sessions in each subspace. Finally, various events are detected from interesting subspaces by utilizing a nonparametric clustering technique. Compared with existing approaches, our experimental results based on real-life click-through data have shown that the proposed approach is more accurate in detecting real events and more effective in determining the number of events.",
keywords = "Click-through data, Event detection, GPCA, Subspace estimation",
author = "Chen Ling and Hu Yiqun and Wolfgang Nejdl",
year = "2008",
month = apr,
day = "21",
doi = "10.1145/1367497.1367659",
language = "English",
isbn = "9781605580852",
series = "Proceeding of the 17th International Conference on World Wide Web 2008, WWW'08",
publisher = "Association for Computing Machinery (ACM)",
pages = "1067--1068",
booktitle = "Proceeding of the 17th International Conference on World Wide Web 2008, WWW'08",
address = "United States",
note = "17th International Conference on World Wide Web 2008, WWW'08 ; Conference date: 21-04-2008 Through 25-04-2008",

}

TY - GEN
T1 - Using subspace analysis for event detection from web click-through data
AU - Ling, Chen
AU - Yiqun, Hu
AU - Nejdl, Wolfgang
PY - 2008/4/21
Y1 - 2008/4/21
AB - Although most of existing research usually detects events by analyzing the content or structural information of Web documents, a recent direction is to study the usage data. In this paper, we focus on detecting events from Web click-through data generated by Web search engines. We propose a novel approach which effectively detects events from click-through data based on robust subspace analysis. We first transform click-through data to the 2D polar space. Next, an algorithm based on Generalized Principal Component Analysis (GPCA) is used to estimate subspaces of transformed data such that each subspace contains query sessions of similar topics. Then, we prune uninteresting subspaces which do not contain query sessions corresponding to real events by considering both the semantic certainty and the temporal certainty of query sessions in each subspace. Finally, various events are detected from interesting subspaces by utilizing a nonparametric clustering technique. Compared with existing approaches, our experimental results based on real-life click-through data have shown that the proposed approach is more accurate in detecting real events and more effective in determining the number of events.
KW - Click-through data
KW - Event detection
KW - GPCA
KW - Subspace estimation
UR - http://www.scopus.com/inward/record.url?scp=57349100301&partnerID=8YFLogxK
U2 - 10.1145/1367497.1367659
DO - 10.1145/1367497.1367659
M3 - Conference contribution
AN - SCOPUS:57349100301
SN - 9781605580852
T3 - Proceeding of the 17th International Conference on World Wide Web 2008, WWW'08
SP - 1067
EP - 1068
BT - Proceeding of the 17th International Conference on World Wide Web 2008, WWW'08
PB - Association for Computing Machinery (ACM)
T2 - 17th International Conference on World Wide Web 2008, WWW'08
Y2 - 21 April 2008 through 25 April 2008
ER -
