DivQ: Diversification for keyword search over structured databases

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

Research Organisations

External Research Organisations

  • Commonwealth Scientific and Industrial Research Organisation (CSIRO)
View graph of relations

Details

Original languageEnglish
Title of host publicationSIGIR 2010 Proceedings - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
Pages331-338
Number of pages8
Publication statusPublished - 9 Jul 2010
Event33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2010 - Geneva, Switzerland
Duration: 19 Jul 201023 Jul 2010

Publication series

NameSIGIR 2010 Proceedings - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval

Abstract

Keyword queries over structured databases are notoriously ambiguous. No single interpretation of a keyword query can satisfy all users, and multiple interpretations may yield overlapping results. This paper proposes a scheme to balance the relevance and novelty of keyword search results over structured databases. Firstly, we present a probabilistic model which effectively ranks the possible interpretations of a keyword query over structured data. Then, we introduce a scheme to diversify the search results by re-ranking query interpretations, taking into account redundancy of query results. Finally, we propose α-nDCG-W and WS-recall, an adaptation of α-nDCG and S-recall metrics, taking into account graded relevance of subtopics. Our evaluation on two real-world datasets demonstrates that search results obtained using the proposed diversification algorithms better characterize possible answers available in the database than the results of the initial relevance ranking.

Keywords

    Diversity, Query intent, Ranking in databases

ASJC Scopus subject areas

Cite this

DivQ: Diversification for keyword search over structured databases. / Demidova, Elena; Fankhauser, Peter; Zhou, Xuan et al.
SIGIR 2010 Proceedings - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 2010. p. 331-338 (SIGIR 2010 Proceedings - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Demidova, E, Fankhauser, P, Zhou, X & Nejdl, W 2010, DivQ: Diversification for keyword search over structured databases. in SIGIR 2010 Proceedings - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. SIGIR 2010 Proceedings - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 331-338, 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2010, Geneva, Switzerland, 19 Jul 2010. https://doi.org/10.1145/1835449.1835506
Demidova, E., Fankhauser, P., Zhou, X., & Nejdl, W. (2010). DivQ: Diversification for keyword search over structured databases. In SIGIR 2010 Proceedings - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 331-338). (SIGIR 2010 Proceedings - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval). https://doi.org/10.1145/1835449.1835506
Demidova E, Fankhauser P, Zhou X, Nejdl W. DivQ: Diversification for keyword search over structured databases. In SIGIR 2010 Proceedings - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 2010. p. 331-338. (SIGIR 2010 Proceedings - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval). doi: 10.1145/1835449.1835506
Demidova, Elena ; Fankhauser, Peter ; Zhou, Xuan et al. / DivQ : Diversification for keyword search over structured databases. SIGIR 2010 Proceedings - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 2010. pp. 331-338 (SIGIR 2010 Proceedings - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval).
Download
@inproceedings{d5725cd7c0584262a734b33982d4a466,
title = "DivQ: Diversification for keyword search over structured databases",
abstract = "Keyword queries over structured databases are notoriously ambiguous. No single interpretation of a keyword query can satisfy all users, and multiple interpretations may yield overlapping results. This paper proposes a scheme to balance the relevance and novelty of keyword search results over structured databases. Firstly, we present a probabilistic model which effectively ranks the possible interpretations of a keyword query over structured data. Then, we introduce a scheme to diversify the search results by re-ranking query interpretations, taking into account redundancy of query results. Finally, we propose α-nDCG-W and WS-recall, an adaptation of α-nDCG and S-recall metrics, taking into account graded relevance of subtopics. Our evaluation on two real-world datasets demonstrates that search results obtained using the proposed diversification algorithms better characterize possible answers available in the database than the results of the initial relevance ranking.",
keywords = "Diversity, Query intent, Ranking in databases",
author = "Elena Demidova and Peter Fankhauser and Xuan Zhou and Wolfgang Nejdl",
year = "2010",
month = jul,
day = "9",
doi = "10.1145/1835449.1835506",
language = "English",
isbn = "9781605588964",
series = "SIGIR 2010 Proceedings - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval",
pages = "331--338",
booktitle = "SIGIR 2010 Proceedings - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval",
note = "33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2010 ; Conference date: 19-07-2010 Through 23-07-2010",

}

Download

TY - GEN

T1 - DivQ

T2 - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2010

AU - Demidova, Elena

AU - Fankhauser, Peter

AU - Zhou, Xuan

AU - Nejdl, Wolfgang

PY - 2010/7/9

Y1 - 2010/7/9

N2 - Keyword queries over structured databases are notoriously ambiguous. No single interpretation of a keyword query can satisfy all users, and multiple interpretations may yield overlapping results. This paper proposes a scheme to balance the relevance and novelty of keyword search results over structured databases. Firstly, we present a probabilistic model which effectively ranks the possible interpretations of a keyword query over structured data. Then, we introduce a scheme to diversify the search results by re-ranking query interpretations, taking into account redundancy of query results. Finally, we propose α-nDCG-W and WS-recall, an adaptation of α-nDCG and S-recall metrics, taking into account graded relevance of subtopics. Our evaluation on two real-world datasets demonstrates that search results obtained using the proposed diversification algorithms better characterize possible answers available in the database than the results of the initial relevance ranking.

AB - Keyword queries over structured databases are notoriously ambiguous. No single interpretation of a keyword query can satisfy all users, and multiple interpretations may yield overlapping results. This paper proposes a scheme to balance the relevance and novelty of keyword search results over structured databases. Firstly, we present a probabilistic model which effectively ranks the possible interpretations of a keyword query over structured data. Then, we introduce a scheme to diversify the search results by re-ranking query interpretations, taking into account redundancy of query results. Finally, we propose α-nDCG-W and WS-recall, an adaptation of α-nDCG and S-recall metrics, taking into account graded relevance of subtopics. Our evaluation on two real-world datasets demonstrates that search results obtained using the proposed diversification algorithms better characterize possible answers available in the database than the results of the initial relevance ranking.

KW - Diversity

KW - Query intent

KW - Ranking in databases

UR - http://www.scopus.com/inward/record.url?scp=77956014301&partnerID=8YFLogxK

U2 - 10.1145/1835449.1835506

DO - 10.1145/1835449.1835506

M3 - Conference contribution

AN - SCOPUS:77956014301

SN - 9781605588964

T3 - SIGIR 2010 Proceedings - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval

SP - 331

EP - 338

BT - SIGIR 2010 Proceedings - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval

Y2 - 19 July 2010 through 23 July 2010

ER -

By the same author(s)