Progressive distributed top-k retrieval in peer-to-peer networks

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

Research Organisations

External Research Organisations

  • University of California at Berkeley
View graph of relations

Details

Original languageEnglish
Title of host publicationProceedings
Subtitle of host publication21st International Conference on Data Engineering, ICDE 2005
Pages174-185
Number of pages12
Publication statusPublished - 18 Apr 2005
Event21st International Conference on Data Engineering, ICDE 2005 - Tokyo, Japan
Duration: 5 Apr 20058 Apr 2005

Publication series

NameProceedings - International Conference on Data Engineering
ISSN (Print)1084-4627

Abstract

Query processing in traditional information management systems has moved from an exact match model to more flexible paradigms allowing cooperative retrieval by aggregating the database objects' degree of match for each different query predicate and returning the best matching objects only. In peer-to-peer systems such strategies are even more important, given the potentially large number of peers, which may contribute to the results. Yet current peer-to-peer research has barely started to investigate such approaches. In this paper we will discuss the benefits of best match/top-k queries in the context of distributed peer-to-peer information infrastructures and show how to extend the limited query processing in current peer-to-peer networks by allowing the distributed processing of top-k queries, while maintaining a minimum of data traffic. Relying on a super-peer backbone organized in the HyperCuP topology we will show how to use local indexes for optimizing the necessary query routing and how to process intermediate results in inner network nodes at the earliest possible point in time cutting down the necessary data traffic within the network. Our algorithm is based on dynamically collected query statistics only, no continuous index update processes are necessary, allowing it to scale easily to large numbers of peers, as well as dynamic additions/deletions of peers. We will show our approach to always deliver correct result sets and to be optimal in terms of necessary object accesses and data traffic. Finally, we present simulation results for both static and dynamic network environments.

ASJC Scopus subject areas

Cite this

Progressive distributed top-k retrieval in peer-to-peer networks. / Balke, Wolf Tilo; Nejdl, Wolfgang; Siberski, Wolf et al.
Proceedings: 21st International Conference on Data Engineering, ICDE 2005. 2005. p. 174-185 (Proceedings - International Conference on Data Engineering).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Balke, WT, Nejdl, W, Siberski, W & Thaden, U 2005, Progressive distributed top-k retrieval in peer-to-peer networks. in Proceedings: 21st International Conference on Data Engineering, ICDE 2005. Proceedings - International Conference on Data Engineering, pp. 174-185, 21st International Conference on Data Engineering, ICDE 2005, Tokyo, Japan, 5 Apr 2005. https://doi.org/10.1109/ICDE.2005.115
Balke, W. T., Nejdl, W., Siberski, W., & Thaden, U. (2005). Progressive distributed top-k retrieval in peer-to-peer networks. In Proceedings: 21st International Conference on Data Engineering, ICDE 2005 (pp. 174-185). (Proceedings - International Conference on Data Engineering). https://doi.org/10.1109/ICDE.2005.115
Balke WT, Nejdl W, Siberski W, Thaden U. Progressive distributed top-k retrieval in peer-to-peer networks. In Proceedings: 21st International Conference on Data Engineering, ICDE 2005. 2005. p. 174-185. (Proceedings - International Conference on Data Engineering). doi: 10.1109/ICDE.2005.115
Balke, Wolf Tilo ; Nejdl, Wolfgang ; Siberski, Wolf et al. / Progressive distributed top-k retrieval in peer-to-peer networks. Proceedings: 21st International Conference on Data Engineering, ICDE 2005. 2005. pp. 174-185 (Proceedings - International Conference on Data Engineering).
Download
@inproceedings{b9690a98596c4a6e9f1c06db707f26cb,
title = "Progressive distributed top-k retrieval in peer-to-peer networks",
abstract = "Query processing in traditional information management systems has moved from an exact match model to more flexible paradigms allowing cooperative retrieval by aggregating the database objects' degree of match for each different query predicate and returning the best matching objects only. In peer-to-peer systems such strategies are even more important, given the potentially large number of peers, which may contribute to the results. Yet current peer-to-peer research has barely started to investigate such approaches. In this paper we will discuss the benefits of best match/top-k queries in the context of distributed peer-to-peer information infrastructures and show how to extend the limited query processing in current peer-to-peer networks by allowing the distributed processing of top-k queries, while maintaining a minimum of data traffic. Relying on a super-peer backbone organized in the HyperCuP topology we will show how to use local indexes for optimizing the necessary query routing and how to process intermediate results in inner network nodes at the earliest possible point in time cutting down the necessary data traffic within the network. Our algorithm is based on dynamically collected query statistics only, no continuous index update processes are necessary, allowing it to scale easily to large numbers of peers, as well as dynamic additions/deletions of peers. We will show our approach to always deliver correct result sets and to be optimal in terms of necessary object accesses and data traffic. Finally, we present simulation results for both static and dynamic network environments.",
author = "Balke, {Wolf Tilo} and Wolfgang Nejdl and Wolf Siberski and Uwe Thaden",
year = "2005",
month = apr,
day = "18",
doi = "10.1109/ICDE.2005.115",
language = "English",
isbn = "0769522858",
series = "Proceedings - International Conference on Data Engineering",
pages = "174--185",
booktitle = "Proceedings",
note = "21st International Conference on Data Engineering, ICDE 2005 ; Conference date: 05-04-2005 Through 08-04-2005",

}

Download

TY - GEN

T1 - Progressive distributed top-k retrieval in peer-to-peer networks

AU - Balke, Wolf Tilo

AU - Nejdl, Wolfgang

AU - Siberski, Wolf

AU - Thaden, Uwe

PY - 2005/4/18

Y1 - 2005/4/18

N2 - Query processing in traditional information management systems has moved from an exact match model to more flexible paradigms allowing cooperative retrieval by aggregating the database objects' degree of match for each different query predicate and returning the best matching objects only. In peer-to-peer systems such strategies are even more important, given the potentially large number of peers, which may contribute to the results. Yet current peer-to-peer research has barely started to investigate such approaches. In this paper we will discuss the benefits of best match/top-k queries in the context of distributed peer-to-peer information infrastructures and show how to extend the limited query processing in current peer-to-peer networks by allowing the distributed processing of top-k queries, while maintaining a minimum of data traffic. Relying on a super-peer backbone organized in the HyperCuP topology we will show how to use local indexes for optimizing the necessary query routing and how to process intermediate results in inner network nodes at the earliest possible point in time cutting down the necessary data traffic within the network. Our algorithm is based on dynamically collected query statistics only, no continuous index update processes are necessary, allowing it to scale easily to large numbers of peers, as well as dynamic additions/deletions of peers. We will show our approach to always deliver correct result sets and to be optimal in terms of necessary object accesses and data traffic. Finally, we present simulation results for both static and dynamic network environments.

AB - Query processing in traditional information management systems has moved from an exact match model to more flexible paradigms allowing cooperative retrieval by aggregating the database objects' degree of match for each different query predicate and returning the best matching objects only. In peer-to-peer systems such strategies are even more important, given the potentially large number of peers, which may contribute to the results. Yet current peer-to-peer research has barely started to investigate such approaches. In this paper we will discuss the benefits of best match/top-k queries in the context of distributed peer-to-peer information infrastructures and show how to extend the limited query processing in current peer-to-peer networks by allowing the distributed processing of top-k queries, while maintaining a minimum of data traffic. Relying on a super-peer backbone organized in the HyperCuP topology we will show how to use local indexes for optimizing the necessary query routing and how to process intermediate results in inner network nodes at the earliest possible point in time cutting down the necessary data traffic within the network. Our algorithm is based on dynamically collected query statistics only, no continuous index update processes are necessary, allowing it to scale easily to large numbers of peers, as well as dynamic additions/deletions of peers. We will show our approach to always deliver correct result sets and to be optimal in terms of necessary object accesses and data traffic. Finally, we present simulation results for both static and dynamic network environments.

UR - http://www.scopus.com/inward/record.url?scp=28444484619&partnerID=8YFLogxK

U2 - 10.1109/ICDE.2005.115

DO - 10.1109/ICDE.2005.115

M3 - Conference contribution

AN - SCOPUS:28444484619

SN - 0769522858

T3 - Proceedings - International Conference on Data Engineering

SP - 174

EP - 185

BT - Proceedings

T2 - 21st International Conference on Data Engineering, ICDE 2005

Y2 - 5 April 2005 through 8 April 2005

ER -

By the same author(s)