Tempas: Temporal Archive Search Based on Tags

Research output: Chapter in book/report/conference proceedingConference contributionResearch

Authors

  • Helge Holzmann
  • Avishek Anand

Research Organisations

View graph of relations

Details

Original languageEnglish
Title of host publicationWWW '16 Companion: Proceedings of the 25th International Conference Companion on World Wide Web
Pages207-210
Number of pages4
ISBN (electronic)9781450341448
Publication statusPublished - 2016
Event25th World Wide Web Conference - Montréal, Canada
Duration: 11 Apr 201615 Apr 2016

Abstract

Limited search and access patterns over Web archives have been well documented. One of the key reasons is the lack of understanding of the user access patterns over such collections, which in turn is attributed to the lack of effective search interfaces. Current search interfaces for Web archives are (a) either purely navigational or (b) have sub-optimal search experience due to ineffective retrieval models or query modeling. We identify that external longitudinal resources, such as social bookmarking data, are crucial sources to identify important and popular websites in the past. To this extent we present Tempas, a tag-based temporal search engine for Web archives. Websites are posted at specific times of interest on several external platforms, such as bookmarking sites like Delicious. Attached tags not only act as relevant descriptors useful for retrieval, but also encode the time of relevance. With Tempas we tackle the challenge of temporally searching a Web archive by indexing tags and time. We allow temporal selections for search terms, rank documents based on their popularity and also provide meaningful query recommendations by exploiting tag-tag and tag-document co-occurrence statistics in arbitrary time windows. Finally, Tempas operates as a fairly non-invasive indexing framework. By not dealing with contents from the actual Web archive it constitutes an attractive and low-overhead approach for quick access into Web archives.

Keywords

    cs.IR, web archives, search, temporal

ASJC Scopus subject areas

Cite this

Tempas: Temporal Archive Search Based on Tags. / Holzmann, Helge; Anand, Avishek.
WWW '16 Companion: Proceedings of the 25th International Conference Companion on World Wide Web. 2016. p. 207-210.

Research output: Chapter in book/report/conference proceedingConference contributionResearch

Holzmann, H & Anand, A 2016, Tempas: Temporal Archive Search Based on Tags. in WWW '16 Companion: Proceedings of the 25th International Conference Companion on World Wide Web. pp. 207-210, 25th World Wide Web Conference, Montréal, Canada, 11 Apr 2016. https://doi.org/10.1145/2872518.2890555
Holzmann, H., & Anand, A. (2016). Tempas: Temporal Archive Search Based on Tags. In WWW '16 Companion: Proceedings of the 25th International Conference Companion on World Wide Web (pp. 207-210) https://doi.org/10.1145/2872518.2890555
Holzmann H, Anand A. Tempas: Temporal Archive Search Based on Tags. In WWW '16 Companion: Proceedings of the 25th International Conference Companion on World Wide Web. 2016. p. 207-210 doi: 10.1145/2872518.2890555
Holzmann, Helge ; Anand, Avishek. / Tempas: Temporal Archive Search Based on Tags. WWW '16 Companion: Proceedings of the 25th International Conference Companion on World Wide Web. 2016. pp. 207-210
Download
@inproceedings{a30108f2b4064061bf3305708caa3fa7,
title = "Tempas: Temporal Archive Search Based on Tags",
abstract = " Limited search and access patterns over Web archives have been well documented. One of the key reasons is the lack of understanding of the user access patterns over such collections, which in turn is attributed to the lack of effective search interfaces. Current search interfaces for Web archives are (a) either purely navigational or (b) have sub-optimal search experience due to ineffective retrieval models or query modeling. We identify that external longitudinal resources, such as social bookmarking data, are crucial sources to identify important and popular websites in the past. To this extent we present Tempas, a tag-based temporal search engine for Web archives. Websites are posted at specific times of interest on several external platforms, such as bookmarking sites like Delicious. Attached tags not only act as relevant descriptors useful for retrieval, but also encode the time of relevance. With Tempas we tackle the challenge of temporally searching a Web archive by indexing tags and time. We allow temporal selections for search terms, rank documents based on their popularity and also provide meaningful query recommendations by exploiting tag-tag and tag-document co-occurrence statistics in arbitrary time windows. Finally, Tempas operates as a fairly non-invasive indexing framework. By not dealing with contents from the actual Web archive it constitutes an attractive and low-overhead approach for quick access into Web archives. ",
keywords = "cs.IR, web archives, search, temporal",
author = "Helge Holzmann and Avishek Anand",
note = "Funding information: ?This work is partly funded by the European Research Council under ALEXANDRIA (ERC 339233); 25th World Wide Web Conference ; Conference date: 11-04-2016 Through 15-04-2016",
year = "2016",
doi = "10.1145/2872518.2890555",
language = "English",
isbn = "978-1-4503-4144-8",
pages = "207--210",
booktitle = "WWW '16 Companion: Proceedings of the 25th International Conference Companion on World Wide Web",

}

Download

TY - GEN

T1 - Tempas: Temporal Archive Search Based on Tags

AU - Holzmann, Helge

AU - Anand, Avishek

N1 - Funding information: ?This work is partly funded by the European Research Council under ALEXANDRIA (ERC 339233)

PY - 2016

Y1 - 2016

N2 - Limited search and access patterns over Web archives have been well documented. One of the key reasons is the lack of understanding of the user access patterns over such collections, which in turn is attributed to the lack of effective search interfaces. Current search interfaces for Web archives are (a) either purely navigational or (b) have sub-optimal search experience due to ineffective retrieval models or query modeling. We identify that external longitudinal resources, such as social bookmarking data, are crucial sources to identify important and popular websites in the past. To this extent we present Tempas, a tag-based temporal search engine for Web archives. Websites are posted at specific times of interest on several external platforms, such as bookmarking sites like Delicious. Attached tags not only act as relevant descriptors useful for retrieval, but also encode the time of relevance. With Tempas we tackle the challenge of temporally searching a Web archive by indexing tags and time. We allow temporal selections for search terms, rank documents based on their popularity and also provide meaningful query recommendations by exploiting tag-tag and tag-document co-occurrence statistics in arbitrary time windows. Finally, Tempas operates as a fairly non-invasive indexing framework. By not dealing with contents from the actual Web archive it constitutes an attractive and low-overhead approach for quick access into Web archives.

AB - Limited search and access patterns over Web archives have been well documented. One of the key reasons is the lack of understanding of the user access patterns over such collections, which in turn is attributed to the lack of effective search interfaces. Current search interfaces for Web archives are (a) either purely navigational or (b) have sub-optimal search experience due to ineffective retrieval models or query modeling. We identify that external longitudinal resources, such as social bookmarking data, are crucial sources to identify important and popular websites in the past. To this extent we present Tempas, a tag-based temporal search engine for Web archives. Websites are posted at specific times of interest on several external platforms, such as bookmarking sites like Delicious. Attached tags not only act as relevant descriptors useful for retrieval, but also encode the time of relevance. With Tempas we tackle the challenge of temporally searching a Web archive by indexing tags and time. We allow temporal selections for search terms, rank documents based on their popularity and also provide meaningful query recommendations by exploiting tag-tag and tag-document co-occurrence statistics in arbitrary time windows. Finally, Tempas operates as a fairly non-invasive indexing framework. By not dealing with contents from the actual Web archive it constitutes an attractive and low-overhead approach for quick access into Web archives.

KW - cs.IR

KW - web archives

KW - search

KW - temporal

UR - http://www.scopus.com/inward/record.url?scp=85027998924&partnerID=8YFLogxK

U2 - 10.1145/2872518.2890555

DO - 10.1145/2872518.2890555

M3 - Conference contribution

SN - 978-1-4503-4144-8

SP - 207

EP - 210

BT - WWW '16 Companion: Proceedings of the 25th International Conference Companion on World Wide Web

T2 - 25th World Wide Web Conference

Y2 - 11 April 2016 through 15 April 2016

ER -