Building and Querying Semantic Layers for Web Archives

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

Research Organisations

View graph of relations

Details

Original languageEnglish
Title of host publication2017 ACM/IEEE Joint Conference on Digital Libraries, JCDL 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (electronic)9781538638613
Publication statusPublished - 25 Jul 2017
Event17th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL) - Toronto, Canada
Duration: 19 Jun 201723 Jun 2017

Publication series

NameProceedings of the ACM/IEEE Joint Conference on Digital Libraries
ISSN (Print)1552-5996

Abstract

Web archiving is the process of collecting portions of the Web to ensure that the information is preserved for future exploitation. However, despite the increasing number of web archives worldwide, the absence of efficient and meaningful exploration methods still remains a major hurdle in the way of turning them into a usable and useful information source. In this paper, we focus on this problem and propose an RDF/S model and a distributed framework for building semantic profiles (layers) that describe semantic information about the contents of web archives. A semantic layer allows describing metadata information about the archived documents, annotating them with useful semantic information (like entities, concepts and events), and publishing all this data on the Web as Linked Data. Such structured repositories offer advanced query and integration capabilities and make web archives directly exploitable by other systems and tools. To demonstrate their query capabilities, we build and query semantic layers for three different types of web archives. An experimental evaluation showed that a semantic layer can answer information needs that existing keyword-based systems are not able to sufficiently satisfy.

ASJC Scopus subject areas

Cite this

Building and Querying Semantic Layers for Web Archives. / Fafalios, Pavlos; Holzmann, Helge; Kasturia, Vaibhav et al.
2017 ACM/IEEE Joint Conference on Digital Libraries, JCDL 2017. Institute of Electrical and Electronics Engineers Inc., 2017. 7991555 (Proceedings of the ACM/IEEE Joint Conference on Digital Libraries).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Fafalios, P, Holzmann, H, Kasturia, V & Nejdl, W 2017, Building and Querying Semantic Layers for Web Archives. in 2017 ACM/IEEE Joint Conference on Digital Libraries, JCDL 2017., 7991555, Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, Institute of Electrical and Electronics Engineers Inc., 17th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL), Toronto, Canada, 19 Jun 2017. https://doi.org/10.1109/jcdl.2017.7991555
Fafalios, P., Holzmann, H., Kasturia, V., & Nejdl, W. (2017). Building and Querying Semantic Layers for Web Archives. In 2017 ACM/IEEE Joint Conference on Digital Libraries, JCDL 2017 Article 7991555 (Proceedings of the ACM/IEEE Joint Conference on Digital Libraries). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/jcdl.2017.7991555
Fafalios P, Holzmann H, Kasturia V, Nejdl W. Building and Querying Semantic Layers for Web Archives. In 2017 ACM/IEEE Joint Conference on Digital Libraries, JCDL 2017. Institute of Electrical and Electronics Engineers Inc. 2017. 7991555. (Proceedings of the ACM/IEEE Joint Conference on Digital Libraries). doi: 10.1109/jcdl.2017.7991555
Fafalios, Pavlos ; Holzmann, Helge ; Kasturia, Vaibhav et al. / Building and Querying Semantic Layers for Web Archives. 2017 ACM/IEEE Joint Conference on Digital Libraries, JCDL 2017. Institute of Electrical and Electronics Engineers Inc., 2017. (Proceedings of the ACM/IEEE Joint Conference on Digital Libraries).
Download
@inproceedings{d89e01c583d4482d9038df5cd136cb93,
title = "Building and Querying Semantic Layers for Web Archives",
abstract = "Web archiving is the process of collecting portions of the Web to ensure that the information is preserved for future exploitation. However, despite the increasing number of web archives worldwide, the absence of efficient and meaningful exploration methods still remains a major hurdle in the way of turning them into a usable and useful information source. In this paper, we focus on this problem and propose an RDF/S model and a distributed framework for building semantic profiles (layers) that describe semantic information about the contents of web archives. A semantic layer allows describing metadata information about the archived documents, annotating them with useful semantic information (like entities, concepts and events), and publishing all this data on the Web as Linked Data. Such structured repositories offer advanced query and integration capabilities and make web archives directly exploitable by other systems and tools. To demonstrate their query capabilities, we build and query semantic layers for three different types of web archives. An experimental evaluation showed that a semantic layer can answer information needs that existing keyword-based systems are not able to sufficiently satisfy.",
author = "Pavlos Fafalios and Helge Holzmann and Vaibhav Kasturia and Wolfgang Nejdl",
note = "Funding information:. The work was partially funded by the European Commission for the ERC Advanced Grant ALEXANDRIA (No. 339233).; 17th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL), JCDL ; Conference date: 19-06-2017 Through 23-06-2017",
year = "2017",
month = jul,
day = "25",
doi = "10.1109/jcdl.2017.7991555",
language = "English",
series = "Proceedings of the ACM/IEEE Joint Conference on Digital Libraries",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
booktitle = "2017 ACM/IEEE Joint Conference on Digital Libraries, JCDL 2017",
address = "United States",

}

Download

TY - GEN

T1 - Building and Querying Semantic Layers for Web Archives

AU - Fafalios, Pavlos

AU - Holzmann, Helge

AU - Kasturia, Vaibhav

AU - Nejdl, Wolfgang

N1 - Funding information:. The work was partially funded by the European Commission for the ERC Advanced Grant ALEXANDRIA (No. 339233).

PY - 2017/7/25

Y1 - 2017/7/25

N2 - Web archiving is the process of collecting portions of the Web to ensure that the information is preserved for future exploitation. However, despite the increasing number of web archives worldwide, the absence of efficient and meaningful exploration methods still remains a major hurdle in the way of turning them into a usable and useful information source. In this paper, we focus on this problem and propose an RDF/S model and a distributed framework for building semantic profiles (layers) that describe semantic information about the contents of web archives. A semantic layer allows describing metadata information about the archived documents, annotating them with useful semantic information (like entities, concepts and events), and publishing all this data on the Web as Linked Data. Such structured repositories offer advanced query and integration capabilities and make web archives directly exploitable by other systems and tools. To demonstrate their query capabilities, we build and query semantic layers for three different types of web archives. An experimental evaluation showed that a semantic layer can answer information needs that existing keyword-based systems are not able to sufficiently satisfy.

AB - Web archiving is the process of collecting portions of the Web to ensure that the information is preserved for future exploitation. However, despite the increasing number of web archives worldwide, the absence of efficient and meaningful exploration methods still remains a major hurdle in the way of turning them into a usable and useful information source. In this paper, we focus on this problem and propose an RDF/S model and a distributed framework for building semantic profiles (layers) that describe semantic information about the contents of web archives. A semantic layer allows describing metadata information about the archived documents, annotating them with useful semantic information (like entities, concepts and events), and publishing all this data on the Web as Linked Data. Such structured repositories offer advanced query and integration capabilities and make web archives directly exploitable by other systems and tools. To demonstrate their query capabilities, we build and query semantic layers for three different types of web archives. An experimental evaluation showed that a semantic layer can answer information needs that existing keyword-based systems are not able to sufficiently satisfy.

UR - http://www.scopus.com/inward/record.url?scp=85027982024&partnerID=8YFLogxK

U2 - 10.1109/jcdl.2017.7991555

DO - 10.1109/jcdl.2017.7991555

M3 - Conference contribution

AN - SCOPUS:85027982024

T3 - Proceedings of the ACM/IEEE Joint Conference on Digital Libraries

BT - 2017 ACM/IEEE Joint Conference on Digital Libraries, JCDL 2017

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 17th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL)

Y2 - 19 June 2017 through 23 June 2017

ER -

By the same author(s)