Building and querying semantic layers for web archives (extended version)

Publikation: Beitrag in FachzeitschriftArtikelForschungPeer-Review

Autoren

Organisationseinheiten

Forschungs-netzwerk anzeigen

Details

OriginalspracheEnglisch
Seiten (von - bis)149-167
Seitenumfang19
FachzeitschriftInternational Journal on Digital Libraries
Jahrgang21
Ausgabenummer2
Frühes Online-Datum5 Juli 2018
PublikationsstatusVeröffentlicht - Juni 2020

Abstract

Web archiving is the process of collecting portions of the Web to ensure that the information is preserved for future exploitation. However, despite the increasing number of web archives worldwide, the absence of efficient and meaningful exploration methods still remains a major hurdle in the way of turning them into a usable and useful information source. In this paper, we focus on this problem and propose an RDF/S model and a distributed framework for building semantic profiles (“layers”) that describe semantic information about the contents of web archives. A semantic layer allows describing metadata information about the archived documents, annotating them with useful semantic information (like entities, concepts, and events), and publishing all these data on the Web as Linked Data. Such structured repositories offer advanced query and integration capabilities, and make web archives directly exploitable by other systems and tools. To demonstrate their query capabilities, we build and query semantic layers for three different types of web archives. An experimental evaluation showed that a semantic layer can answer information needs that existing keyword-based systems are not able to sufficiently satisfy.

ASJC Scopus Sachgebiete

Zitieren

Building and querying semantic layers for web archives (extended version). / Fafalios, Pavlos; Holzmann, Helge; Kasturia, Vaibhav et al.
in: International Journal on Digital Libraries, Jahrgang 21, Nr. 2, 06.2020, S. 149-167.

Publikation: Beitrag in FachzeitschriftArtikelForschungPeer-Review

Fafalios, P, Holzmann, H, Kasturia, V & Nejdl, W 2020, 'Building and querying semantic layers for web archives (extended version)', International Journal on Digital Libraries, Jg. 21, Nr. 2, S. 149-167. https://doi.org/10.48550/arXiv.1810.10455, https://doi.org/10.1007/s00799-018-0251-0
Fafalios, P., Holzmann, H., Kasturia, V., & Nejdl, W. (2020). Building and querying semantic layers for web archives (extended version). International Journal on Digital Libraries, 21(2), 149-167. https://doi.org/10.48550/arXiv.1810.10455, https://doi.org/10.1007/s00799-018-0251-0
Fafalios P, Holzmann H, Kasturia V, Nejdl W. Building and querying semantic layers for web archives (extended version). International Journal on Digital Libraries. 2020 Jun;21(2):149-167. Epub 2018 Jul 5. doi: 10.48550/arXiv.1810.10455, 10.1007/s00799-018-0251-0
Fafalios, Pavlos ; Holzmann, Helge ; Kasturia, Vaibhav et al. / Building and querying semantic layers for web archives (extended version). in: International Journal on Digital Libraries. 2020 ; Jahrgang 21, Nr. 2. S. 149-167.
Download
@article{3b1079ce58024008992494411e387ede,
title = "Building and querying semantic layers for web archives (extended version)",
abstract = "Web archiving is the process of collecting portions of the Web to ensure that the information is preserved for future exploitation. However, despite the increasing number of web archives worldwide, the absence of efficient and meaningful exploration methods still remains a major hurdle in the way of turning them into a usable and useful information source. In this paper, we focus on this problem and propose an RDF/S model and a distributed framework for building semantic profiles (“layers”) that describe semantic information about the contents of web archives. A semantic layer allows describing metadata information about the archived documents, annotating them with useful semantic information (like entities, concepts, and events), and publishing all these data on the Web as Linked Data. Such structured repositories offer advanced query and integration capabilities, and make web archives directly exploitable by other systems and tools. To demonstrate their query capabilities, we build and query semantic layers for three different types of web archives. An experimental evaluation showed that a semantic layer can answer information needs that existing keyword-based systems are not able to sufficiently satisfy.",
keywords = "Exploratory search, Linked data, Profiling, Semantic layer, Web archives",
author = "Pavlos Fafalios and Helge Holzmann and Vaibhav Kasturia and Wolfgang Nejdl",
note = "Funding information: The work was partially funded by the European Commission for the ERC Advanced Grant ALEXANDRIA (No. 339233).",
year = "2020",
month = jun,
doi = "10.48550/arXiv.1810.10455",
language = "English",
volume = "21",
pages = "149--167",
number = "2",

}

Download

TY - JOUR

T1 - Building and querying semantic layers for web archives (extended version)

AU - Fafalios, Pavlos

AU - Holzmann, Helge

AU - Kasturia, Vaibhav

AU - Nejdl, Wolfgang

N1 - Funding information: The work was partially funded by the European Commission for the ERC Advanced Grant ALEXANDRIA (No. 339233).

PY - 2020/6

Y1 - 2020/6

N2 - Web archiving is the process of collecting portions of the Web to ensure that the information is preserved for future exploitation. However, despite the increasing number of web archives worldwide, the absence of efficient and meaningful exploration methods still remains a major hurdle in the way of turning them into a usable and useful information source. In this paper, we focus on this problem and propose an RDF/S model and a distributed framework for building semantic profiles (“layers”) that describe semantic information about the contents of web archives. A semantic layer allows describing metadata information about the archived documents, annotating them with useful semantic information (like entities, concepts, and events), and publishing all these data on the Web as Linked Data. Such structured repositories offer advanced query and integration capabilities, and make web archives directly exploitable by other systems and tools. To demonstrate their query capabilities, we build and query semantic layers for three different types of web archives. An experimental evaluation showed that a semantic layer can answer information needs that existing keyword-based systems are not able to sufficiently satisfy.

AB - Web archiving is the process of collecting portions of the Web to ensure that the information is preserved for future exploitation. However, despite the increasing number of web archives worldwide, the absence of efficient and meaningful exploration methods still remains a major hurdle in the way of turning them into a usable and useful information source. In this paper, we focus on this problem and propose an RDF/S model and a distributed framework for building semantic profiles (“layers”) that describe semantic information about the contents of web archives. A semantic layer allows describing metadata information about the archived documents, annotating them with useful semantic information (like entities, concepts, and events), and publishing all these data on the Web as Linked Data. Such structured repositories offer advanced query and integration capabilities, and make web archives directly exploitable by other systems and tools. To demonstrate their query capabilities, we build and query semantic layers for three different types of web archives. An experimental evaluation showed that a semantic layer can answer information needs that existing keyword-based systems are not able to sufficiently satisfy.

KW - Exploratory search

KW - Linked data

KW - Profiling

KW - Semantic layer

KW - Web archives

UR - http://www.scopus.com/inward/record.url?scp=85049598774&partnerID=8YFLogxK

U2 - 10.48550/arXiv.1810.10455

DO - 10.48550/arXiv.1810.10455

M3 - Article

AN - SCOPUS:85049598774

VL - 21

SP - 149

EP - 167

JO - International Journal on Digital Libraries

JF - International Journal on Digital Libraries

SN - 1432-5012

IS - 2

ER -

Von denselben Autoren