Details
Originalsprache | Englisch |
---|---|
Seiten (von - bis) | 149-167 |
Seitenumfang | 19 |
Fachzeitschrift | International Journal on Digital Libraries |
Jahrgang | 21 |
Ausgabenummer | 2 |
Frühes Online-Datum | 5 Juli 2018 |
Publikationsstatus | Veröffentlicht - Juni 2020 |
Abstract
Web archiving is the process of collecting portions of the Web to ensure that the information is preserved for future exploitation. However, despite the increasing number of web archives worldwide, the absence of efficient and meaningful exploration methods still remains a major hurdle in the way of turning them into a usable and useful information source. In this paper, we focus on this problem and propose an RDF/S model and a distributed framework for building semantic profiles (“layers”) that describe semantic information about the contents of web archives. A semantic layer allows describing metadata information about the archived documents, annotating them with useful semantic information (like entities, concepts, and events), and publishing all these data on the Web as Linked Data. Such structured repositories offer advanced query and integration capabilities, and make web archives directly exploitable by other systems and tools. To demonstrate their query capabilities, we build and query semantic layers for three different types of web archives. An experimental evaluation showed that a semantic layer can answer information needs that existing keyword-based systems are not able to sufficiently satisfy.
ASJC Scopus Sachgebiete
- Sozialwissenschaften (insg.)
- Bibliotheks- und Informationswissenschaften
Zitieren
- Standard
- Harvard
- Apa
- Vancouver
- BibTex
- RIS
in: International Journal on Digital Libraries, Jahrgang 21, Nr. 2, 06.2020, S. 149-167.
Publikation: Beitrag in Fachzeitschrift › Artikel › Forschung › Peer-Review
}
TY - JOUR
T1 - Building and querying semantic layers for web archives (extended version)
AU - Fafalios, Pavlos
AU - Holzmann, Helge
AU - Kasturia, Vaibhav
AU - Nejdl, Wolfgang
N1 - Funding information: The work was partially funded by the European Commission for the ERC Advanced Grant ALEXANDRIA (No. 339233).
PY - 2020/6
Y1 - 2020/6
N2 - Web archiving is the process of collecting portions of the Web to ensure that the information is preserved for future exploitation. However, despite the increasing number of web archives worldwide, the absence of efficient and meaningful exploration methods still remains a major hurdle in the way of turning them into a usable and useful information source. In this paper, we focus on this problem and propose an RDF/S model and a distributed framework for building semantic profiles (“layers”) that describe semantic information about the contents of web archives. A semantic layer allows describing metadata information about the archived documents, annotating them with useful semantic information (like entities, concepts, and events), and publishing all these data on the Web as Linked Data. Such structured repositories offer advanced query and integration capabilities, and make web archives directly exploitable by other systems and tools. To demonstrate their query capabilities, we build and query semantic layers for three different types of web archives. An experimental evaluation showed that a semantic layer can answer information needs that existing keyword-based systems are not able to sufficiently satisfy.
AB - Web archiving is the process of collecting portions of the Web to ensure that the information is preserved for future exploitation. However, despite the increasing number of web archives worldwide, the absence of efficient and meaningful exploration methods still remains a major hurdle in the way of turning them into a usable and useful information source. In this paper, we focus on this problem and propose an RDF/S model and a distributed framework for building semantic profiles (“layers”) that describe semantic information about the contents of web archives. A semantic layer allows describing metadata information about the archived documents, annotating them with useful semantic information (like entities, concepts, and events), and publishing all these data on the Web as Linked Data. Such structured repositories offer advanced query and integration capabilities, and make web archives directly exploitable by other systems and tools. To demonstrate their query capabilities, we build and query semantic layers for three different types of web archives. An experimental evaluation showed that a semantic layer can answer information needs that existing keyword-based systems are not able to sufficiently satisfy.
KW - Exploratory search
KW - Linked data
KW - Profiling
KW - Semantic layer
KW - Web archives
UR - http://www.scopus.com/inward/record.url?scp=85049598774&partnerID=8YFLogxK
U2 - 10.48550/arXiv.1810.10455
DO - 10.48550/arXiv.1810.10455
M3 - Article
AN - SCOPUS:85049598774
VL - 21
SP - 149
EP - 167
JO - International Journal on Digital Libraries
JF - International Journal on Digital Libraries
SN - 1432-5012
IS - 2
ER -