MapSDI: A scaled-up semantic data integration framework for knowledge graph creation

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Autoren

  • Samaneh Jozashoori
  • Maria Esther Vidal

Organisationseinheiten

Externe Organisationen

  • Technische Informationsbibliothek (TIB) Leibniz-Informationszentrum Technik und Naturwissenschaften und Universitätsbibliothek
Forschungs-netzwerk anzeigen

Details

OriginalspracheEnglisch
Titel des SammelwerksOn the Move to Meaningful Internet Systems
UntertitelOTM 2019 Conferences - Confederated International Conferences: CoopIS, ODBASE, C and TC 2019, Rhodes, Greece, October 21–25, 2019, Proceedings
Herausgeber/-innenHervé Panetto, Christophe Debruyne, Dave Lewis, Martin Hepp, Claudio Agostino Ardagna, Robert Meersman
Seiten58-75
Seitenumfang18
Auflage1.
ISBN (elektronisch)978-3-030-33246-4
PublikationsstatusVeröffentlicht - 11 Okt. 2019
VeranstaltungConfederated International Conferences on Cooperative Information Systems, CoopIS 2019, Ontologies, Databases, and Applications of Semantics, ODBASE 2019, and Cloud and Trusted Computing, C and TC, held as part of OTM 2019 - Rhodes, Griechenland
Dauer: 21 Okt. 201925 Okt. 2019

Publikationsreihe

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Band11877 LNCS
ISSN (Print)0302-9743
ISSN (elektronisch)1611-3349

Abstract

Semantic web technologies have significantly contributed with effective solutions for the problems of data integration and knowledge graph creation. However, with the rapid growth of big data in diverse domains, different interoperability issues still demand to be addressed, being scalability one of the main challenges. In this paper, we address the problem of knowledge graph creation at scale and provide MapSDI, a mapping rule-based framework for optimizing semantic data integration into knowledge graphs. MapSDI allows for the semantic enrichment of large-sized, heterogeneous, and potentially low-quality data efficiently. The input of MapSDI is a set of data sources and mapping rules being generated by a mapping language such as RML. First, MapSDI pre-processes the sources based on semantic information extracted from mapping rules, by performing basic database operators; it projects out required attributes, eliminates duplicates, and selects relevant entries. All these operators are defined based on the knowledge encoded by the mapping rules which will be then used by the semantification engine (or RDFizer) to produce a knowledge graph. We have empirically studied the impact of MapSDI on existing RDFizers, and observed that knowledge graph creation time can be reduced on average in one order of magnitude. It is also shown, theoretically, that the sources and rules transformations provided by MapSDI are data-lossless.

ASJC Scopus Sachgebiete

Zitieren

MapSDI: A scaled-up semantic data integration framework for knowledge graph creation. / Jozashoori, Samaneh; Vidal, Maria Esther.
On the Move to Meaningful Internet Systems: OTM 2019 Conferences - Confederated International Conferences: CoopIS, ODBASE, C and TC 2019, Rhodes, Greece, October 21–25, 2019, Proceedings. Hrsg. / Hervé Panetto; Christophe Debruyne; Dave Lewis; Martin Hepp; Claudio Agostino Ardagna; Robert Meersman. 1. Aufl. 2019. S. 58-75 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Band 11877 LNCS).

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Jozashoori, S & Vidal, ME 2019, MapSDI: A scaled-up semantic data integration framework for knowledge graph creation. in H Panetto, C Debruyne, D Lewis, M Hepp, CA Ardagna & R Meersman (Hrsg.), On the Move to Meaningful Internet Systems: OTM 2019 Conferences - Confederated International Conferences: CoopIS, ODBASE, C and TC 2019, Rhodes, Greece, October 21–25, 2019, Proceedings. 1. Aufl., Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Bd. 11877 LNCS, S. 58-75, Confederated International Conferences on Cooperative Information Systems, CoopIS 2019, Ontologies, Databases, and Applications of Semantics, ODBASE 2019, and Cloud and Trusted Computing, C and TC, held as part of OTM 2019, Rhodes, Griechenland, 21 Okt. 2019. https://doi.org/10.1007/978-3-030-33246-4_4
Jozashoori, S., & Vidal, M. E. (2019). MapSDI: A scaled-up semantic data integration framework for knowledge graph creation. In H. Panetto, C. Debruyne, D. Lewis, M. Hepp, C. A. Ardagna, & R. Meersman (Hrsg.), On the Move to Meaningful Internet Systems: OTM 2019 Conferences - Confederated International Conferences: CoopIS, ODBASE, C and TC 2019, Rhodes, Greece, October 21–25, 2019, Proceedings (1. Aufl., S. 58-75). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Band 11877 LNCS). https://doi.org/10.1007/978-3-030-33246-4_4
Jozashoori S, Vidal ME. MapSDI: A scaled-up semantic data integration framework for knowledge graph creation. in Panetto H, Debruyne C, Lewis D, Hepp M, Ardagna CA, Meersman R, Hrsg., On the Move to Meaningful Internet Systems: OTM 2019 Conferences - Confederated International Conferences: CoopIS, ODBASE, C and TC 2019, Rhodes, Greece, October 21–25, 2019, Proceedings. 1. Aufl. 2019. S. 58-75. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-030-33246-4_4
Jozashoori, Samaneh ; Vidal, Maria Esther. / MapSDI : A scaled-up semantic data integration framework for knowledge graph creation. On the Move to Meaningful Internet Systems: OTM 2019 Conferences - Confederated International Conferences: CoopIS, ODBASE, C and TC 2019, Rhodes, Greece, October 21–25, 2019, Proceedings. Hrsg. / Hervé Panetto ; Christophe Debruyne ; Dave Lewis ; Martin Hepp ; Claudio Agostino Ardagna ; Robert Meersman. 1. Aufl. 2019. S. 58-75 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
Download
@inproceedings{42b3f87b871746f284631f37aa1a8f9d,
title = "MapSDI: A scaled-up semantic data integration framework for knowledge graph creation",
abstract = "Semantic web technologies have significantly contributed with effective solutions for the problems of data integration and knowledge graph creation. However, with the rapid growth of big data in diverse domains, different interoperability issues still demand to be addressed, being scalability one of the main challenges. In this paper, we address the problem of knowledge graph creation at scale and provide MapSDI, a mapping rule-based framework for optimizing semantic data integration into knowledge graphs. MapSDI allows for the semantic enrichment of large-sized, heterogeneous, and potentially low-quality data efficiently. The input of MapSDI is a set of data sources and mapping rules being generated by a mapping language such as RML. First, MapSDI pre-processes the sources based on semantic information extracted from mapping rules, by performing basic database operators; it projects out required attributes, eliminates duplicates, and selects relevant entries. All these operators are defined based on the knowledge encoded by the mapping rules which will be then used by the semantification engine (or RDFizer) to produce a knowledge graph. We have empirically studied the impact of MapSDI on existing RDFizers, and observed that knowledge graph creation time can be reduced on average in one order of magnitude. It is also shown, theoretically, that the sources and rules transformations provided by MapSDI are data-lossless.",
keywords = "Data integration system, Knowledge graph creation, Semantic data integration, Transformation rules",
author = "Samaneh Jozashoori and Vidal, {Maria Esther}",
note = "Funding information: This work has been partially funded by the EU H2020 Program for the Project No. 727658 (IASIS).; Confederated International Conferences on Cooperative Information Systems, CoopIS 2019, Ontologies, Databases, and Applications of Semantics, ODBASE 2019, and Cloud and Trusted Computing, C and TC, held as part of OTM 2019 ; Conference date: 21-10-2019 Through 25-10-2019",
year = "2019",
month = oct,
day = "11",
doi = "10.1007/978-3-030-33246-4_4",
language = "English",
isbn = "978-3-030-33245-7",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "58--75",
editor = "Herv{\'e} Panetto and Christophe Debruyne and Dave Lewis and Martin Hepp and Ardagna, {Claudio Agostino} and Robert Meersman",
booktitle = "On the Move to Meaningful Internet Systems",
edition = "1.",

}

Download

TY - GEN

T1 - MapSDI

T2 - Confederated International Conferences on Cooperative Information Systems, CoopIS 2019, Ontologies, Databases, and Applications of Semantics, ODBASE 2019, and Cloud and Trusted Computing, C and TC, held as part of OTM 2019

AU - Jozashoori, Samaneh

AU - Vidal, Maria Esther

N1 - Funding information: This work has been partially funded by the EU H2020 Program for the Project No. 727658 (IASIS).

PY - 2019/10/11

Y1 - 2019/10/11

N2 - Semantic web technologies have significantly contributed with effective solutions for the problems of data integration and knowledge graph creation. However, with the rapid growth of big data in diverse domains, different interoperability issues still demand to be addressed, being scalability one of the main challenges. In this paper, we address the problem of knowledge graph creation at scale and provide MapSDI, a mapping rule-based framework for optimizing semantic data integration into knowledge graphs. MapSDI allows for the semantic enrichment of large-sized, heterogeneous, and potentially low-quality data efficiently. The input of MapSDI is a set of data sources and mapping rules being generated by a mapping language such as RML. First, MapSDI pre-processes the sources based on semantic information extracted from mapping rules, by performing basic database operators; it projects out required attributes, eliminates duplicates, and selects relevant entries. All these operators are defined based on the knowledge encoded by the mapping rules which will be then used by the semantification engine (or RDFizer) to produce a knowledge graph. We have empirically studied the impact of MapSDI on existing RDFizers, and observed that knowledge graph creation time can be reduced on average in one order of magnitude. It is also shown, theoretically, that the sources and rules transformations provided by MapSDI are data-lossless.

AB - Semantic web technologies have significantly contributed with effective solutions for the problems of data integration and knowledge graph creation. However, with the rapid growth of big data in diverse domains, different interoperability issues still demand to be addressed, being scalability one of the main challenges. In this paper, we address the problem of knowledge graph creation at scale and provide MapSDI, a mapping rule-based framework for optimizing semantic data integration into knowledge graphs. MapSDI allows for the semantic enrichment of large-sized, heterogeneous, and potentially low-quality data efficiently. The input of MapSDI is a set of data sources and mapping rules being generated by a mapping language such as RML. First, MapSDI pre-processes the sources based on semantic information extracted from mapping rules, by performing basic database operators; it projects out required attributes, eliminates duplicates, and selects relevant entries. All these operators are defined based on the knowledge encoded by the mapping rules which will be then used by the semantification engine (or RDFizer) to produce a knowledge graph. We have empirically studied the impact of MapSDI on existing RDFizers, and observed that knowledge graph creation time can be reduced on average in one order of magnitude. It is also shown, theoretically, that the sources and rules transformations provided by MapSDI are data-lossless.

KW - Data integration system

KW - Knowledge graph creation

KW - Semantic data integration

KW - Transformation rules

UR - http://www.scopus.com/inward/record.url?scp=85077889555&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-33246-4_4

DO - 10.1007/978-3-030-33246-4_4

M3 - Conference contribution

AN - SCOPUS:85077889555

SN - 978-3-030-33245-7

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 58

EP - 75

BT - On the Move to Meaningful Internet Systems

A2 - Panetto, Hervé

A2 - Debruyne, Christophe

A2 - Lewis, Dave

A2 - Hepp, Martin

A2 - Ardagna, Claudio Agostino

A2 - Meersman, Robert

Y2 - 21 October 2019 through 25 October 2019

ER -