Loading [MathJax]/extensions/tex2jax.js

Fragmenting Data Strategies to Scale Up the Knowledge Graph Creation

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

  • Enrique Iglesias
  • Ahmad Sakor
  • Philipp D. Rohde
  • Valentina Janev
  • Maria Esther Vidal

External Research Organisations

  • German National Library of Science and Technology (TIB)
  • University of Belgrade

Details

Original languageEnglish
Title of host publicationSemantic Intelligence
Subtitle of host publicationSelect Proceedings of ISIC 2023
EditorsSarika Jain, Nandana Mihindukulasooriya, Valentina Janev, Cogan Matthew Shimizu
PublisherSpringer Science and Business Media Deutschland GmbH
Pages11-23
Number of pages13
ISBN (electronic)978-981-97-7356-5
ISBN (print)9789819773558
Publication statusPublished - 29 Dec 2024
Event3rd International Semantic Intelligence Conference, ISIC 2023 - Braga, Portugal
Duration: 17 Oct 202319 Oct 2023

Publication series

NameLecture Notes in Electrical Engineering
Volume1258
ISSN (Print)1876-1100
ISSN (electronic)1876-1119

Abstract

In recent years, the exponential growth of data has necessitated a unified schema to harmonize diverse data sources. This is where knowledge graphs (KGs) come into play. However, the creation of KGs introduces new challenges, such as handling large and heterogeneous input data and complex mappings. These challenges can lead to reduced scalability due to the significant memory consumption and extended execution times involved. We present KatanaG, a framework designed to streamline KG creation in complex scenarios, including large data sources and intricate mapping. KatanaG optimizes memory usage and execution time. When applied alongside various KG creation engines, our results indicate that KatanaG can improve the performance of these engines, by reducing execution time by up to 80% and achieve 70% memory savings.

Keywords

    Data fragmentation, Knowledge graph creation

ASJC Scopus subject areas

Cite this

Fragmenting Data Strategies to Scale Up the Knowledge Graph Creation. / Iglesias, Enrique; Sakor, Ahmad; Rohde, Philipp D. et al.
Semantic Intelligence : Select Proceedings of ISIC 2023. ed. / Sarika Jain; Nandana Mihindukulasooriya; Valentina Janev; Cogan Matthew Shimizu. Springer Science and Business Media Deutschland GmbH, 2024. p. 11-23 (Lecture Notes in Electrical Engineering; Vol. 1258).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Iglesias, E, Sakor, A, Rohde, PD, Janev, V & Vidal, ME 2024, Fragmenting Data Strategies to Scale Up the Knowledge Graph Creation. in S Jain, N Mihindukulasooriya, V Janev & CM Shimizu (eds), Semantic Intelligence : Select Proceedings of ISIC 2023. Lecture Notes in Electrical Engineering, vol. 1258, Springer Science and Business Media Deutschland GmbH, pp. 11-23, 3rd International Semantic Intelligence Conference, ISIC 2023, Braga, Portugal, 17 Oct 2023. https://doi.org/10.1007/978-981-97-7356-5_2
Iglesias, E., Sakor, A., Rohde, P. D., Janev, V., & Vidal, M. E. (2024). Fragmenting Data Strategies to Scale Up the Knowledge Graph Creation. In S. Jain, N. Mihindukulasooriya, V. Janev, & C. M. Shimizu (Eds.), Semantic Intelligence : Select Proceedings of ISIC 2023 (pp. 11-23). (Lecture Notes in Electrical Engineering; Vol. 1258). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-981-97-7356-5_2
Iglesias E, Sakor A, Rohde PD, Janev V, Vidal ME. Fragmenting Data Strategies to Scale Up the Knowledge Graph Creation. In Jain S, Mihindukulasooriya N, Janev V, Shimizu CM, editors, Semantic Intelligence : Select Proceedings of ISIC 2023. Springer Science and Business Media Deutschland GmbH. 2024. p. 11-23. (Lecture Notes in Electrical Engineering). doi: 10.1007/978-981-97-7356-5_2
Iglesias, Enrique ; Sakor, Ahmad ; Rohde, Philipp D. et al. / Fragmenting Data Strategies to Scale Up the Knowledge Graph Creation. Semantic Intelligence : Select Proceedings of ISIC 2023. editor / Sarika Jain ; Nandana Mihindukulasooriya ; Valentina Janev ; Cogan Matthew Shimizu. Springer Science and Business Media Deutschland GmbH, 2024. pp. 11-23 (Lecture Notes in Electrical Engineering).
Download
@inproceedings{9f6f59b631ce491aa22ce6af7f617b45,
title = "Fragmenting Data Strategies to Scale Up the Knowledge Graph Creation",
abstract = "In recent years, the exponential growth of data has necessitated a unified schema to harmonize diverse data sources. This is where knowledge graphs (KGs) come into play. However, the creation of KGs introduces new challenges, such as handling large and heterogeneous input data and complex mappings. These challenges can lead to reduced scalability due to the significant memory consumption and extended execution times involved. We present KatanaG, a framework designed to streamline KG creation in complex scenarios, including large data sources and intricate mapping. KatanaG optimizes memory usage and execution time. When applied alongside various KG creation engines, our results indicate that KatanaG can improve the performance of these engines, by reducing execution time by up to 80% and achieve 70% memory savings.",
keywords = "Data fragmentation, Knowledge graph creation",
author = "Enrique Iglesias and Ahmad Sakor and Rohde, {Philipp D.} and Valentina Janev and Vidal, {Maria Esther}",
note = "Publisher Copyright: {\textcopyright} The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.; 3rd International Semantic Intelligence Conference, ISIC 2023 ; Conference date: 17-10-2023 Through 19-10-2023",
year = "2024",
month = dec,
day = "29",
doi = "10.1007/978-981-97-7356-5_2",
language = "English",
isbn = "9789819773558",
series = "Lecture Notes in Electrical Engineering",
publisher = "Springer Science and Business Media Deutschland GmbH",
pages = "11--23",
editor = "Sarika Jain and Nandana Mihindukulasooriya and Valentina Janev and Shimizu, {Cogan Matthew}",
booktitle = "Semantic Intelligence",
address = "Germany",

}

Download

TY - GEN

T1 - Fragmenting Data Strategies to Scale Up the Knowledge Graph Creation

AU - Iglesias, Enrique

AU - Sakor, Ahmad

AU - Rohde, Philipp D.

AU - Janev, Valentina

AU - Vidal, Maria Esther

N1 - Publisher Copyright: © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.

PY - 2024/12/29

Y1 - 2024/12/29

N2 - In recent years, the exponential growth of data has necessitated a unified schema to harmonize diverse data sources. This is where knowledge graphs (KGs) come into play. However, the creation of KGs introduces new challenges, such as handling large and heterogeneous input data and complex mappings. These challenges can lead to reduced scalability due to the significant memory consumption and extended execution times involved. We present KatanaG, a framework designed to streamline KG creation in complex scenarios, including large data sources and intricate mapping. KatanaG optimizes memory usage and execution time. When applied alongside various KG creation engines, our results indicate that KatanaG can improve the performance of these engines, by reducing execution time by up to 80% and achieve 70% memory savings.

AB - In recent years, the exponential growth of data has necessitated a unified schema to harmonize diverse data sources. This is where knowledge graphs (KGs) come into play. However, the creation of KGs introduces new challenges, such as handling large and heterogeneous input data and complex mappings. These challenges can lead to reduced scalability due to the significant memory consumption and extended execution times involved. We present KatanaG, a framework designed to streamline KG creation in complex scenarios, including large data sources and intricate mapping. KatanaG optimizes memory usage and execution time. When applied alongside various KG creation engines, our results indicate that KatanaG can improve the performance of these engines, by reducing execution time by up to 80% and achieve 70% memory savings.

KW - Data fragmentation

KW - Knowledge graph creation

UR - http://www.scopus.com/inward/record.url?scp=85215266783&partnerID=8YFLogxK

U2 - 10.1007/978-981-97-7356-5_2

DO - 10.1007/978-981-97-7356-5_2

M3 - Conference contribution

AN - SCOPUS:85215266783

SN - 9789819773558

T3 - Lecture Notes in Electrical Engineering

SP - 11

EP - 23

BT - Semantic Intelligence

A2 - Jain, Sarika

A2 - Mihindukulasooriya, Nandana

A2 - Janev, Valentina

A2 - Shimizu, Cogan Matthew

PB - Springer Science and Business Media Deutschland GmbH

T2 - 3rd International Semantic Intelligence Conference, ISIC 2023

Y2 - 17 October 2023 through 19 October 2023

ER -